There are no reviews yet. Be the first to send feedback to the community and the maintainers!
990-xml-reader
IRSx: Turn the IRS' versioned XML 990 nonprofit annual tax returns into standardized python objects, json, or human readable text with original line number and description.whatwordwhere
Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.parsing-prickly-pdfs
NICAR 2016 talk about PDFs!990-xml-database
Django app to consume and store 990 data and metadatapdf17
nicar 17: advanced pdf manipulationirsx_cookbook
IRSX Cookbookpdf_bbox_utils
Helpers to create .csv files of word-level bounding boxes from text-based pdfs, or from hocr output.990-xml-metadata
metadata describing the 990 xml release, to be used by 990-xml-reader and related projectsplpython_textmatch
Add some fuzzy string match operations to postgreSQLpdf20
Advanced PDF manipulation with pdfplumber for NICAR 2020 / New Orleansdoc-wrangler
Noodle with document cloudtexas_rrc
some railroad commission oil / gas production filesreconcile-legislators
Test open refine reconciliation service to match legislators namespaper_fec
Parse the OCR'ed paper FEC filings (as well as the electronic ones)nicar-nonprofit-datarelease
Documentation for nonprofit data released at NICAR 2020easy-stats-113
Data from the census bureau's "easy stats" site--the first available on the 113th Congress.freefcc
house_disbursements
muck with sunlight house disbursement csvssenate_disbursements
process--partially--the senate clerk's report on spending.inspectfile
like inspectdb, but for filesirs_527
proces 527 data to csvslegacy_0809_acs_exporter
Legacy export of ACS processing from 2008 3-year ACS for R and PostgreSQL990-xml-admin
Keep tabs on 990 filingsfec_ftp
another bucket of scripts for grabbing the fec's ftp data etc for django + postgresLove Open Source and this site? Check out how you can help us