watson-word-watcher
A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions

journalism-syllabi
Computer-Assisted Reporting and Data Journalism Syllabuses, compiled by Dan Nguyen

abbyy-finereader-ocr-senate
Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms

github-for-portfolios
A layperson's step-by-step guide to building webpages with Github

python-notebooks-data-wrangling
Python 3.x notebooks about real-world data cleaning and visualization

facebook-trending-rss-fetcher
Python code to scrape and collect data from the RSS feeds Facebook uses to augment its Trending Section

smalldata_journalism
An online reference for data journalism

learn-data-csv-cli
A work-in-progress guide showing how and why you should learn command-line tools (xsv, csvkit) to work with data

bashfoo
My personally curated list of bash/command-line commands and snippets that are very useful yet I keep on forgetting

datajournalism-primer
a general list of resources and articles for people interested in getting into data journalism

congress-colleges
What fancy schools do U.S. legislators go to?

gis-geospatial-fun-python3x
Tracking my progress in doing GIS/Geospatial work in Python 3.x

nicar-2019-pdfplumbing
NICAR 2019 workshop on using Python and PDFplumber to extract text from PDFs

Congressmiles
A tutorial on using Face.com's and NYT Congress's API + Sunlight data

dannguyen.github.io
I'm making a Github Pages repo!

scrape-senate-financial-disclosures
looking at U.S. Senators' disclosures, including how to parse and track them

local-news-data
how hard is it to get a list of all local news sites in the United States (LOL)

python-at-stanford
Python Courses at Stanford

NICAR-Google-Refine
The lesson and source files for Dan Nguyen's NICAR 2012 lesson on Google Refine

pdftotablestable
Comparing the programs that extract tabular data from PDFs, e.g. ABBYY FineReader, Tabula, CometDocs

house-financial-disclosures
Scraping House representative financial disclosures

clinton-hillary-email-fbi-investigation-docs
OCR copy of the 2015-2016 FBI Investigation into Hillary Clinton's emails

pydataproject-template
dan's personal reference for properly creating an empty/fresh python-based data wrangling project

padjo-2017-sql-exam
PADJO 2017 SQL Exam - Now with extra election and disbursement data!

aws-textract-pdf-to-csv-demo
Testing the new AWS Textract when it comes to extracting data tables from PDFs (pdf-to-csv) and whether it can deliver us from our endless torments

nhtsa-complaint-data
Some scripts/data description for NHTSA complaint data

quickdataproject-template
a template I use for quick data project examples where collection, wrangling, and exploration can be done by standalone shell/python scripts

screencappy
A command-line tool for making it easier to create and save screenshots as a blogger

dmv-vanity-plate-rejections
A repo of collected data and records from U.S. state DMVs regarding rejected vanity license plates

csvkitcat
csvkitcat has been archived (Oct. 2020), and is being carted over to csvmedkit

frozen.analytics.usa.gov
A "frozen" version of https://analytics.usa.gov to practice network traffic inspection and web scraping

writhub
A simple Python-based static post generator, because I just need to post, not make an entire website

journaling-on-github
My personal repo for doing quick journaling on Github with Markdown, plus some helper TOC scripts

acp-2017-finding-stories-in-data
"How to Find Stories in Data" for the Associated Collegiate Press 2017 San Francisco Midwinter Convention

kfc-scrape
chicken

til
A simple static Jekyll blog of things I've learned, day-to-day, particularly in programming and data journalism

altair-dataviz
Visualization in Python with the Altair library. Done in Jupyter Notebooks.

mechanical-unmurk-ocr
For the OCRing of scanned, murky documents where privacy, speed, accuracy, and cost are all priorities

seeing-is-beliebing
Instagram util for finding photos taken shortly before and after near where another photo was taken

simplestuff-sqlite
A data/lesson repo teaching SQL syntax and concepts with a very simple SQLite database

smalldata
A list of small datasets for examples of exploration in spreadsheets

cms_medicare_fee_data
Data notebook for CMS Medicare fee data

marktoc
A Python library for generating a table of contents and anchor markup for a Markdown file

sf-shelter-waitlist-daily-snapshots
A compilation of daily snapshots of San Francisco's emergency shelter reservation wait-list during the COVID-19 pandemic

seshkit
seshkit is a command-line tool for creating transcripts from audio files

excsv
goofin around with a command-line utility for quickly inspecting CSV files

merle
A command-line tool for getting meta information from a URL

DepGal
Build out a gal using RMagick

csvviz
please i would like someday a tool that is like csvkit but for making charts from the command line

supcli
supcli: my personal guide to modern CLI, including third-party replacement for classic Nix tools

xkcd-on-reactjs
Just playing around with React.js to make a searchable xkcd archive

yearbook

ny-gis-cartodb-fun
Examples of GIS with New York data and CartoDB

sf-ethics-lobbyist-sql
A repo of San Francisco lobbyist data compiled into SQLite form, including data-handling scripts

emojicsv
Machine-readable emotions in machine-readable CSV

command-line-basics-mz2022
command line lessons for 2022 quickie repo

SCOTUS-Transcript-Viewer
A Backbone.js viewer of SCOTUS transcripts

Shakyspeare
Analyzing the Bard's work with Ruby!

death-data

bts-transstats-t100-domestic-demo
Demo of data processing for BTS transtats

middleman-meta-tags
Meta and SEO tag helpers for Middleman

bashappy_helpers
A bunch of helper functions I wrote to use for my own macOS terminal convenience

air_skift
Air rails

secdataexploring
fetching and exploring SEC structured data for fun

dod-leso-1033-data
A repo for collecting data/records regarding the Defense Logistics

matplotlib-styling-tutorial
A quick iPython notebook showing how to create and style Matplotlib charts with roughly same flexibility as ggplot2

texas-state-salaries
playing around with texas state salary data courtesy of the Texas Tribune

healthcare.gov
A copy of healthcare.gov when it was built on Jekyll, before they removed the source code

jekyll-datasite-template
Trying to make a template that scaffolds a basic jekyll site with bootstrap and vendor d3v5

pgark
pgark (page archiver): Python library and CLI for archiving URLs on popular services like Wayback Machine [alpha, just spitballing]

nature-inspired-algorithms-in-python
Going through Jason Brownlee's "Clever Algorithms: Nature-Inspired Programming Recipes" http://cleveralgorithms.com/nature-inspired/stochastic/random_search.html

lookups-of-note
Lookup tables and data references

censusscout
making my own lightweight version of Census Explorer because y not

motherfuckingwebdesignguide
just do it

foodscrape
A demonstration of scraping health inspection websites and doing statistical analysis

nicar-2019-github-intro
Intro to git and github for journalists

Sinatra-Fun
Testing out sinatra

jekyll-bootstrap-starter
a basic jekyll theme that sits atop of Bootstrap 4.x. For my convenience only

data-wrangling-fakebook
The Little Data Wrangling Fakebook

foiastories
a curated list of interesting foia/foil requests

astronautdata
A repo of astronaut data

danssphinx-template
This is a bunch of examples of things I forget how to do in Sphinx and reST

sql2md
A bash script for converting SQLite query into Markdown-ready-pastable results

poynter-census-data-2019
Poynter Census Data Workshop 2019, using Sphinx-hieroglyph slidemaker

stanford-public-affairs-data-journalism

sf-evictions
just collecting san francisco evictions data

d3choro-template
yaddaydaydayda

merde
Shit

digital-jo-2017
Quickie repo for digital journalism notes for stanford journalism 2017

twitkit
yet another attempt at making a personal twitter data exploration command-line tool

wire-glossary
the fuck did I do

high-charty

wikipedia-trends

revelecture
A command-line tool to turn Markdown files into Reveal.js powered slideshows

hello-svelte
need to practice this javascript thing

ok-earthquakes-RNotebook
Using R's ggplot2 and rgdal to examine earthquake activity in Oklahoma

fatal-encounters-and-census-sql
SQLite database exercises for analyzing Fatal Encounters (police officer involved homicides) and Census data

python-audio-playtime
experimenting with Python audio visualizers and extraction libraries

scrapespeare
A collection of The Bard's text for basic programming exercises and data mining.

twitch-stream-exploring-ppp-with-cli
Just some notes and data and files for a twitch stream on how to data wrangle the PPP loan data