Joshua Tauberer (@JoshData)

Top repositories

1

python-email-validator

A robust email syntax and deliverability validation library for Python.
Python
1,095
star
2

pdf-diff

A PDF comparison utility in Python.
Python
446
star
3

jot

JSON Operational Transformation (JOT)
JavaScript
353
star
4

pdf-redactor

A general purpose PDF text-layer redaction tool for Python 2/3.
Python
183
star
5

convert-outlook-msg-file

Python library to convert Microsoft Outlook .msg files to .eml/MIME message files.
Python
179
star
6

hackathon.guide

A logistics guide to running a successful hackathon.
HTML
176
star
7

rdfabout

Archival. Things I wrote about RDF from the mid-2000's. The validator is no longer maintained, sorry.
109
star
8

fast_diff_match_patch

Python package for Google's diff-match-patch native C++ implementation.
Python
73
star
9

crs-reports-website

The build process for EveryCRSReport.com.
Python
63
star
10

praat-py

From my PhD days: Praat-Py is a custom build of Praat, the computer program used by linguists for doing phonetic analysis on sound files, to allow for scripts to be written in the Python programming language, rather than in Praat's built-in language.
C
61
star
11

xml_diff

Compares two XML documents by diffing their text.
Python
40
star
12

why-use-cartograms

Analysis for a blog post on cartograms.
Python
29
star
13

party-platforms

The 2012 Democratic, Libertarian, and Republican Party platforms, plus every Democratic platform since 1840, cleaned up into nice XML.
26
star
14

parsey-mcparseface-server

[Archive] A simple Python Flask app to run Parsey McParseface.
Python
25
star
15

cmusphinx-alignment-example

How I got cmusphinx's transcript alignment tool to work.
Java
25
star
16

cartogrid

A grid-based cartogram generator.
Python
14
star
17

opengovdata.org

The website opengovdata.org.
CSS
14
star
18

globe-gores

Globe gores, in Javascript.
JavaScript
12
star
19

dc-code-editor

Prototype tool for editing the DC Code.
JavaScript
9
star
20

wmata-track-locations

WMATA Track Geospatial GIS Location Data
Python
9
star
21

dc-code-prototype

Unofficial Code of the District of Columbia in XML, produced under contract with the Council of the District of Columbia. Last updated in 2014.
7
star
22

crs-reports-scraper

Downloads Congressional Research Service (CRS) reports from the CRS.gov website (which is only visible from within the U.S. Capitol computer network).
HTML
7
star
23

thunderbird-spf

Archival: An anti-phishing/anti-spam Mozilla Thunderbird 3 extension for doing Sender Policy Framework (SPF) checks on incoming mail.
JavaScript
7
star
24

semweb-dotnet

Archival: A C#/.NET library for manipulating RDF. No longer in active development.
C#
6
star
25

s-p-500-simulator

Simulates an investor randomly choosing S&P 500 stocks.
Python
6
star
26

historical-state-population-csv

Historical Population of the U.S. States 1900-present in a CSV Spreadsheet
Python
6
star
27

django-annotator-store

A Django backend for okfn/annotator storage.
Python
6
star
28

printable-district-maps

High-resolution, print-quality congressional district maps and an example of loading Open Street Map (OSM) into Postgres.
Python
6
star
29

official.dccode.gov

The future website for https://official.dccode.gov.
Shell
5
star
30

nyc-traffic

An analysis of New York City traffic patterns on the arterial roads.
Python
5
star
31

color-scales

Color Scale Generator Using a Perceptually Valid Color Space
HTML
5
star
32

opengovdata.io

The website for my book, Open Government Data: The Book.
HTML
4
star
33

myhomepage

My (@JoshData's) homepage.
HTML
4
star
34

html5-stub

An HTML5/Bootstrap website template for starting new projects.
HTML
4
star
35

endsecretlaws

This is how I feel about surveillance.
CSS
3
star
36

infinite-tree

An infinite tree.
HTML
3
star
37

dchbx

DCHBX Health Exchange Plans
Python
3
star
38

exclusiveprocess

A simple Python 3 module for ensuring that your code does not execute concurrently in multiple processes, using POSIX file locking.
Python
3
star
39

django-pubmybook

A Django website for publishing a LaTeX book online in HTML.
Python
3
star
40

marcos

A generative model for natural language using a markov chain over syntactic relations, rather than serial order.
Python
3
star
41

wobblegram

A Python module to create a wigglegram, which is a sort of steeographic image, using a "MPO" file as input, which is created by some cameras.
Python
2
star
42

my2012district

The website my2012district.com, which helps U.S. voters find their new 2012 congressional district.
JavaScript
2
star
43

cotaskme

A task list where every task for you also appears "outgoing" on the task list of the person who requested the task. Based on an idea by Matthew Burton.
Python
2
star
44

datastore-loader

Utility script to load tabular data into the CKAN Datastore.
Python
2
star
45

dc.opendataday.org

The website for Open Data Day DC.
HTML
1
star
46

dc-bega-emails

Emails in 2017-2018 retreived through DC FOIA requests related to the Board of Ethics and Government Accountability's Office of Open Government.
1
star
47

JoshData

Config files for my GitHub profile.
1
star
48

apophenia-python

Python
1
star
49

census2000-to-rdf

(Archival) Perl script to turn the 2000 US Census into RDF.
Perl
1
star
50

dc-street-henge

Like Manhattanhenge but for the District of Columbia. For each day of the year identifies DC streets that line up with sunrise or sunset.
Python
1
star
51

battlelibs

A mad libs helper for Battledecks.
1
star
52

django-database-storage-backend

A Django 1.7-1.10 storages backend backed by your existing database.
Python
1
star
53

browser-padlock-guide

A Javascript library to render an example of a browser security padlock.
CSS
1
star
54

arfticle-three

Uhm. Too much time spent on this.
Python
1
star
55

py-fist-pump

Given 3D accelerometer data, compute the frequency of rhythmic motion and predict the next beat
Python
1
star
56

readlet

A bookmarklet that creates a Spritz speed-reading "reticule" for any web page you are viewing.
JavaScript
1
star
57

alexa-transit-times

An Alexa skill for getting the next WMATA Metro rail or bus times for your common trips.
JavaScript
1
star