District Data Labs (@DistrictDataLabs)

Top repositories

1

yellowbrick

Visual analysis and diagnostic tools to facilitate machine learning model selection.
Python
4,259
star
2

baleen

An automated ingestion service for blogs to construct a corpus for NLP research.
Python
86
star
3

machine-learning

Code & Data for Introduction to Machine Learning with Scikit-Learn
Jupyter Notebook
81
star
4

intro-to-nltk

Code and Notebooks for the Natural Language Processing with Python course.
Jupyter Notebook
66
star
5

blog-files

Public code files for the DDL blog
Python
56
star
6

cultivar

Multidimensional data explorer and visualization tool.
HTML
52
star
7

entity-resolution

Tutorial code and data for the entity resolution workshops.
Python
45
star
8

science-bookclub

Generating the next read for our book club, with Data Science!
Python
40
star
9

PyCon2016

Code bases, tutorials, posters, and other content for PyCon2016.
JavaScript
38
star
10

partisan-discourse

A web application that identifies party in political discourse and an example of operationalized machine learning.
Python
27
star
11

spark-workshop

Data and code for "Fast Data Applications with Spark and Python"
Python
25
star
12

yellowbrick-docs-zh

Chinese translation of Yellowbrick documentation
Python
19
star
13

brookings-nlp

Teaching materials for the text analytics course
Jupyter Notebook
18
star
14

minke

Graph extraction and NLP analysis for Baleen Corpora
Python
18
star
15

minimum-entropy

Minimum Entropy is a DDL hosted question/answer site for beginners who need answers to Data Science questions.
Python
16
star
16

PyCon2017

Resources and materials related to PyCon 2017.
HTML
11
star
17

django-data-product

An example data product using Django
Python
8
star
18

ceb-training

Notebooks and materials for DDL/CEB training.
Jupyter Notebook
7
star
19

topicmaps

Fast topic survey with associated word cloud visualization on completion.
HTML
7
star
20

diconf

Notebooks and code for "Visual Pipelines for Text Analysis" at the Data Intelligence Conference: June 23, 2017.
Jupyter Notebook
5
star
21

city-dash

City Intelligence Dashboard Project
Jupyter Notebook
5
star
22

Brookings_Python_DS

Jupyter Notebook
5
star
23

yellowbrick-docs-tr

Turkish translation of Yellowbrick documentation
Python
5
star
24

brookings

Teaching materials for web scraping class
Jupyter Notebook
5
star
25

bigtooth

Finding how common the strangers in your life are
Python
5
star
26

dod-ds-overview

Data Science and Big Data Overview Training
Jupyter Notebook
5
star
27

navyfcu-ml

Notebooks and data for Machine Learning course.
HTML
4
star
28

dos-managers-executives

Business Data Analysis for Managers and Executives Training
Jupyter Notebook
4
star
29

yellowbrick-datasets

Yellowbrick datasets management and deployment scripts.
Python
4
star
30

logbook

A simple web application for activity tracking and event aggregation.
Python
4
star
31

03-data-bandits

DATA BANDITS
JavaScript
3
star
32

brookings-sql

Teaching materials for the SQL course
R
3
star
33

dos-advanced-excel

Advanced Excel and Power BI Training
Jupyter Notebook
3
star
34

semnet-similarity

NLE implementation of similarity computation using semantic networks.
Python
3
star
35

content-optimization

Jupyter Notebook
2
star
36

03-mineralytics

HTML
2
star
37

mapreduce

A multiprocess implementation of MapReduce in Python
Python
2
star
38

supervised_ml_R

Code and slides for supervised machine learning in R
HTML
2
star
39

pycon2018

Resources for PyCon 2018
1
star
40

04-team4

Repository for Incubator 4 Team 4
CoffeeScript
1
star
41

transportation-project-1

Transportation Project for District Data Labs Incubator
Python
1
star
42

02-ppm-data

Private repo for PPM Data team.
Jupyter Notebook
1
star
43

03-EMU

Python
1
star
44

yellowbrick-docs-es

Spanish translation of the Yellowbrick documentation
Python
1
star
45

political_history

A machine learning approach to recording and analyzing the 2016 election.
Jupyter Notebook
1
star
46

sports-project-1

Retail Project for District Data Labs Incubator
Python
1
star
47

03-censusables

Private repo for Team 7.
JavaScript
1
star
48

04-team5

Repository for Incubator 4 Team 5
Jupyter Notebook
1
star
49

02-synthesizers

DDL Incubator 2.0 repository for the Synthesizers team.
Python
1
star
50

02-labormatch

Private repo for team Labor Match
Python
1
star
51

company-clustering

Intuitive Hierarchical Text-Based Clustering Research Project
Jupyter Notebook
1
star