Repository Details

Start here!

Data for Democracy

Data for Democracy's mission is to be an inclusive community for data scientists and technologists to volunteer and collaborate on projects that make a positive impact on society.

Who runs D4D?

The community does! Anybody who wants to contribute to making D4D better is highly encouraged to do so! Everything in D4D thus far has come from grassroots ideas proposed and pushed forward by a community member just like you and there's always more room for improvement. Whether it's starting a new project or proposing a new tool for the community to use, don't hesitate to just go for it!

Getting Started

  • Read more about our origin story.
  • We use Slack and GitHub as our primary means of organization and communication. Email team [at] datafordemocracy.org for an invite to the team.
  • Read about our current projects, get involved in your city, or check out how you can volunteer across the community.
  • If you see a project you are interested in, join the project Slack channel, introduce yourself to the community and the project lead(s).
  • Visit project readme for specific instructions on who to contact or how to get started.
  • Not sure which project you want to join? Introduce yourself in #general. Talk to a few project leads and let them know what you are interested in. Project leads can help you determine if a project is a good fit for you. The leads for each project are listed below.
  • Have an idea for a new project? Join #project-ideas to start a discussion on a new project.
  • Looking for a certain type of task in any project? Check the task digest.
  • Not sure how to get started? Contact @gecky in the Slack, or check out the #onboarding channel, and we'll help you out. Also check out #micro-projects for quick projects that'll help you get your feet wet.
  • Want to pick up some skills before diving in? Check out the tutorials which are all relevant to D4D projects and chat about it in #learning.
  • Check out our data.world page at data.world/data4democracy, where data can be shared. See our guide to get started. Contact @scuttlemonkey in Slack with questions.

Across the Community

While most of our work is project based, the community is large and vibrant enough that we're looking for common needs across projects that the organization can support. These are ongoing roles that can be time consuming, but are also essential to making the community helpful to our partners and a valuable experience for our volunteers.

Storytelling

All of our projects are doing some interesting data gathering and analysis that we regularly share with the world on our blog and our YouTube channel. If you'd like to help with this, ping @lillianj in Slack!

Matchmaker

The community has grown to a point where there is a lot of activity and a steady stream of new volunteers, which is awesome! We could use more people who know the various projects under way and the skills of incoming volunteers. Matchmakers pair people with projects (ideally, with specific tasks) and routinely surface new opportunities to the broader community so folks at all skill levels can get involved. Ping @gecky in Slack if this sounds like fun to you!

Project Leads

You've probably noticed that there are a few people in every active project keeping the group on track, taking questions, and helping new folks get settled in -- that's by design. Leading a project can be a big time commitment, but it's hugely valuable to making sure we're doing useful work, and volunteers are getting the most out of volunteering. You don't need to be super technical, just organized and enthusiastic! Ping @mo in Slack if you're interested in leading a project.

City groups

Data for Democracy volunteers are all over the world. Many cities have dedicated channels in Slack, and some hold regular in-person meetups to collaborate on data projects specific to their city or state. This is mostly US-based for now, but that's not on purpose, so feel free to spin up a group where you live! (We're looking at you, London, Berlin, Warsaw, Bangalore, Sydney, and...). And of course, if you are in the US, there are probably at least a few D4D-ers where you live, so don't hesitate to start your own city group!

Austin

Channel: #city-austin Leads: @jonathon

Boulder

Channel: #city-boulder Leads: @sam

Chicago

Channel: #city-chicago Leads: @lillianj

DC

Channel: #city-dc Leads: @chris_dick, @redoakmark

Denver (meetup page)

Channel: #city-denver Leads: @chooliu, @gretchenriggs

Louisville

Channel: #city-louisville Leads: @eric_bickel, @margeaux, @mattgolsen

Los Angeles

Channel: #city-losangeles Leads: @chelsea.ursaner, @adamdrake

New York (meetup page)

Channel: #city-nyc Leads: @bkey, @henripal, @danielah, @ckmarcelus

Seattle (meetup page)

Channel: #city-seattle Leads: @zachmueller, @grichardson, @jtorrez

San Diego

Channel: #city-sandiego Leads: @eric_bickel, @ryanes

There are dozens of city-specific channels in the Slack, so just search for "city-yourcity" (e.g. "city-austin") to find people near you!


Current projects

We have many projects going in our Slack community. Below is a list of some of our more active projects that are seeking help.

Boston Crash Modeling

Project Description: This is an official collaboration between the City of Boston's Analytics Team and Data for Democracy. We want to build a crash prediction modeling application. This would combine things like our crowdsourced safety concerns dataset and crash reports with a wide range of other data from our own open data portal as well as from other sources like the state's transportation and transit agencies, weather conditions feeds, and traffic reports from sites like Google. The end result will be a set of dynamic predictions we can use to identify potential trouble spots and direct timely safety interventions--everything from protected bike lanes and better-marked crosswalks to new street lights and traffic light timings. We'll make this publicly available to everyone through a hosted map application, and develop the project as an open source template that can be adapted for use by other cities.

Slack: #boston-crash-modeling

Project Leads: @therriault (City of Boston lead), @niccolley (Data for Democracy lead)


Election Transparency

Slack: #election-transparency

Project Description: Aggregating and normalizing county-level election results to be shared with the broader community. This team now has a fantastic dataset that's difficult to obtain. In addition to the ongoing data engineering effort, they could use help from statisticians, political scientists, and anyone ready to analyze this data.

Project Leads: @chris_dick, @scottcame, @rachelanddata

Data: data.world/data4democracy/election-transparency


Immigration Connect

Project Description: Collecting, analyzing, and modeling data related to immigration. We're also building tools (visualizations, email parsers, matching algorithms to connect pro bono lawyers with immigrants in need) to support the work of organizations working on immigration law and other immigration-related issues.

Slack: #immigration-connect

Project Leads: @tylerjrichards, @jtorrez, @chavan


Task digest

If you're looking for a certain type of task (e.g. beginner-friendly) to lead you to a project, try checking the links below:

  • analysis: Analysis questions, ranging from basic, exploratory questions to more complex questions.
  • beginner-friendly: Good for people new to the project or technology stack used by the project. If you are experienced we encourage you not to claim these issues but instead post in comments offering to mentor a new person.
  • bug: Bug report.
  • data-collection: Data collection task.
  • data-viz: Data visualization task.
  • discussion: Indicates this issue is a public discussion but may not necessarily have a specific task associated to it.
  • documentation: Request for help with documentation.
  • enhancement
  • first-pr: For first timers only. Issues with this label are meant to be digestible and to help people get up to speed with cloning the repository, making a change, and creating a pull request. Please do not claim this issue unless it is your first pull request for this project.
  • help wanted: Catch-all label soliciting help.
  • modeling: Data modeling, machine learning task.
  • proposal
  • question
  • status-blocked: Not ready for development.
  • status-dev-ready: Development task that is fully defined and ready to start.
  • status-in-progress: Someone is actively working on this issue. Please reach out to the person assigned this issue if you would like to contribute.
  • status-needs-grooming: Issue is not fully defined. Either the person who opened the issue is not sure how to proceed or did not have time to complete. Indicates further research will be required to determine best path forward.
  • storytelling: Need help crafting a story. Writing assignment.
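
The labels above can also be used to search for tasks across projects. The following is a minimal sketch, not an official D4D tool: it builds a GitHub issue-search URL for a given label. The organization name `Data4Democracy` and the helper `label_search_url` are assumptions made for illustration; substitute the organization you actually want to search.

```python
from urllib.parse import urlencode

# Base URL of GitHub's web search page.
GITHUB_SEARCH = "https://github.com/search"


def label_search_url(label, org="Data4Democracy", state="open"):
    """Build a GitHub search URL for issues carrying `label` in `org`.

    `org` defaults to an assumed organization name; `state` is
    "open" or "closed".
    """
    # Quote the label so multi-word labels such as "help wanted" work.
    query = f'org:{org} is:issue is:{state} label:"{label}"'
    return f"{GITHUB_SEARCH}?{urlencode({'q': query, 'type': 'issues'})}"


if __name__ == "__main__":
    # Print one search link per label of interest.
    for name in ("beginner-friendly", "first-pr", "help wanted"):
        print(name, "->", label_search_url(name))
```

Opening one of the printed links in a browser lists matching issues across every repository in the organization, which is handy when you care more about the kind of task than about a particular project.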

Project Ideas

If you have an idea for a project or are looking to start a new one, check out the #project-ideas channel in Slack, and also the project-ideas repo in this organization.

You can check here for projects that are looking for help and here for projects that are looking for leads.

More Repositories

1

ethics-resources

165
star
2

election-transparency

A Data4Democracy community working to make elections and elections data more transparent
HTML
88
star
3

tutorials

A place for tutorials relevant to D4D projects.
Jupyter Notebook
85
star
4

drug-spending

Project to understand pharmaceutical spending, currently focused on US government programs.
HTML
71
star
5

usa-dashboard

A dashboard of key metrics for the USA
Python
69
star
6

github-playground

A safe place to experiment with git & GitHub in a group setting
Jupyter Notebook
44
star
7

internal-displacement

Studying news events and internal displacement.
Jupyter Notebook
43
star
8

immigration-connect

Building tools to connect and coordinate efforts to help those affected by immigration law changes in partnership with the NILC
Python
41
star
9

assemble

NOT AN ACTIVE PROJECT -- Check readme for data sources
Jupyter Notebook
36
star
10

far-right-analysis

Analysis related to the behavior of extreme far right online communities
Python
35
star
11

hate_speech_detector

Jupyter Notebook
32
star
12

media-crawler

Web scraper for generating a graph of media connections via articles, twitter, reddit, and more
Python
32
star
13

collect-social

Simply collect social media content
Python
28
star
14

zip-code-to-county

The code that generates the ZIP code to county lookup CSVs based on government data. The dataset can be found at https://data.world/niccolley/us-zipcode-to-county-state
Jupyter Notebook
26
star
15

indivisible

Aggregating call to action sites into a single application.
Python
25
star
16

project-ideas

A place for project ideas to live, be discussed, and be brought to life!
25
star
17

discursive

Twitter topic search and indexing with Elasticsearch
Python
21
star
18

house_expenditures

HTML
18
star
19

are-you-fake-news

16
star
20

datafordemocracy.org

Our public-facing website!
HTML
15
star
21

docker-scaffolding

Template for Docker-based projects
Python
14
star
22

campaign_expenditures

Jupyter Notebook
12
star
23

just-politics

Identifying vulnerable house and senate seats in the 2018 midterm elections
12
star
24

colorado

Highlighting code & projects from #city-denver and #city-boulder.
Jupyter Notebook
12
star
25

india-nfhs

Analyze India's National Family Health Survey with regard to women's empowerment and wealth
HTML
11
star
26

smtk

NOT ACTIVE -- get in touch with @bstarling on slack if you have ideas
Python
9
star
27

womens-march

This is a place to join and analyze data related to the Women's Marches. Potentially this can lead to further joining and analysis of data on resistance to Trump and support for rights or issues that Trump opposes, as the Scientists' March on Washington (and possibly others) are discussed and planned.
R
8
star
28

quorum

Python
7
star
29

ati-broadcastapp

The ATI Broadcast app group is building a Facebook publishing app for the group America the Indivisible. Via this app an AITD trusted partner will be able to broadcast a message to multiple Facebook groups from one interface. This will allow the local groups in a region, of which there are many, to better communicate and mobilize toward specific actions.
JavaScript
7
star
30

disinfo-campaigns

Research on disinformation operations, influence campaigns.
R
6
star
31

reddit-api-miner

Code to use the Reddit API to get data from subreddit(s)
Python
6
star
32

wh-financial-disclosures

Data collection and analysis around the financial disclosures from the Office of Government Ethics
Jupyter Notebook
6
star
33

democratizing_weather_data

Jupyter Notebook
6
star
34

incarceration-trends

Analysis of incarceration data to inform bail reform legislation in Colorado. Data for Democracy x ACLU of Colorado.
HTML
6
star
35

nyc-accessibility

HTML
6
star
36

propublica-org

Organizing info on ongoing ProPublica projects and team maintenance
5
star
37

micro-projects

Jupyter Notebook
5
star
38

town-council

Tools to scrape and centralize the text of meeting agendas & minutes from local city governments. NOT ACTIVE -- looking for new lead(s)!
Python
5
star
39

louisville-hackathon-mar2017

Repo storing all of the goodies from the March 2017 Louisville Hackathon
Java
4
star
40

city-sandiego

Working on awesome projects for the City of San Diego
HTML
4
star
41

official-foreign-travel

Python
4
star
42

predicting_poverty

We are going to try to use satellite imagery and publicly available wealth data to predict poverty using deep learning.
4
star
43

VA-open-data

HTML
3
star
44

political-ads

Python
3
star
45

immigration-germany

Exploring German immigration data
3
star
46

data-dot-gov-archive

3
star
47

Verum

Project dedicated to showing voters how their candidate represents them through the analysis of legislature data.
Python
3
star
48

metadata-for-democracy

Python
3
star
49

equifake

Credit monitoring but for propaganda
HTML
2
star
50

ethics-research

Repository for the Ethics Research GDEP working group
2
star
51

across-the-news

Build a tool that helps readers quickly see all perspectives of a story
2
star
52

educational-hub

Resources and tools to connect educators and students interested in exploring capstone/community service projects with D4D
CSS
2
star
53

residential-education-segregation

Exploring segregation in residential education
2
star
54

ethical-assessment

As of January, every new project at D4D goes through an ethical assessment in order to have a repo. Because the Global Data Ethics Project is a flagship partner project for D4D, we are incorporating ethical frameworks into our data work!
2
star
55

lobbyist-blacklist

Investigations to compile a database of Americans' IDs that were appropriated for industry-aligned comments
1
star
56

datafordemocracy.org-v2

JavaScript
1
star
57

untitled-beta

1
star
58

d4d-ambassador

Repository for the D4D Ambassador GDEP working group
1
star
59

police_cheat

Jupyter Notebook
1
star
60

d4dverse

Building R tools for D4D projects
1
star
61

tx-secession

Exploration of Texas secession conversation on Twitter
Jupyter Notebook
1
star
62

uk-elections

Identifying insights and patterns from analysis of UK election data.
1
star
63

announcements

Any announcements made in the #announcements channel in the D4D slack, for transparency.
1
star
64

fact-check

A home for all things fact-checking-related in D4D.
1
star
65

whitehousedotgov

Code that supports d4d's harvesting, munging, and analyzing data from current and past administrations’ public websites
R
1
star