  • Stars: 236
  • Rank 170,480 (Top 4%)
  • Language
  • Created about 7 years ago
  • Updated about 4 years ago

Repository Details

A list of datasets for applying statistics, machine learning, and technology to social good.

Datasets for Social Good Projects

I was inspired to create this after taking many project-based CS and AI classes at Stanford, where I would spend more time finding data for a problem I actually cared about than writing the baseline algorithm.

The list is divided by sector, and each link has a (D), (T), or (C) next to it. (D) represents a dataset; (T) represents a tutorial; (C) represents an online challenge you can download data from and contribute knowledge to.

I am sure there are many great datasets I have missed. If you have datasets to add, please create a pull request!

Health

Education

Environment

Government

Public Good

Other Good Lists of Datasets

More Repositories

1. gpt3-sandbox (JavaScript, 2,900 stars)
   The goal of this project is to enable users to create cool web demos using the newly released OpenAI GPT-3 API with just a few lines of Python.

2. create-ml-app (Python, 521 stars)
   Template Makefile for ML projects in Python.

3. toy-ml-pipeline (Jupyter Notebook, 206 stars)
   Toy example of an applied ML pipeline for me to experiment with MLOps tools.

4. overleave (JavaScript, 123 stars)
   Chrome extension that opens and syncs Overleaf compiled pdfs in a new window.

5. m1-setup (89 stars)
   Notes on how I set up my new M1 MacBook Pro

6. debugging-ml-talk (Jupyter Notebook, 29 stars)
   Code accompanying the "Debugging machine learning in production" talk

7. web3-reading-list (17 stars)
   List of good readings on web3.

8. spade-experiments (Python, 16 stars)
   Experiments to assess SPADE on different LLM pipelines.

9. ml-dataval-tutorial (Jupyter Notebook, 9 stars)
   Tutorial: Data Validation for Machine Learning Techniques

10. planner (Python, 8 stars)
    A "smart" planner that determines when to study, work on assignments, etc.

11. oreilly-monitoring (Jupyter Notebook, 7 stars)

12. research-ideas (7 stars)
    List of proposed abstracts I'd love to work on, if I had the time.

13. vython (Python, 5 stars)
    Versioning Python scripts.

14. questions (5 stars)
    Questions I have that I would love to explore if I have time.

15. datatracker (Python, 5 stars)
    WIP experimental project to make for a better ML development UX.

16. shreyashankar (4 stars)

17. anxiety-extension (JavaScript, 3 stars)
    Chrome extension for logging moods.

18. mltrace-ifc-demo (Jupyter Notebook, 3 stars)
    Project demo for CS294 Privacy-Preserving Systems.

19. shreyashankar.github.io (Astro, 3 stars)

20. streams (Jupyter Notebook, 2 stars)
    STREAMS: A Benchmark of Naturalistic Streaming Data for Online Continual Learning

21. prompteng (Python, 2 stars)
    Experiment scaffold for trying out different prompts.

22. wakeupnow (Java, 2 stars)
    Detect when people fall asleep at the wheel and wake them up

23. motion-sigmod-demo (Python, 2 stars)
    SIGMOD Demo 2024 Submission for Motion

24. commitwriter (Python, 1 star)
    Using gpt-4 to write docstrings and commit messages.

25. news-classifier-ui (HTML, 1 star)
    UI for News Classifier

26. bazarre (Java, 1 star)

27. sentiment (Python, 1 star)

28. Pi-ke (C, 1 star)
    CS107E Final Project

29. healthy-eating (Python, 1 star)

30. needle-in-the-real-world (Jupyter Notebook, 1 star)