• Stars
    112
  • Rank 312,240 (Top 7 %)
  • Language
    Jupyter Notebook
  • License
    Other
  • Created almost 10 years ago
  • Updated 5 months ago

Repository Details

Material that I use for a variety of classes and tutorials

This repository contains notes for various classes and seminars that I teach at NYU. They are focused on teaching programming for data science to non-CS majors. The emphasis is on offering live examples that students can use directly to complete their goals.

Accessing your Data Science Environment

We set up and deploy our data science environment (effectively, Jupyter with Python and R support, plus MySQL) using Docker. By default, students connect to a JupyterHub server that runs on Kubernetes. Students can also run the same environment locally on their laptops, or deploy the Docker image on AWS or Google Cloud.

More Repositories

1

ReadabilityMetrics

A web service that computes a set of readability metrics for text. We currently support the following metrics: Automated Readability Index, Coleman-Liau Index, Flesch–Kincaid Grade Level, Flesch Reading Ease, Gunning-Fog Index, SMOG score, and SMOG Index.
Java
71
star
2

Get-Another-Label

Quality control code for estimating the quality of the workers in crowdsourcing environments
Java
69
star
3

introduction-to-databases

Jupyter Notebook
42
star
4

introduction-to-python

Notes for the "Introduction to Programming for Data Science" class
Jupyter Notebook
37
star
5

Mturk-Tracker

Deprecated - Software for gathering historical data from Amazon Mechanical Turk Service
Python
36
star
6

Troia-Server

Quality Control API for Crowdsourcing Applications
Java
15
star
7

mturk_demographics

Analyzing MTurk demographics
Jupyter Notebook
14
star
8

WikiSynonyms

Extracts synonyms for various terms, exploiting the redirects between terms in Wikipedia
PHP
12
star
9

Intrade-Archive

A crawler that downloads and stores data for Intrade prediction markets. Built for Google App Engine using Java.
Java
10
star
10

scholar_update

Small script that scrapes Google Scholar and creates a JSON file with my publications
Python
4
star
11

mturk-surveys

Continuous surveys of the Mechanical Turk workers
JavaScript
3
star
12

urlannotator

Python
2
star
13

get-another-label-continuous

get-another-label-continuous
Java
2
star
14

Troia-Web

JavaScript
2
star
15

Troia-Java-Client

Java
1
star
16

Troia-Tester

Java
1
star
17

CheaterLeaker

Detecting leaked versions of the oDesk tests
Java
1
star