• Stars
    star
    16
  • Rank 1,311,288 (Top 26 %)
  • Language
    HTML
  • Created about 8 years ago
  • Updated over 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.

More Repositories

1

building-spark-applications-live-lessons

Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow.
Jupyter Notebook
66
star
2

Opinion-Mining-Project

Feature-Based Sentiment Analysis in Python
Python
58
star
3

spark-install

Installation guide for Apache Spark + Hadoop on Mac/Linux
Shell
57
star
4

self-study-resources

DSI Self Study Resources
Shell
18
star
5

ml-workshop

12
star
6

dataweek-workshop

Machine learning workshop using Python, pandas, and scikit-learn. The first half of the day covered supervised classification using Logistic Regression and how to use cross validation to evaluate your models . The second half of the day covered unsupervised clustering with Kmeans as well as an overview of the data science process.
10
star
7

zipfian-distribution

A self contained environment to do data science with {Python | Shell | R | Hadoop}. This is a Vagrant box built on Ubuntu 12.04 LTS
Ruby
10
star
8

probabilistic-programming-intro

Introduction to probabilistic programming using PyMC3
Jupyter Notebook
6
star
9

python-anti-patterns

A presentation of commonly observed beginner-mistakes.
Jupyter Notebook
5
star
10

ZA-Final-Project

Zipfian Academy Final Project - Twitter Community Detection
Python
3
star
11

stats-shortcourse

The statistics short course is both a resource and survey of the areas of probability and statistics that are foundational for the data science immersive at Galvanize.
3
star
12

chrome-statistics-data

Python
2
star
13

Project-PPMI

Analyze Data from a Large Parkinson's Clinical Study
2
star
14

government-shutdown

R
2
star
15

DS-Glossary-RPT1

student-led glossary of data science terms
2
star
16

oakland-law-network-data

1
star
17

live_coding

Live Coding repo for showing to an info session
Jupyter Notebook
1
star
18

performotron

A little utility for allowing students to post the performance of their models to slack.
Python
1
star
19

RFT4-Capstones

Directory of Final Capstones for Galvanize Data Science, Remote Full Time, Cohort 4
1
star
20

example

An example repository containing an exercise and lecture used as a reference for instructor mock lectures.
Python
1
star
21

ad-hoc-lectures

Python
1
star
22

george

Helpers for Zipfian Academy curriculum and whatnot
Python
1
star
23

link-functions

1
star
24

july-2-tom-screencast

1
star
25

ds-nyc-project-shell

a shell for a data science repo for capstone projects
Jupyter Notebook
1
star
26

IntroToMachineLearning

Intro To Machine Learning Galvanize
Jupyter Notebook
1
star