• Stars
    star
    24
  • Rank 981,178 (Top 20 %)
  • Language
    Ruby
  • Created over 12 years ago
  • Updated over 12 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A collection of MapReduce tasks translated (from Pig, Hive, MapReduce streaming, Cascalog, etc.) into Scalding.

More Repositories

1

restricted-boltzmann-machines

Restricted Boltzmann Machines in Python.
Python
940
star
2

dirichlet-process

Introduction to Nonparametric Bayes, Infinite Mixture Models, and the Dirichlet Process (+ McDonald's)
R
297
star
3

ggplot2-tutorial

Quick introduction to ggplot2 (no knowledge of R assumed)
R
236
star
4

link-prediction

Solution to Facebook's link prediction contest on Kaggle.
Scala
205
star
5

scaldingale

Movie recommendations and more in MapReduce and Scalding
Scala
117
star
6

streaming-simulations

Simulating the performance of various streaming algorithms. #experimentalmathematics
R
59
star
7

minifolds

ggplot2-inspired d3 app to make instant interactive visualizations
CoffeeScript
55
star
8

unsupervised-language-identification

An unsupervised language identification algorithm in Ruby, built originally for detecting English-language tweets.
Ruby
39
star
9

gap-statistic

An implementation of the gap statistic algorithm to compute the number of clusters in a set of numerical data.
R
39
star
10

sarah-palin-lda

Topic Modeling the Sarah Palin emails.
Scala
34
star
11

principal-components-analysis

Python/Numpy PCA using the transpose trick.
Python
28
star
12

information-propagation

Information Propagation in a Social Network
R
28
star
13

sparta

Instantly turn your data into charts and dashboards. It's like a mini Tableau.
JavaScript
27
star
14

prediction-strength

An implementation of the prediction strength algorithm from Tibshirani, Walther, Botstein, and Brown's "Cluster validation by prediction strength".
R
19
star
15

data-hacks

Command-line utilities for data analysis.
Ruby
18
star
16

lstm-explorer

Web app for exploring LSTMs.
JavaScript
17
star
17

kickstarter-data-analysis

Digging into data from kickstarter.com.
16
star
18

twss-classifier

A That's What She Said classifier, built off a simple unigram Naive Bayes model.
Ruby
16
star
19

dangle

Playing around with Tangle + d3.
JavaScript
11
star
20

gradient-svd

A simple SVD + LSI implementation in Ruby, based on gradient descent. Useful if you have a *small* matrix with missing values.
Ruby
8
star
21

d3-tutorial

Quick introduction to d3.
JavaScript
6
star
22

nvd3

D3 graphing library, originally forked from nvd3.js
JavaScript
6
star
23

scalding-book

5
star
24

echen.github.io

HTML
5
star
25

old-blog

JavaScript
5
star
26

embedding-explorer

JavaScript
3
star
27

hurricane-sandy-outages

Power outages during Hurricane Sandy.
R
2
star
28

pinterest-evals

JavaScript
1
star