Ted Dunning (@tdunning)
  • Stars 2,924
  • Global Rank 9,934 (Top 0.4 %)
  • Followers 668
  • Following 20
  • Registered about 14 years ago
  • Most used languages
    Java 65.7 %
    Julia 11.4 %
    C 5.7 %
    R 5.7 %
    Python 2.9 %
    Dockerfile 2.9 %
    TeX 2.9 %
    HTML 2.9 %

Top repositories

1

t-digest

A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
Java
1,833
star
2

MiA

Mahout in Action Example Code
Java
348
star
3

log-synth

Generates more or less realistic log data for testing simple aggregation queries.
Java
252
star
4

anomaly-detection

A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals
Java
102
star
5

Plume

Explorations relative to cloning FlumeJava
Java
92
star
6

knn

Large scale k-nn experiments
Java
69
star
7

pig-vector

Mahout vector encoding for Pig
Java
54
star
8

bandit-ranking

HTML
51
star
9

feature-extraction

Sample techniques for a variety of feature extraction methods
Java
32
star
10

python-llr

A Python implementation of the most commonly used variants of the G-test
Python
23
star
11

open-json

Open JSON - a truly open source JSON implementation
Java
17
star
12

probability-book

A copy of the source for Grinstead and Snell's lovely probability book
TeX
13
star
13

k-means-auto-encoder

Some quick exploration of how k-means auto-encoders work
R
11
star
14

t-digest-benchmark

A simple JMH benchmark for various versions of t-digest
Java
9
star
15

in-memory-cooccurrence

Analyze for significant cooccurrence using Mahout sparse matrices
Java
8
star
16

sequencemodel

The sequence anomaly detector from our second book in the Practical Machine Learning Series
8
star
17

Chapter-16

Example server for Chapter 16 of Mahout in Action
Java
6
star
18

pcap-filter

Experiments in PCAP file decoding at speed
Java
6
star
19

ancient-stats

The ancient C version of the LLR statistics and related utilities. This is for reference only.
C
5
star
20

ponies

Sample recommender flow for search as recommendation
5
star
21

mahout-examples

Mahout Examples
Java
4
star
22

sequence-model

A simple implementation of a probabilistic model for event sequences
4
star
23

parksim

Java
4
star
24

TDigest

Native Julia implementation of t-digest
Julia
3
star
25

cluster-hinting

R
2
star
26

config-print

Prints Hadoop configuration variables
Java
2
star
27

freezer

A hybrid discrete/continuous simulation of a freezer and its users
Java
2
star
28

graph-demo

Demonstrates use of the multi command in ZooKeeper
Java
2
star
29

H3Geometry

H3 convenience package
Julia
2
star
30

h2o-matrix

Demonstration of Mahout compatible matrix and vector types based on h2o
Java
2
star
31

G2

Implements the G^2 test for comparing counts
Julia
1
star
32

meta-ep

Recorded-step meta-mutation implementation and paper
C
1
star
33

timeSkew

Quick test for timers
Java
1
star
34

ubuntu-bounce-host

A simple container that supports bouncing a login to another host via ssh
Dockerfile
1
star
35

OpenUnits

1
star
36

image-rep

A repo for storing images referenced in other projects
1
star
37

split-search

Quick tests of the basins of attraction in an ERT optimizer
Java
1
star
38

t-digest-example

Java
1
star
39

ibm2ieee

Julia package to convert between IBM floating point (aka hexadecimal floating point or HFP) and IEEE floating point
Julia
1
star
40

work-group

Simple management framework for a fixed set of workers that come up at somewhat unpredictable times.
Java
1
star