Olivier Grisel (@ogrisel)

Top repositories

1

parallel_ml_tutorial

Tutorial on scikit-learn and IPython for parallel machine learning
Jupyter Notebook
1,589
star
2

notebooks

Some sample IPython notebooks for scikit-learn
Jupyter Notebook
556
star
3

pygbm

Experimental Gradient Boosting Machines in Python with numba.
Python
177
star
4

pignlproc

Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
Java
159
star
5

python-appveyor-demo

Demo project for building Python wheels with appveyor.com
PowerShell
153
star
6

docker-distributed

Experimental docker-compose setup to bootstrap distributed on a docker-swarm cluster.
Shell
92
star
7

spylearn

Repo for experiments on pyspark and sklearn
Python
79
star
8

paper2ebook

Utility to re-structure research papers published in US Letter or A4 format PDF files to typically remove the 2 columns layout.
Java
53
star
9

text-mining-class

Introduction to web scraping and text mining
Python
43
star
10

dbpediakit

Python utilities to do work with the DBpedia dumps for analytics.
Python
38
star
11

euroscipy-2022-time-series

Tutorial on time-series forcasting with scikit-learn
Jupyter Notebook
32
star
12

wheelhouse-uploader

Script to help maintain a wheelhouse folder on a cloud storage.
Python
31
star
13

my-linux-devbox

Vagrant / Salt configuration with Ubuntu to work on projects related to the scipy stack under Python 3 and Python 2
Scheme
26
star
14

oglearn

ogrisel's utility extensions for scikit-learn
Python
24
star
15

eegssl

Experiments on Self-Supervised Learning on EEG data
Python
16
star
16

mahout

Personal development repository to prepare contributions and patches for Apache Mahout
Java
15
star
17

euroscipy_2017_sklearn

Notebooks for the EuroScipy 2017 tutorial (based on Adult Census income data)
Jupyter Notebook
15
star
18

corpusmaker

clojure utilities to build training corpora for machine learning / NLP out of public wikimedia dumps: status - partially stalled - will probably be reworked as cascalog scripts -- this project is in stalled mode right now: the pignlproc project is likely to replace it due to licensing constraints for future integration in Apache projects
Clojure
14
star
19

python-winbuilder

Tools to script a build environment on Windows for Python project
Python
9
star
20

codemaker

Neural nets-based utility to build low dimensional codes or/and sparse codes
Python
9
star
21

pycon-pydata-sprint

Experimental work for using IPython.parallel with scikit-learn
Python
8
star
22

salt-ipcluster

Salt states and modules to setup an IPython cluster
Scheme
7
star
23

docker-openblas

Docker container with an automated build for OpenBLAS stable branch:
Shell
5
star
24

stanbol-isbn

Demo stanbol extension for detecting and linking ISBN in text document
Java
5
star
25

silva

Leaf recognition prototype
4
star
26

bbuzz-semantic-hackathon

Sandbox for the Berlin Buzzwords semantic hackathon
Java
3
star
27

research

Draft research notes, code and todos
Jupyter Notebook
3
star
28

scikit-learn-github-actions

Test repo for github actions workflows
Python
2
star
29

ipython-azure

Utilities to deploy a IPython parallel cluster on Windows Azure
Python
2
star
30

lsh_glove

Script to build various LSH / ANN indices on glove word embeddings
Python
2
star
31

cardice

Cloud compute cluster setup with SaltStack
Python
2
star
32

brain2vec

Brain embedding by contextual predictions (draft)
Python
2
star
33

energy_charts

Jupyter Notebook
2
star
34

decks

Slide decks for conferences
CSS
2
star
35

instrumentalist

Python scripts to read XBee sensor data and push it to a couchdb database
Python
2
star
36

mnist-sbi

Simulation Based Inference for the important problem of drawing digits
Python
2
star
37

scikit-learn.org

Source repository to build the HTML website for the scikit-learn project.
Python
1
star
38

camera-html5

Test repo for HTML5 camera access on mobile phones
1
star
39

sandbox

1
star
40

cpython-nightly

Automated build of the master branch of CPython for Continuous Integration purposes
1
star
41

docker-sklearn-openblas

Shell
1
star
42

us-housing-prices-v2-parquet

Exploratory Data Analysis on a parquet dump of https://www.dolthub.com/repositories/dolthub/us-housing-prices-v2 using duckdb and Ibis
Jupyter Notebook
1
star