• Stars
    star
    210
  • Rank 187,585 (Top 4 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created almost 9 years ago
  • Updated about 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A centralized repository to report scikit-learn model performance across a variety of parameter settings and data sets.

scikit-learn benchmarks

Join the chat at https://gitter.im/rhiever/sklearn-benchmarks

A centralized repository to report scikit-learn model performance across a variety of parameter settings and datasets.

Downloading the benchmark data

Please refer to PMLB to gain access to the curated datasets from this study. PMLB provides an easy-to-use Python interface to download the datasets.

Contributing

We welcome you to check the existing issues for bugs or enhancements to work on. If you have an idea for an extension of this project, please file a new issue so we can discuss it. Make sure to review our contribution guidelines before starting any work on this project.

Citing

If you use any of the code, data, or results from this project, please cite the following paper.

Randal S. Olson, William La Cava, Zairah Mustahsan, Akshay Varik, Jason H. Moore (2017). Data-driven Advice for Applying Machine Learning to Bioinformatics Problems. arXiv e-print

BibTeX entry:

@misc{OlsonLaCava2017,
    author={Olson, Randal S. and La Cava, William and Mustahsan, Zairah and Varik, Akshay and Moore, Jason H.},
    title = {Data-driven Advice for Applying Machine Learning to Bioinformatics Problems},
    year = {2017},
    howpublished = {arXiv e-print. https://arxiv.org/abs/1708.05070},
}

Support for this project

This project was developed in the Computational Genetics Lab with funding from the NIH. We're incredibly grateful for their support during the development of this project!

More Repositories

1

Data-Analysis-and-Machine-Learning-Projects

Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Jupyter Notebook
6,107
star
2

TwitterFollowBot

A Python bot that automates several actions on Twitter, such as following users and favoriting tweets.
Python
1,309
star
3

datacleaner

A Python tool that automatically cleans data sets and readies them for analysis.
Python
1,054
star
4

reddit-analysis

A Python script that parses post titles, self-texts, and comments on reddit and makes word clouds out of the word frequencies.
Python
285
star
5

optimal-roadtrip-usa

Contains maps for the article, "Computing the optimal road trip across the U.S." and similar articles
HTML
230
star
6

python-data-visualization-course

Course materials for teaching data visualization in Python.
Jupyter Notebook
169
star
7

reddit-twitter-bot

Looks up posts from reddit and automatically posts them on Twitter.
Python
137
star
8

name-age-calculator

Analyzes a name and guesses the age range of a person with that name.
HTML
43
star
9

redditviz

An interactive map of reddit: the "front page of the internet"
CSS
38
star
10

MarkovNetwork

Python implementation of Markov Networks for neural computing.
Python
36
star
11

ipython-notebook-workshop

Beginner's IPython Notebook Tutorial
19
star
12

baby-name-explorer

HTML
17
star
13

network-analysis-scripts

A bunch of useful scripts for analyzing networks.
Python
13
star
14

active-categorical-classifier

A tool that evolves small brains capable of scanning and classifying an image.
Jupyter Notebook
12
star
15

k-fold-cv-benchmark

Python
9
star
16

optimized-us-capitol-road-trip

HTML
9
star
17

crowd-machines

Jupyter Notebook
8
star
18

xrff2csv

A Python tool that converts XRFF files to CSV format.
Python
7
star
19

edd

A tool that evolves small brains capable of scanning and classifying an image.
C++
7
star
20

rhiever.github.io

Dr. Randal Olson's personal website
HTML
5
star
21

Collective-Cognition-Increases-Accuracy

Code for the model in the paper, "Accurate decisions in an uncertain world: collective cognition increases true positives while decreasing false positives."
Python
5
star
22

rhiever-bot

Bot that monitors /r/MUWs and runs the MUW script.
Python
4
star
23

big-ten-twitter-network

Interactive visualization of the Big Ten football teams on Twitter
JavaScript
3
star
24

biped-hyperneat

ODE implementation of a walking biped robot with HyperNEAT evolving the neural controller
PHP
3
star
25

dissertation-topic-network

Dissertation topic network
3
star
26

big-data-hw

2
star
27

Intro-to-Evolutionary-Modeling

Material for teaching biologists to work with digital evolutionary models.
2
star
28

rmagic-tutorial

A brief tutorial showing how Rmagic can be used in IPython Notebook.
2
star
29

marriage-divorce-stats

144 years of marriage and divorce in 1 chart
HTML
1
star
30

EvoRoboCodeGECCO2013

Description of our EvoRoboCode competition submission to GECCO 2013.
1
star
31

drug-alcohol-mentions

1
star
32

2014-01-30-mit

Software Carpentry bootcamp at Massachusetts Institute of Technology on January 30-31, 2014
Python
1
star
33

betting-game

Game Theory: betting game
C++
1
star
34

temp-repo

HTML
1
star
35

eos-old

Evolution of Swarming Platform
C++
1
star
36

ipython-example

Example notebook showing how to do statistics in IPython Notebook.
Python
1
star
37

AMT-biped-analysis

1
star
38

eos-active-perception

EOS with agents who have to actively perceive the environment with a fine-grained retina.
C++
1
star