• Stars
    star
    663
  • Rank 67,991 (Top 2 %)
  • Language
    PHP
  • License
    BSD 3-Clause "New...
  • Created almost 12 years ago
  • Updated 2 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Open Machine Learning

License

OpenML: Open Machine Learning

Welcome to the OpenML GitHub page! 🎉

Contents:

Who are we?

We are a group of people who are excited about open science, open data and machine learning. We want to make machine learning and data analysis simple, accessible, collaborative and open with an optimal division of labour between computers and humans.

What is OpenML?

Want to learn about OpenML or get involved? Please do and get in touch in case of questions or comments! 📨

OpenML is an online machine learning platform for sharing and organizing data, machine learning algorithms and experiments. It is designed to create a frictionless, networked ecosystem, that you can readily integrate into your existing processes/code/environments, allowing people all over the world to collaborate and build directly on each other’s latest ideas, data and results, irrespective of the tools and infrastructure they happen to use.

As an open science platform, OpenML provides important benefits for the science community and beyond.

Benefits for Science

Many sciences have made significant breakthroughs by adopting online tools that help organizing, structuring and analyzing scientific data online. Indeed, any shared idea, question, observation or tool may be noticed by someone who has just the right expertise to spark new ideas, answer open questions, reinterpret observations or reuse data and tools in unexpected new ways. Therefore, sharing research results and collaborating online as a (possibly cross-disciplinary) team enables scientists to quickly build on and extend the results of others, fostering new discoveries.

Moreover, ever larger studies become feasible as a lot of data are already available. Questions such as “Which hyperparameter is important to tune?”, “Which is the best known workflow for analyzing this data set?” or “Which data sets are similar in structure to my own?” can be answered in minutes by reusing prior experiments, instead of spending days setting up and running new experiments.

Benefits for Scientists

Scientists can also benefit personally from using OpenML. For example, they can save time, because OpenML assists in many routine and tedious duties: finding data sets, tasks, flows and prior results, setting up experiments and organizing all experiments for further analysis. Moreover, new experiments are immediately compared to the state of the art without always having to rerun other people’s experiments.

Another benefit is that linking one’s results to those of others has a large potential for new discoveries (see, for instance, Feurer et al. 2015; Post et al. 2016; Probst et al. 2017), leading to more publications and more collaboration with other scientists all over the world.

Finally, OpenML can help scientists to reinforce their reputation by making their work (published or not) visible to a wide group of people and by showing how often one’s data, code and experiments are downloaded or reused in the experiments of others.

Benefits for Society

OpenML also provides a useful learning and working environment for students, citizen scientists and practitioners. Students and citizen scientist can easily explore the state of the art and work together with top minds by contributing their own algorithms and experiments. Teachers can challenge their students by letting them compete on OpenML tasks or by reusing OpenML data in assignments. Finally, machine learning practitioners can explore and reuse the best solutions for specific analysis problems, interact with the scientific community or efficiently try out many possible approaches.


Get involved

OpenML has grown into quite a big project. We could use many more hands to help us out 🔧.

  • You want to contribute?: Awesome! Check out our wiki page on how to contribute or get in touch. There may be unexpected ways for how you could help. We are open for any ideas.
  • You want to support us financially?: YES! Getting funding through conventional channels is very competitive, and we are happy about every small contribution. Please send an email to [email protected]!

GitHub organization structure

OpenML's code distrubuted over different repositories to simplify development. Please see their individual readme's and issue trackers of you like to contribute. These are the most important ones:

  • openml/OpenML: The OpenML web application, including the REST API.
  • openml/openml-python: The Python API, to talk to OpenML from Python scripts (including scikit-learn).
  • openml/openml-r: The R API, to talk to OpenML from R scripts (inclusing mlr).
  • openml/java: The Java API, to talk to OpenML from Java scripts.
  • openml/openml-weka: The WEKA plugin, to talk to OpenML from the WEKA toolbox.

More Repositories

1

automlbenchmark

OpenML AutoML Benchmarking Framework
Python
402
star
2

openml-python

OpenML's Python API for a World of Data and More 💫
Python
280
star
3

openml-r

R package to interface with OpenML
HTML
95
star
4

openml.org

New OpenML website
JavaScript
25
star
5

docs

OpenML documentation
21
star
6

openml-tutorial

Learn how to use OpenML for reproducible, collaborative machine learning projects
HTML
13
star
7

Study-14

Jupyter Notebook
12
star
8

openml-docker-dev

Docker compose for starting local OpenML instances
Dockerfile
11
star
9

openml-java

Java library to interface with OpenML
Java
10
star
10

EvaluationEngine

Sources of the Java Evaluation Engine
Java
8
star
11

benchmark-suites

Jupyter Notebook
7
star
12

openml-pytorch

Pytorch extension for openml-python
Python
5
star
13

openml-dotnet

.NET API
C#
5
star
14

continual-automl

Adaptations of AutoML libraries H2O, Autosklearn and GAMA for stream learning
Python
5
star
15

openml-weka

The OpenmlWeka package
Java
4
star
16

openml-rapidminer

RapidMiner plugin
Java
4
star
17

articles

Latex sources and final PDF versions of all articles published on OpenML.
HTML
3
star
18

knime

KNIME plugin for OpenML
Java
3
star
19

randomBot

Random hyper-parameter sweep over OpenML tasks
R
2
star
20

blog

Blog for Openml.org
Jupyter Notebook
2
star
21

openml-tensorflow

Tensorflow extension for openml-python
Python
2
star
22

openml-deeplearning

Python
2
star
23

openml-aslib

Turns an OpenML study into an ASLib scenario
Python
1
star
24

meet

Hackathon website
JavaScript
1
star
25

openml-python-contrib

OpenML-compatible wrappers and projects in Python
Python
1
star
26

sklearn-bot

Random bot running sklearn classifiers on OpenML
Python
1
star
27

server-api

Python-based server
Python
1
star
28

openml-data

For tracking issues related to OpenML datasets
Python
1
star