• Stars
    star
    218
  • Rank 175,543 (Top 4 %)
  • Language
    Julia
  • License
    MIT License
  • Created over 7 years ago
  • Updated 9 days ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Utility package for accessing common Machine Learning datasets in Julia

MLDatasets.jl

Docs Stable Docs Latest CI

This package represents a community effort to provide a common interface for accessing common Machine Learning datasets. In contrast to other data-related Julia packages, the focus of MLDatasets.jl is specifically on downloading, unpacking, and accessing benchmark datasets. Functionality for the purpose of data processing or visualization is only provided to a degree that is special to some dataset.

This package is a part of the JuliaML ecosystem. Its functionality is built on top of the package DataDeps.jl.

Available Datasets

Datasets are grouped into different categories. Click on the links below for a full list of datasets available in each category.

  • Graphs - Datasets with an underlying graph structure: Cora, PubMed, CiteSeer, ...
  • Misc - Datasets that do not fall into any of the other categories: Iris, BostonHousing, ...
  • Text - Datasets for language models.
  • Vision - Vision related datasets such as MNIST, CIFAR10, CIFAR100, ...

Installation

To install MLDatasets.jl, start up Julia and type the following code snippet into the REPL. It makes use of the native Julia package manger.

import Pkg
Pkg.add("MLDatasets")

Contributing to MLDatasets

Pull requests contributing new datasets are warmly welcome. See the source code of any of the available implemented datasets for implementation examples.

Other data repositories for Julia

If you don't find here the dataset you are looking for, please let us know by opening an issue. Moreover, you can check out these other packages to find what you need:

License

This code is free to use under the terms of the MIT license.

More Repositories

1

Reinforce.jl

Abstractions, algorithms, and utilities for reinforcement learning in Julia
Julia
201
star
2

LossFunctions.jl

Julia package of loss functions for machine learning.
Julia
146
star
3

OpenAIGym.jl

OpenAI's Gym binding for Julia
Julia
104
star
4

MLDataUtils.jl

Utility package for generating, loading, splitting, and processing Machine Learning datasets
Julia
101
star
5

TableTransforms.jl

Transforms and pipelines with tabular data in Julia
Julia
100
star
6

MLUtils.jl

Utilities and abstractions for Machine Learning tasks
Julia
99
star
7

OpenAI.jl

OpenAI API wrapper for Julia
Julia
86
star
8

LIBSVM.jl

LIBSVM bindings for Julia
Julia
85
star
9

MLDataPattern.jl

Utility package for subsetting, resampling, iteration, and partitioning of various types of data sets in Machine Learning
Julia
60
star
10

Learn.jl

JuliaML bundled in a convenient all-in-one toolkit.
Julia
57
star
11

AtariAlgos.jl

Arcade Learning Environment (ALE) wrapped as a Reinforce.jl environment
Julia
40
star
12

MLLabelUtils.jl

Utility package for working with classification targets and label-encodings
Julia
32
star
13

Transformations.jl

Static transforms, activation functions, and other implementations of LearnBase abstractions
Julia
31
star
14

StochasticOptimization.jl

Implementations of stochastic optimization algorithms and solvers
Julia
30
star
15

DensityRatioEstimation.jl

Density ratio estimation in Julia
Julia
30
star
16

ValueHistories.jl

Utilities to efficiently track learning curves or other optimization information
Julia
29
star
17

LearningStrategies.jl

A generic and modular framework for building custom iterative algorithms in Julia
Julia
28
star
18

PenaltyFunctions.jl

Julia package of regularization functions for machine learning
Julia
26
star
19

MLMetrics.jl

Metrics for scoring machine learning models in Julia
Julia
25
star
20

MLPlots.jl

Plotting recipes for statistics and machine learning using Plots.jl
Julia
24
star
21

TableDistances.jl

Distances between heterogeneous tabular data
Julia
23
star
22

LearnBase.jl

Abstractions for Julia Machine Learning Packages
Julia
17
star
23

MLPreprocessing.jl

Julia
15
star
24

LIBLINEAR.jl

LIBLINEAR bindings for Julia
Julia
11
star
25

META

Discussions related to the future of Machine Learning in Julia
10
star
26

OpenAIGymAPI.jl

A Julia package providing access to the OpenAI Gym API
Julia
10
star
27

ContinuousOptimization.jl

A playground for implementations of unconstrained continuous full-batch optimization algorithms
Julia
8
star
28

TransformsBase.jl

Base package for general data transformations in Julia
Julia
7
star
29

DataScienceTraits.jl

Traits for data science
Julia
6
star
30

ObjectiveFunctions.jl

Generic definitions of objective functions using abstractions from LearnBase
Julia
5
star
31

JuliaML.github.io

The home page of the JuliaML organization
JavaScript
5
star
32

StatsLearnModels.jl

Statistical learning models for tabular data
Julia
4
star
33

Prox.jl

Bank of proximal operators to support proximal optimization algorithms
Julia
3
star
34

RankAggregation.jl

Rank aggregation in Julia
Julia
3
star
35

FileStorage

Storage for images and other binary files used throughout our documentation
Julia
2
star
36

TransformsAPI.jl

Julia API for general data transformations
Julia
2
star
37

ColumnSelectors.jl

Column selectors for tables
Julia
2
star