• Stars
    star
    1,725
  • Rank 25,968 (Top 0.6 %)
  • Language
    Julia
  • License
    Other
  • Created almost 6 years ago
  • Updated 5 days ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A Julia machine learning framework
MLJ

A Machine Learning Framework for Julia

Build Status Documentation bibtex bibtex

MLJ (Machine Learning in Julia) is a toolbox written in Julia providing a common interface and meta-algorithms for selecting, tuning, evaluating, composing and comparing about 200 machine learning models written in Julia and other languages.

New to MLJ? Start here.

Integrating an existing machine learning model into the MLJ framework? Start here.

Wanting to contribute? Start here.

PhD and Postdoc opportunies See here.

MLJ was initially created as a Tools, Practices and Systems project at the Alan Turing Institute in 2019. Current funding is provided by a New Zealand Strategic Science Investment Fund awarded to the University of Auckland.

MLJ has been developed with the support of the following organizations:

The MLJ Universe

The functionality of MLJ is distributed over several repositories illustrated in the dependency chart below. These repositories live at the JuliaAI umbrella organization.

Dependency Chart

Dependency chart for MLJ repositories. Repositories with dashed connections do not currently exist but are planned/proposed.


Contributing  •  Code Organization  •  Road Map

Contributors

Core design: A. Blaom, F. Kiraly, S. Vollmer

Lead contributor: A. Blaom

Active maintainers: A. Blaom, S. Okon, T. Lienart, D. Aluthge

More Repositories

1

the-turing-way

Host repository for The Turing Way: a how to guide for reproducible data science
TeX
1,635
star
2

CleverCSV

CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Python
1,145
star
3

AIrsenal

Machine learning Fantasy Premier League team
Jupyter Notebook
252
star
4

rse-course

Materials for The Alan Turing Institute's Research Software Engineering course
Jupyter Notebook
220
star
5

distinctipy

A lightweight package for generating visually distinct colours.
Python
215
star
6

ReadabiliPy

A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.
HTML
178
star
7

TCPD

The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
Python
116
star
8

TCPDBench

The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data
114
star
9

scivision

scivision: a framework for scientific image analysis
JavaScript
89
star
10

environmental-ds-book

A computational notebook community for open environmental data science 🌎
TeX
84
star
11

WorldCupPrediction

Predicting results for the FIFA Men's 2022 World Cup and the FIFA Women's 2023 World Cup.
Jupyter Notebook
70
star
12

deepsensor

A Python package for tackling diverse environmental prediction tasks with NPs.
Python
61
star
13

SHEEP

SHEEP is a Homomorphic Encryption Evaluation Platform
C++
46
star
14

mogp-emulator

Package for fitting Gaussian Process Emulators to multiple output computer simulation results.
Python
44
star
15

TuringDataStories

TuringDataStories: An open community creating “Data Stories”: A mix of open data, code, narrative 💬, visuals 📊📈 and knowledge 🧠 to help understand the world around us.
Jupyter Notebook
40
star
16

SemAIDA

Semantic Technologies for the AIDA project
Python
37
star
17

data-safe-haven

PowerShell
36
star
18

mathematics-of-ml-course

Jupyter Notebook
34
star
19

PDSampler.jl

Piecewise Deterministic Sampler library (Bouncy particle sampler, Zig Zag sampler, ...)
Julia
33
star
20

AutSPACEs

Code respository for AutSPACEs: the Autistica/Turing citizen science platform
Python
33
star
21

grace

Graph Representation Analysis for Connected Embeddings
Jupyter Notebook
33
star
22

ds-ai-educators-programme

The Data Science and AI Educators' Programme
31
star
23

tapas

Python
30
star
24

rds-course

Materials for Turing's Research Data Science course
Jupyter Notebook
28
star
25

ptype

Probabilistic type inference
Jupyter Notebook
28
star
26

AutisticaCitizenScience

Project management and resource repository for the Autistica/Turing Citizen Science project
Ruby
28
star
27

bocpdms

Python
28
star
28

TimeSeriesClassification.jl

Machine Learning with Time Series in Julia
Julia
27
star
29

xpandas

Universal 1d/2d data containers with Transformers functionality for data analysis.
Python
26
star
30

CSV_Wrangling

Repository for reproducibility of the CSV file project
TeX
26
star
31

datadiff

Datadiff is diff for data
R
25
star
32

foundation-models-reading-group

Information and materials for the Turing's Foundation Models reading group.
Jupyter Notebook
25
star
33

SigNet

A package for clustering of Signed Networks
Python
24
star
34

CROP

CROP is a Research Observation Platform
Python
24
star
35

turing-roche-partnership

23
star
36

solar-panel-detection

Solar Panel Detection (Turing Climate Action Call)
Jupyter Notebook
22
star
37

open-research-community-management

Establishing cross-community collaborations and promoting open research in data science
Jupyter Notebook
21
star
38

monitoring-ecosystem-resilience

Repository for mini-projects in the Data science for Sustainable development project
Python
21
star
39

rbocpdms

Robust bayesian online changepoint detection with model selection
Python
21
star
40

ThermodynamicAnalyticsToolkit

Sampling-based approach to analyse neural networks using TensorFlow
Python
21
star
41

signatures-psychiatry

Code from the paper "A signature-based machine learning model for bipolar disorder and borderline personality disorder".
Python
21
star
42

uatk-spc

Synthetic Population Catalyst
Jupyter Notebook
20
star
43

QUIPP-pipeline

Privacy preserving synthetic data generation workflows
Python
20
star
44

AnnotateChange

A simple flask application to collect annotations for the Turing Change Point Dataset, a benchmark dataset for change point detection algorithms
Python
19
star
45

the-turing-way-book

The Turing Way: A Handbook for Reproducible Data Science
CSS
16
star
46

turing-commons

The main repository for the Turing Commons platform
HTML
16
star
47

defoe

Code to analyse books and newspapers data using Apache Spark.
Lex
16
star
48

Palaeoanalytics

Repository for the Paleoanalytics project.
Python
16
star
49

AssurancePlatform

Project to facilitate creation of Assurance Cases
JavaScript
15
star
50

rPSMF

Code for Probabilistic Sequential Matrix Factorization
Python
15
star
51

learning-at-the-turing

The core repository for training materials at the Alan Turing Institute.
13
star
52

autoemulate

emulate simulations easily
Python
13
star
53

HDS-DiscussionGroup

Repo of the Turing's Humanities & Data Science Discussion Group
13
star
54

network-comparison

An R package implementing the NetEMD and NetDis network comparison measures
R
13
star
55

templates

Turing Beamer templates for presentations
TeX
13
star
56

reproducible-project-template

Template repository for setting a reproducible research project.
12
star
57

room2glo

Python
12
star
58

RSE4DataScience18

Repo containing docs and outputs from the RSE4DataScience18 meeting.
12
star
59

affinity-vae

Self-supervised method for disentanglement, clustering and classification of objects in multidimensional image data
Python
12
star
60

research-application-management

11
star
61

advent-of-code-2021

Advent of Code 2021
Racket
11
star
62

SIMple-ID

SIM-based QR-code authentication for basic and feature phones
TeX
11
star
63

notice-board

Community notice board for the Turing Institute
11
star
64

bias-in-AI-course

Jupyter Notebook
11
star
65

stat-fem

Python tools for solving data-constrained finite element problems
Python
11
star
66

ReproducibleResearchResources

This repository contains information to help you make your research reproducible
10
star
67

clim-recal

Open repository of methods for recalibrating & bias correcting UKCP18 climate projections data
HTML
10
star
68

sqlsynthgen

Synthetic data for SQL databases
Python
10
star
69

python-project-template

Python
10
star
70

Turing-RSS-Health-Data-Lab-Biomedical-Acoustic-Markers

Python
9
star
71

professionalising-data-science-roles

Policy Skills Award project with TPS and Skills team - Professionalising traditional and infrastructure research roles in data science
9
star
72

DTBase

A starting point from which digital twins can be developed.
Python
9
star
73

DH-RSE-Summer-School

R
9
star
74

data-training-for-bioscience

Introduction to Data Science Project Management for Project Leaders.
9
star
75

spatial-inequality

Jupyter Notebook
9
star
76

AI-workflows

A collections of portable, real-world AI workflows for testing and benchmarking
Shell
8
star
77

netts

Toolbox for creating networks capturing semantic content of speech transcripts.
Python
8
star
78

p2lab-pokemon

A Python library for running genetic algorithms to optimize Pokemon teams!
Python
8
star
79

hub23-deploy

A repo to manage the Turing BinderHub instance
Python
8
star
80

jbc-turing-rss-nowcasting

A Bayesian model for time-series count data with weekend effects and a lagged reporting process
Jupyter Notebook
8
star
81

mousehole

Quickly deploy a flexible, collaborative environment for working with private data.
HCL
8
star
82

learn-azure

Repository for generalised learning materials on Azure
Python
8
star
83

DSSG19-HomelessLink-PUBLIC

TSQL
8
star
84

cage-challenge-2-public

Team Mindrake's hierarchical RL solution to the second CybORG CAGE challenge.
Python
8
star
85

trustchain

Trustworthy decentralised PKI
Rust
8
star
86

COVID-19_PSTC

Pandemic Symptom Tracker Calendar open code /Symptom tracker open code repository
HTML
8
star
87

Intro-to-transparent-ML-course

An Introduction to Transparent Machine Learning
Jupyter Notebook
8
star
88

branded-overleaf-template

TeX
7
star
89

guard

Simulating Imperial Dynamics and Conflict in the Ancient World
Jupyter Notebook
7
star
90

alexa-room-finder

Lets you find meeting rooms through our Amazon Echo
JavaScript
7
star
91

causal-cyber-defence

This repository contains glue-code necessary to run dynamic Causal Bayesian optimisation within the Yawning Titan cyber-simulation environment.
Jupyter Notebook
7
star
92

DSSG19-Cochrane-PUBLIC

Python
7
star
93

uicc_identity_toolbox

A framework of Java Card applets for enhancing the trustworthiness of DigitalID systems using low-cost basic and feature phone devices.
TeX
7
star
94

pam-aad-oidc

PAM module connecting to AzureAD for user authentication using OpenID Connect/OAuth2.
Go
6
star
95

empiarreader

Reader for EMPIAR datasets
Python
6
star
96

DSSG

meta repository for DSSG projects
6
star
97

REG-handbook

A way of working guide for the Research Engineering Group at The Alan Turing Institute
HTML
6
star
98

reprosyn

Python
6
star
99

Data-Study-Group-

Data Study Group Organisers Hub
HTML
6
star
100

neuro-ai-reading-group

Space to collate materials related to the Neuroscience-AI reading group
6
star