The Alan Turing Institute (@alan-turing-institute)

Top repositories

1

the-turing-way

Host repository for The Turing Way: a how-to guide for reproducible data science
TeX
1,635
star
2

CleverCSV

CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Python
1,145
star
3

AIrsenal

Machine learning Fantasy Premier League team
Jupyter Notebook
289
star
4

distinctipy

A lightweight package for generating visually distinct colours.
Python
236
star
5

ReadabiliPy

A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-Python mode.
HTML
221
star
6

rse-course

Materials for The Alan Turing Institute's Research Software Engineering course
Jupyter Notebook
220
star
7

TCPD

The Turing Change Point Dataset - A collection of time series for the evaluation and development of change point detection algorithms
Python
116
star
8

TCPDBench

The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data
114
star
9

environmental-ds-book

A computational notebook community for open environmental data science 🌎
TeX
95
star
10

scivision

scivision: a framework for scientific image analysis
JavaScript
94
star
11

FootballTournamentPrediction

Predicting results for International men's and women's football tournaments.
Jupyter Notebook
73
star
12

deepsensor

A Python package for tackling diverse environmental prediction tasks with neural processes (NPs).
Python
72
star
13

SHEEP

SHEEP is a Homomorphic Encryption Evaluation Platform
C++
47
star
14

mogp-emulator

Package for fitting Gaussian Process Emulators to multiple output computer simulation results.
Python
47
star
15

TuringDataStories

TuringDataStories: An open community creating “Data Stories”: A mix of open data, code, narrative 💬, visuals 📊📈 and knowledge 🧠 to help understand the world around us.
Jupyter Notebook
39
star
16

mathematics-of-ml-course

Jupyter Notebook
38
star
17

SemAIDA

Semantic Technologies for the AIDA project
Python
37
star
18

AutSPACEs

Code repository for AutSPACEs: the Autistica/Turing citizen science platform
Python
36
star
19

data-safe-haven

PowerShell
36
star
20

grace

Graph Representation Analysis for Connected Embeddings
Jupyter Notebook
34
star
21

PDSampler.jl

Piecewise Deterministic Sampler library (Bouncy particle sampler, Zig Zag sampler, ...)
Julia
33
star
22

ds-ai-educators-programme

The Data Science and AI Educators' Programme
32
star
23

rds-course

Materials for Turing's Research Data Science course
Jupyter Notebook
31
star
24

tapas

Python
31
star
25

robots-in-disguise

Information and materials for the Turing's "robots-in-disguise" reading group on fundamental AI research.
Jupyter Notebook
31
star
26

bocpdms

Python
30
star
27

ptype

Probabilistic type inference
Jupyter Notebook
29
star
28

AutisticaCitizenScience

Project management and resource repository for the Autistica/Turing Citizen Science project
Ruby
29
star
29

TimeSeriesClassification.jl

Machine Learning with Time Series in Julia
Julia
27
star
30

datadiff

Datadiff is diff for data
R
26
star
31

xpandas

Universal 1d/2d data containers with Transformers functionality for data analysis.
Python
26
star
32

CSV_Wrangling

Repository for reproducibility of the CSV file project
TeX
26
star
33

CROP

CROP is a Research Observation Platform
Python
25
star
34

signatures-psychiatry

Code from the paper "A signature-based machine learning model for bipolar disorder and borderline personality disorder".
Python
25
star
35

SigNet

A package for clustering of Signed Networks
Python
24
star
36

open-research-community-management

Establishing cross-community collaborations and promoting open research in data science
Jupyter Notebook
23
star
37

solar-panel-detection

Solar Panel Detection (Turing Climate Action Call)
Jupyter Notebook
23
star
38

turing-roche-partnership

23
star
39

AssurancePlatform

Project to facilitate creation of Assurance Cases
TypeScript
22
star
40

monitoring-ecosystem-resilience

Repository for mini-projects in the Data science for Sustainable development project
Python
22
star
41

rbocpdms

Robust Bayesian online changepoint detection with model selection
Python
22
star
42

ThermodynamicAnalyticsToolkit

Sampling-based approach to analyse neural networks using TensorFlow
Python
22
star
43

QUIPP-pipeline

Privacy preserving synthetic data generation workflows
Python
20
star
44

uatk-spc

Synthetic Population Catalyst
Jupyter Notebook
20
star
45

AnnotateChange

A simple flask application to collect annotations for the Turing Change Point Dataset, a benchmark dataset for change point detection algorithms
Python
19
star
46

prompto

An open source library for asynchronous querying of LLM endpoints
Python
19
star
47

autoemulate

emulate simulations easily
Python
17
star
48

the-turing-way-book

The Turing Way: A Handbook for Reproducible Data Science
CSS
17
star
49

turing-commons

The main repository for the Turing Commons platform
HTML
17
star
50

Palaeoanalytics

Repository for the Palaeoanalytics project.
Python
17
star
51

defoe

Code to analyse books and newspapers data using Apache Spark.
Lex
16
star
52

rPSMF

Code for Probabilistic Sequential Matrix Factorization
Python
15
star
53

network-comparison

An R package implementing the NetEMD and NetDis network comparison measures
R
14
star
54

bias-in-AI-course

Jupyter Notebook
14
star
55

learning-at-the-turing

The core repository for training materials at the Alan Turing Institute.
13
star
56

HDS-DiscussionGroup

Repo of the Turing's Humanities & Data Science Discussion Group
13
star
57

Turing-RSS-Health-Data-Lab-Biomedical-Acoustic-Markers

Python
13
star
58

reproducible-project-template

Template repository for setting up a reproducible research project.
13
star
59

affinity-vae

Self-supervised method for disentanglement, clustering and classification of objects in multidimensional image data
Python
13
star
60

research-application-management

12
star
61

SIMple-ID

SIM-based QR-code authentication for basic and feature phones
TeX
12
star
62

templates

Turing Beamer templates for presentations
TeX
12
star
63

stat-fem

Python tools for solving data-constrained finite element problems
Python
12
star
64

RSE4DataScience18

Repo containing docs and outputs from the RSE4DataScience18 meeting.
12
star
65

advent-of-code-2021

Advent of Code 2021
Racket
11
star
66

professionalising-data-science-roles

Policy Skills Award project with TPS and Skills team - Professionalising traditional and infrastructure research roles in data science
11
star
67

notice-board

Community notice board for the Turing Institute
11
star
68

DTBase

A starting point from which digital twins can be developed.
Python
11
star
69

room2glo

Python
11
star
70

trustchain

Trustworthy decentralised PKI
Rust
11
star
71

python-project-template

Python
11
star
72

Intro-to-transparent-ML-course

An Introduction to Transparent Machine Learning
Jupyter Notebook
11
star
73

sqlsynthgen

Synthetic data for SQL databases
Python
11
star
74

ReproducibleResearchResources

This repository contains information to help you make your research reproducible
10
star
75

clim-recal

Open repository of methods for recalibrating & bias correcting UKCP18 climate projections data
HTML
10
star
76

gnn-reading-group

Public-facing repo for organising activities and archiving material relating to the Graph Neural Network reading group.
Jupyter Notebook
10
star
77

hub23-deploy

A repo to manage the Turing BinderHub instance
Python
9
star
78

empiarreader

Reader for EMPIAR datasets
Python
9
star
79

DH-RSE-Summer-School

R
9
star
80

data-training-for-bioscience

Introduction to Data Science Project Management for Project Leaders.
9
star
81

cage-challenge-2-public

Team Mindrake's hierarchical RL solution to the second CybORG CAGE challenge.
Python
9
star
82

spatial-inequality

Jupyter Notebook
9
star
83

ADViCE

AI for Decarbonisation's Virtual Centre of Excellence
9
star
84

branded-overleaf-template

TeX
8
star
85

guard

Simulating Imperial Dynamics and Conflict in the Ancient World
Jupyter Notebook
8
star
86

DSSG19-Cochrane-PUBLIC

Python
8
star
87

netts

Toolbox for creating networks capturing semantic content of speech transcripts.
Python
8
star
88

p2lab-pokemon

A Python library for running genetic algorithms to optimize Pokemon teams!
Python
8
star
89

AI-workflows

A collection of portable, real-world AI workflows for testing and benchmarking
Shell
8
star
90

jbc-turing-rss-nowcasting

A Bayesian model for time-series count data with weekend effects and a lagged reporting process
Jupyter Notebook
8
star
91

mousehole

Quickly deploy a flexible, collaborative environment for working with private data.
HCL
8
star
92

DSSG19-HomelessLink-PUBLIC

TSQL
8
star
93

learn-azure

Repository for generalised learning materials on Azure
Python
8
star
94

uicc_identity_toolbox

A framework of Java Card applets for enhancing the trustworthiness of DigitalID systems using low-cost basic and feature phone devices.
TeX
8
star
95

neuro-ai-reading-group

Space to collate materials related to the Neuroscience-AI reading group
8
star
96

COVID-19_PSTC

Pandemic Symptom Tracker Calendar open code / Symptom tracker open code repository
HTML
8
star
97

alexa-room-finder

Lets you find meeting rooms through our Amazon Echo
JavaScript
7
star
98

causal-cyber-defence

This repository contains glue-code necessary to run dynamic Causal Bayesian optimisation within the Yawning Titan cyber-simulation environment.
Jupyter Notebook
7
star
99

pam-aad-oidc

PAM module connecting to AzureAD for user authentication using OpenID Connect/OAuth2.
Go
6
star
100

reprosyn

Python
6
star