Machine Learning and Systems Biology Research Department (@BorgwardtLab)

Top repositories

1

SAT

Official Pytorch code for Structure-Aware Transformer.
Python
236
star
2

topological-autoencoders

Code for the paper "Topological Autoencoders" by Michael Moor, Max Horn, Bastian Rieck, and Karsten Borgwardt.
Python
137
star
3

Set_Functions_for_Time_Series

Repository of the ICML 2020 paper "Set Functions for Time Series"
Python
120
star
4

TOGL

Topological Graph Neural Networks (ICLR 2022)
Python
112
star
5

GraphKernels

A package for computing Graph Kernels
C++
100
star
6

proteinshake

Protein structure datasets for machine learning.
Python
99
star
7

WWL

Wasserstein Weisfeiler-Lehman Graph Kernels
Python
77
star
8

mgp-tcn

Sepsis Prediction on MIMIC
Python
66
star
9

P-WL

A Persistent Weisfeiler–Lehman Procedure for Graph Classification
Python
60
star
10

graph-kernels

Graph kernels
C++
56
star
11

S3M

A software package for statistically significant shapelet mining
C++
52
star
12

sampling-outlier-detection

Rapid computation of distance-based outlierness scores via sampling
C
33
star
13

maldi_amr

Code for the paper "Antimicrobial resistance prediction in clinical isolates through machine learning on MALDI-TOF mass spectra"
Jupyter Notebook
30
star
14

PST

Protein Structure Transformer (PST): Endowing pretrained protein language models with structural knowledge
Python
30
star
15

Neural-Persistence

Code for the paper 'Neural Persistence: A Complexity Measure for Deep Neural Networks Using Algebraic Topology'
Python
29
star
16

maldi-learn

Software library for Maldi-Tof preprocessing and machine learning analysis.
Python
28
star
17

PyChange

Multiple change detection with python
C++
22
star
18

WTK

A Wasserstein Subsequence Kernel for Time Series.
Python
21
star
19

JointMDS

Official implementation of Joint Multidimensional Scaling
Python
21
star
20

fMRI_Cubical_Persistence

Code of our NeurIPS 2020 publication 'Uncovering the Topology of Time-Varying fMRI Data using Cubical Persistence'
Python
20
star
21

filtration_curves

Code for the KDD 2021 paper 'Filtration Curves for Graph Representation'
Python
18
star
22

fisher_information_embedding

Official code for Fisher information embedding for node and graph learning (ICML 2023)
Python
17
star
23

NeuralWalker

Official Pytorch implementation of NeuralWalker
Python
17
star
24

MvKDR

Multi-view Spectral Clustering on Conflicting Views
Python
15
star
25

Kernelized-Rank-Learning

Kernelized rank learning for personalized drug recommendation
Jupyter Notebook
15
star
26

ggme

Official repository for the ICLR 2022 paper "Evaluation Metrics for Graph Generative Models: Problems, Pitfalls, and Practical Solutions" https://openreview.net/forum?id=tBtoZYKd9n
Python
14
star
27

HOGImine

Higher-order genetic interaction discovery with network-based biological priors.
C++
13
star
28

MotiFiesta

Approximate subgraph motif mining through learnable edge contraction.
Python
12
star
29

networkGWAS

Method for performing genome-wide association like studies on neighborhoods identified on biological networks relevant for the phenotype of interest.
Python
12
star
30

multicenter-sepsis

Python
11
star
31

MID

MID (Mutual Information Dimension) for measuring statistical dependence between two random variables
C
11
star
32

graphkernels-review

Code and data sets for the review on graph kernels
Python
10
star
33

reComBat

reComBat package to correct batch effects
Jupyter Notebook
10
star
34

maldi_PIKE

Code for 'Topological and kernel-based microbial phenotype prediction from MALDI-TOF mass spectra'
Python
9
star
35

proteinshake_models

Python
8
star
36

topo-ae-distances

Python
8
star
37

CAsMap

Detection of statistically significant combinations of SNPs in association mapping
C++
7
star
38

significant-subgraph-mining

Finding statistically significant subgraphs while correcting for multiple testing
C++
7
star
39

Kernel-Conditional-Clustering

Kernel Conditional Clustering
MATLAB
7
star
40

ADNI_3DCNNvsTDA

This is a summary of the model code used in "Back to the basics with inclusion of clinical domain knowledge - A simple, scalable and effective model of Alzheimer's Disease classification". It comprised the relevant 3D CNNs for hippocampus, patch and full inner brain image subsets, the TDA 2D CNN with relevant dense models to combine models trained on persistence images from different homological dimensions. Moreover, the models (GNN and LR) to combine multiple image patches are included, as well as the data splits in terms of ADNI database patient IDs (partitions).
Python
7
star
41

2019-06-Machine-Learning-for-Biology

Introduction to Machine Learning for Biology (Workshop @ D-BSSE Retreat 2019)
TeX
7
star
42

SiNIMin

Significant Network Interval Mining
C++
6
star
43

LSH-WTK

Locality-Sensitive Hashing for the Wasserstein Time Series Kernel.
Python
6
star
44

LMM-Lasso

An implementation of the Lasso model for association mapping and phenotype prediction which corrects for population strucure (Rakitsch et al., Bioinformatics 2013): http://goo.gl/FRmXwI
Python
6
star
45

sepsis-prediction-review

Python
5
star
46

uea_ucr_datasets

A small package for loading and handling UEA UCR time series classification datasets.
Python
5
star
47

GP-PoM

TypeScript
5
star
48

graphhopper-kernels

Scalable kernels for graphs with continuous attributes (Feragen et al., NIPS 2013) http://goo.gl/VxSfzZ
MATLAB
5
star
49

ARDISS

Automatic Relevance Determination for Imputation of Summary Statistics
Python
4
star
50

Topf

Topological peak filtering
Python
4
star
51

Epistasis-GLIDE

A C and CUDA implementation of tabulating linear regression for an exhaustive pairwise interaction search on a CUDA enabled GPU (Kam-Thong et al., Human Heredity 2012) http://goo.gl/XE54ir
Cuda
4
star
52

simbsig

The official implementation for the SIMBSIG package
Python
3
star
53

FindComb

Scientifica app for finding the most significant combinations of features
Python
3
star
54

Multi-SConES

Multi-task feature selection coupled with multiple network regularizers
R
3
star
55

Imputing_Signatures

Python
2
star
56

PheGeMIL

Python
2
star
57

ccSVM

Confounder-corrected Classification with Support Vector Machines (Li et al., Bioinformatics 2011) http://goo.gl/Qz9Ap5
C++
2
star
58

DeepEST

Python
2
star
59

proteinshake_release

Python
2
star
60

MODS-recovery

Codes for prediction of the recovery of pediatric sepsis patients with MODS
Python
2
star
61

GraphMatchingSubstitutionMatrices

Code and Data for the paper: Structure- and Function-Aware Substitution Matrices via Learnable Graph Matching (RECOMB 2024 & ICML 2024 Differentiable Almost Everything Workshop)
Python
2
star
62

SignificantPatternMiningFDR

Code and Data for the paper: FASM and FAST-YB: Significant Pattern Mining with False Discovery Rate Control (ICDM 2023).
C
2
star
63

sc-autoencoding

student internship of Simon Streib to reduce single cell data
Python
2
star
64

Epistasis-lightbulb

Efficient algorithms and GPU implementations for genome-wide epistasis screens as described in (Achlioptas et al., KDD 2011) http://goo.gl/jX8kPi
Python
2
star
65

batchCorrectionPublicData

Summary of the code published in 'reComBat: Batch effect removal in large-scale, multi-source omics data integration'.
Python
1
star
66

biobank_genomics

Shell
1
star
67

LongCOVID

Prediction of long COVID from proteomic and clinical data
Python
1
star
68

gene-representations-in-networks

A systematic evaluation of gene representations in network based genetic analysis
Python
1
star
69

homebrew-mlcb

Homebrew taps of the Machine Learning and Computational Biology group of Prof. Karsten Borgwardt
Ruby
1
star
70

glide-scripts

Companion scripts to the GLIDE software
Python
1
star
71

SCIRecoveryPredictionPublic

We provide a simple matching algorithm to identify digital twins for spinal cord injury patients in the acute injury phase.
Python
1
star
72

MoProEmbeddings

Implementation of moment propagation embeddings.
Python
1
star