🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
This curated list contains 910 awesome open-source projects with a total of 4M stars, grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated from various metrics automatically collected from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
Contents
- Machine Learning Frameworks 60 projects
- Data Visualization 54 projects
- Text Data & NLP 101 projects
- Image Data 64 projects
- Graph Data 36 projects
- Audio Data 29 projects
- Geospatial Data 22 projects
- Financial Data 25 projects
- Time Series Data 29 projects
- Medical Data 19 projects
- Tabular Data 5 projects
- Optical Character Recognition 12 projects
- Data Containers & Structures 1 project
- Data Loading & Extraction 1 project
- Web Scraping & Crawling 1 project
- Data Pipelines & Streaming 1 project
- Distributed Machine Learning 36 projects
- Hyperparameter Optimization & AutoML 52 projects
- Reinforcement Learning 23 projects
- Recommender Systems 17 projects
- Privacy Machine Learning 7 projects
- Workflow & Experiment Tracking 39 projects
- Model Serialization & Deployment 20 projects
- Model Interpretability 54 projects
- Vector Similarity Search (ANN) 13 projects
- Probabilistics & Statistics 23 projects
- Adversarial Robustness 9 projects
- GPU & Accelerator Utilities 20 projects
- Tensorflow Utilities 16 projects
- Jax Utilities 3 projects
- Sklearn Utilities 19 projects
- Pytorch Utilities 32 projects
- Database Clients 1 project
- Others 66 projects
Explanation
- 🥇🥈🥉 Combined project-quality score
- ⭐️ Star count from GitHub
- 🐣 New project (less than 6 months old)
- 💤 Inactive project (6 months no activity)
- 💀 Dead project (12 months no activity)
- 📈📉 Project is trending up or down
- ➕ Project was recently added
- ❗️ Warning (e.g. missing/risky license)
- 👨💻 Contributors count from GitHub
- 🔀 Fork count from GitHub
- 📋 Issue count from GitHub
- ⏱️ Last update timestamp on package manager
- 📥 Download count from package manager
- 📦 Number of dependent projects
- Tensorflow related project
- Sklearn related project
- PyTorch related project
- MxNet related project
- Apache Spark related project
- Jupyter related project
- PaddlePaddle related project
- Pandas related project
- Jax related project
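The age and activity badges in the legend follow simple time thresholds. As a rough sketch only (the function name `activity_badges`, its parameters, and the 30-day month approximation are assumptions made for illustration, not the generator's actual implementation), the rules could be expressed like this:

```python
from datetime import date, timedelta
from typing import List

# Assumption: a "month" is approximated as 30 days; the list's generator
# may count months differently.
DAYS_PER_MONTH = 30


def activity_badges(created: date, last_activity: date, today: date) -> List[str]:
    """Return the legend badges implied by a project's age and idle time."""
    badges = []
    age = today - created
    idle = today - last_activity

    # 🐣 New project: less than 6 months old.
    if age < timedelta(days=6 * DAYS_PER_MONTH):
        badges.append("🐣")

    # 💀 Dead project: 12 months without activity.
    # 💤 Inactive project: 6 months without activity.
    if idle >= timedelta(days=12 * DAYS_PER_MONTH):
        badges.append("💀")
    elif idle >= timedelta(days=6 * DAYS_PER_MONTH):
        badges.append("💤")

    return badges
```

For example, a project created years ago whose last activity was eight months before `today` would receive only the 💤 badge under these assumptions.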
Machine Learning Frameworks
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇 55 · ⭐ 180K) - An Open Source Machine Learning Framework for Everyone. Apache-2
- GitHub (👨💻 4.4K · 🔀 72K · 📦 290K · 📋 37K - 5% open · ⏱️ 29.06.2023):
  git clone https://github.com/tensorflow/tensorflow
- PyPi (📥 16M / month):
  pip install tensorflow
- Conda (📥 4.2M · ⏱️ 18.06.2023):
  conda install -c conda-forge tensorflow
- Docker Hub (📥 73M · ⭐ 2.2K · ⏱️ 29.06.2023):
  docker pull tensorflow/tensorflow
scikit-learn (🥇 52 · ⭐ 55K) - scikit-learn: machine learning in Python. BSD-3
StatsModels (🥇45 · ⭐ 8.6K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
XGBoost (🥈43 · ⭐ 24K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
LightGBM (🥈43 · ⭐ 15K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
PaddlePaddle (🥈42 · ⭐ 20K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
Jina (🥈 39 · ⭐ 19K) - Build multimodal AI services via cloud native technologies. Apache-2
- GitHub (👨💻 170 · 🔀 2.1K · 📦 720 · 📋 1.9K - 0% open · ⏱️ 29.06.2023):
  git clone https://github.com/jina-ai/jina
- PyPi (📥 390K / month · 📦 13 · ⏱️ 06.07.2022):
  pip install jina
- Conda (📥 49K · ⏱️ 16.06.2023):
  conda install -c conda-forge jina-core
- Docker Hub (📥 1.3M · ⭐ 8 · ⏱️ 29.06.2023):
  docker pull jinaai/jina
pytorch-lightning (🥈 38 · ⭐ 24K) - Deep learning framework to train, deploy, and ship AI.. Apache-2
Theano (🥈 38 · ⭐ 9.7K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
Vowpal Wabbit (🥈 37 · ⭐ 8.2K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
Catboost (🥈 35 · ⭐ 7.2K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
Thinc (🥈 35 · ⭐ 2.7K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
tensorpack (🥉32 · ⭐ 6.3K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
tensorflow-upstream (🥉 31 · ⭐ 650) - TensorFlow ROCm port. Apache-2
Neural Tangents (🥉27 · ⭐ 2K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
Neural Network Libraries (🥉26 · ⭐ 2.6K) - Neural Network Libraries. Apache-2
xLearn (🥉25 · ⭐ 3.1K · 💤) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
ThunderSVM (🥉20 · ⭐ 1.5K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
ThunderGBM (🥉 17 · ⭐ 670 · 💤 ) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
chefboost (🥉17 · ⭐ 400) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
Show 14 hidden projects...
- dlib (🥈 40 · ⭐ 12K) - A toolkit for making real world machine learning and data analysis.. ❗️BSL-1.0
- MindsDB (🥈 34 · ⭐ 17K) - MindsDB is a Server for Artificial Intelligence Logic. Enabling.. ❗️GPL-3.0
- Turi Create (🥈 33 · ⭐ 11K · 💀) - Turi Create simplifies the development of custom machine.. BSD-3
- TFlearn (🥉 32 · ⭐ 9.6K · 💀) - Deep learning library featuring a higher-level API for TensorFlow. MIT
- mlpack (🥉 30 · ⭐ 4.4K) - mlpack: a fast, header-only C++ machine learning library. ❗Unlicensed
- NuPIC (🥉 28 · ⭐ 6.3K · 💀) - Numenta Platform for Intelligent Computing is an implementation.. ❗️AGPL-3.0
- Lasagne (🥉 28 · ⭐ 3.8K · 💀) - Lightweight library to build and train neural networks in Theano. MIT
- SHOGUN (🥉 27 · ⭐ 3K · 💀) - Unified and efficient Machine Learning. BSD-3
- CNTK (🥉 26 · ⭐ 17K · 💤) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning.. ❗Unlicensed
- mace (🥉 23 · ⭐ 4.8K · 💀) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
- neon (🥉 23 · ⭐ 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best.. Apache-2
- Torchbearer (🥉 21 · ⭐ 630 · 💀) - torchbearer: A model fitting library for PyTorch. MIT
- elegy (🥉 18 · ⭐ 450 · 💀) - A High Level API for Deep Learning in JAX. MIT
- StarSpace (🥉 16 · ⭐ 3.9K · 💀) - Learning embeddings for classification, retrieval and ranking. MIT
Data Visualization
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇 48 · ⭐ 18K) - matplotlib: plotting with Python. ❗Unlicensed
pandas-profiling (🥇 37 · ⭐ 11K) - 1 Line of code data quality profiling & exploratory.. MIT
Plotly (🥈 36 · ⭐ 14K · 📉 ) - The interactive graphing library for Python This project now includes.. MIT
datashader (🥈 33 · ⭐ 3K) - Quickly and accurately render even the largest data. BSD-3
D-Tale (🥈 31 · ⭐ 4.1K) - Visualizer for pandas data structures. ❗️LGPL-2.1
bqplot (🥈 30 · ⭐ 3.4K) - Plotting library for IPython/Jupyter notebooks. Apache-2
- GitHub (👨💻 62 · 🔀 470 · 📦 43 · 📋 610 - 40% open · ⏱️ 11.04.2023):
  git clone https://github.com/bqplot/bqplot
- PyPi (📥 130K / month · 📦 100 · ⏱️ 02.09.2022):
  pip install bqplot
- Conda (📥 1.2M · ⏱️ 16.06.2023):
  conda install -c conda-forge bqplot
- npm (📥 3.2K / month · 📦 14 · ⏱️ 11.04.2023):
  npm install bqplot
hvPlot (🥈 30 · ⭐ 770) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
Facets Overview (🥉 28 · ⭐ 7.1K) - Visualizations for machine learning datasets. Apache-2
Perspective (🥉 28 · ⭐ 6.4K) - A data visualization and analytics component, especially.. Apache-2
- GitHub (👨💻 89 · 🔀 740 · 📦 11 · 📋 640 - 13% open · ⏱️ 28.06.2023):
  git clone https://github.com/finos/perspective
- PyPi (📥 5.7K / month):
  pip install perspective-python
- Conda (📥 390K · ⏱️ 06.06.2023):
  conda install -c conda-forge perspective
- npm (📥 1.1K / month):
  npm install @finos/perspective-jupyterlab
data-validation (🥉 28 · ⭐ 720) - Library for exploring and validating machine learning.. Apache-2
AutoViz (🥉 27 · ⭐ 1.3K) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
Plotly-Resampler (🥉 26 · ⭐ 760) - Visualize large time series data with plotly.py. MIT
Chartify (🥉25 · ⭐ 3.4K) - Python library that makes it easy for data scientists to create.. Apache-2
Multicore-TSNE (🥉 25 · ⭐ 1.8K) - Parallel t-SNE implementation with Python and Torch.. BSD-3
Pandas-Bokeh (🥉24 · ⭐ 850) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
python-ternary (🥉 24 · ⭐ 640) - Ternary plotting library for python with matplotlib. MIT
Sweetviz (🥉22 · ⭐ 2.4K · 💤) - Visualize and compare datasets, target values and associations, with.. MIT
Show 16 hidden projects...
- VisPy (🥈 33 · ⭐ 3.1K) - High-performance interactive 2D/3D data visualization library. ❗Unlicensed
- cartopy (🥈 31 · ⭐ 1.2K) - Cartopy - a cartographic python library with matplotlib support. ❗️LGPL-3.0
- Cufflinks (🥉 29 · ⭐ 2.8K · 💀) - Productivity Tools for Plotly + Pandas. MIT
- PyQtGraph (🥉 28 · ⭐ 3.3K · 📉) - Fast data visualization and GUI tools for scientific /.. ❗Unlicensed
- PandasGUI (🥉 25 · ⭐ 3K · 💀) - A GUI for Pandas DataFrames. ❗️MIT-0
- HyperTools (🥉 25 · ⭐ 1.8K · 💀) - A Python toolbox for gaining geometric insights into high-.. MIT
- pythreejs (🥉 25 · ⭐ 880) - A Jupyter - Three.js bridge. ❗Unlicensed
- joypy (🥉 22 · ⭐ 490 · 💀) - Joyplots in Python with matplotlib & pandas. MIT
- vegafusion (🥉 22 · ⭐ 240) - Serverside scaling for Vega and Altair visualizations. BSD-3
- pivottablejs (🥉 21 · ⭐ 570 · 💀) - Drag'n'drop Pivot Tables and Charts for Jupyter/IPython.. ❗Unlicensed
- animatplot (🥉 19 · ⭐ 400 · 💀) - A python package for animating plots built on matplotlib. MIT
- ivis (🥉 19 · ⭐ 300) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
- data-describe (🥉 17 · ⭐ 300 · 💀) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
- pdvega (🥉 16 · ⭐ 340 · 💀) - Interactive plotting for Pandas using Vega-Lite. MIT
- nx-altair (🥉 13 · ⭐ 210 · 💀) - Draw interactive NetworkX graphs with Altair. MIT
- nptsne (🥉 10 · ⭐ 31 · 💀) - nptsne is a numpy compatible python binary package that offers a.. Apache-2
Text Data & NLP
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
transformers (🥇50 · ⭐ 110K) - Transformers: State-of-the-art Machine Learning for.. Apache-2
nltk (🥇 41 · ⭐ 12K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
gensim (🥇40 · ⭐ 14K) - Topic Modelling for Humans. ❗️LGPL-2.1
sentencepiece (🥇37 · ⭐ 7.7K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
haystack (🥇 36 · ⭐ 9.3K) - Haystack is an open source NLP framework to interact with your data.. Apache-2
sentence-transformers (🥇 35 · ⭐ 11K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2
TextBlob (🥇35 · ⭐ 8.6K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
TensorFlow Text (🥇35 · ⭐ 1.1K) - Making text a first-class citizen in TensorFlow. Apache-2
Tokenizers (🥈 33 · ⭐ 7.2K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
DeepPavlov (🥈 33 · ⭐ 6.3K) - An open source library for deep learning end-to-end dialog.. Apache-2
SciSpacy (🥈 31 · ⭐ 1.4K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
spacy-transformers (🥈 31 · ⭐ 1.3K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
snowballstemmer (🥈31 · ⭐ 670) - Snowball compiler and stemming algorithms. BSD-3
jellyfish (🥈30 · ⭐ 1.9K) - a python library for doing approximate and phonetic matching of strings. MIT
english-words (🥈29 · ⭐ 9.1K · 💤) - A text file containing 479k English words for all your.. Unlicense
TextDistance (🥈 28 · ⭐ 3.1K · 💤 ) - Compute distance between sequences. 30+ algorithms, pure.. MIT
fastNLP (🥈28 · ⭐ 3K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
Ciphey (🥈 27 · ⭐ 14K) - Automatically decrypt encryptions without knowing the key or cipher, decode.. MIT
- GitHub (👨💻 47 · 🔀 800 · 📋 310 - 15% open · ⏱️ 05.12.2022):
  git clone https://github.com/Ciphey/Ciphey
- PyPi (📥 31K / month):
  pip install ciphey
- Docker Hub (📥 19K · ⭐ 14 · ⏱️ 10.03.2023):
  docker pull remnux/ciphey
qdrant (🥉26 · ⭐ 11K) - Qdrant - Vector Database for the next generation of AI applications... Apache-2
- GitHub (👨💻 57 · 🔀 560 · 📥 280 · 📋 590 - 16% open · ⏱️ 23.06.2023):
  git clone https://github.com/qdrant/qdrant
OpenPrompt (🥉 26 · ⭐ 3.5K) - An Open-Source Framework for Prompt-Learning. Apache-2
PyTextRank (🥉26 · ⭐ 2K · 💤) - Python implementation of TextRank algorithms (textgraphs) for.. MIT
scattertext (🥉25 · ⭐ 2.1K) - Beautiful visualizations of how language differs among document.. Apache-2
promptsource (🥉 25 · ⭐ 1.9K) - Toolkit for creating, sharing and using natural language.. Apache-2
Texthero (🥉23 · ⭐ 2.7K · 💤) - Text preprocessing, representation and visualization from zero to.. MIT
YouTokenToMe (🥉 23 · ⭐ 890) - Unsupervised text tokenizer focused on computational efficiency. MIT
lightseq (🥉21 · ⭐ 2.8K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
small-text (🥉 21 · ⭐ 460) - Active Learning for Text Classification in Python. MIT
NLP Architect (🥉20 · ⭐ 2.9K · 💤) - A model library for exploring state-of-the-art deep.. Apache-2
happy-transformer (🥉 20 · ⭐ 430) - A package built on top of Hugging Face's transformers.. Apache-2 huggingface
textaugment (🥉18 · ⭐ 320) - TextAugment: Text Augmentation Library. MIT
OpenNRE (🥉 17 · ⭐ 4K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
- GitHub (👨💻 12 · 🔀 970 · 📋 360 - 2% open · ⏱️ 03.01.2023):
  git clone https://github.com/thunlp/OpenNRE
TextBox (🥉17 · ⭐ 990) - TextBox 2.0 is a text generation library with pre-trained language models. MIT
Show 39 hidden projects...
- flair (🥇 37 · ⭐ 13K) - A very simple framework for state-of-the-art Natural Language.. ❗Unlicensed
- ChatterBot (🥇 35 · ⭐ 13K · 💀) - ChatterBot is a machine learning, conversational dialog engine.. BSD-3
- fuzzywuzzy (🥈 34 · ⭐ 8.9K · 💀) - Fuzzy String Matching in Python. ❗️GPL-2.0
- textacy (🥈 31 · ⭐ 2.1K) - NLP, before and after spaCy. ❗Unlicensed
- stanza (🥈 30 · ⭐ 6.7K) - Official Stanford NLP Python Library for Many Human Languages. ❗Unlicensed
- vaderSentiment (🥈 28 · ⭐ 4K · 💀) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary and.. MIT
- underthesea (🥈 27 · ⭐ 1.2K) - Underthesea - Vietnamese NLP Toolkit. ❗️GPL-3.0
- flashtext (🥉 26 · ⭐ 5.4K · 💀) - Extract Keywords from sentence or Replace keywords in sentences. MIT
- neuralcoref (🥉 26 · ⭐ 2.7K · 💀) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
- pytorch-nlp (🥉 25 · ⭐ 2.2K · 💀) - Basic Utilities for PyTorch Natural Language Processing.. BSD-3
- whoosh (🥉 25 · ⭐ 430 · 💀) - Pure-Python full-text search library. ❗️BSD-1-Clause
- textgenrnn (🥉 24 · ⭐ 4.9K · 💀) - Easily train your own text-generating neural network of any.. MIT
- Snips NLU (🥉 24 · ⭐ 3.8K · 💀) - Snips Python library to extract meaning from text. Apache-2
- polyglot (🥉 24 · ⭐ 2.2K · 💀) - Multilingual text (NLP) processing toolkit. ❗Unlicensed
- langid (🥉 24 · ⭐ 2.1K · 💀) - Stand-alone language identification system. ❗Unlicensed
- pySBD (🥉 24 · ⭐ 630 · 💀) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
- MatchZoo (🥉 23 · ⭐ 3.8K · 💀) - Facilitating the design, comparison and sharing of deep.. Apache-2
- Kashgari (🥉 23 · ⭐ 2.4K · 💀) - Kashgari is a production-level NLP Transfer learning.. Apache-2
- gpt-2-simple (🥉 22 · ⭐ 3.3K · 💀) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT
- Texar (🥉 22 · ⭐ 2.4K · 💀) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
- stop-words (🥉 22 · ⭐ 150 · 💀) - Get list of common stop words in various languages in Python. BSD-3
- DELTA (🥉 21 · ⭐ 1.6K · 💀) - DELTA is a deep learning based natural language and speech.. Apache-2
- anaGo (🥉 21 · ⭐ 1.5K · 💀) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
- pyfasttext (🥉 20 · ⭐ 230 · 💀) - Yet another Python binding for fastText. ❗️GPL-3.0
- fastT5 (🥉 19 · ⭐ 480 · 💀) - boost inference speed of T5 models by 5x & reduce the model size.. Apache-2
- Camphr (🥉 19 · ⭐ 340 · 💀) - Camphr - NLP library for creating pipeline components. Apache-2 spacy
- numerizer (🥉 19 · ⭐ 200) - A Python module to convert natural language numerics into ints and.. MIT
- DeepMatcher (🥉 18 · ⭐ 490 · 💀) - Python package for performing Entity and Text Matching using.. BSD-3
- nboost (🥉 17 · ⭐ 660 · 💀) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
- textpipe (🥉 17 · ⭐ 300 · 💀) - Textpipe: clean and extract metadata from text. MIT
- skift (🥉 17 · ⭐ 230 · 💤) - scikit-learn wrappers for Python fastText. MIT
- NeuroNER (🥉 16 · ⭐ 1.7K · 💀) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
- BLINK (🥉 15 · ⭐ 1.1K · 💀) - Entity Linker solution. MIT
- NeuralQA (🥉 15 · ⭐ 220 · 💀) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
- spacy-dbpedia-spotlight (🥉 14 · ⭐ 88) - A spaCy wrapper for DBpedia Spotlight. MIT spacy
- TransferNLP (🥉 13 · ⭐ 290 · 💀) - NLP library designed for reproducible experimentation.. MIT
- ONNX-T5 (🥉 13 · ⭐ 230 · 💀) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2
- textvec (🥉 13 · ⭐ 190 · 💤) - Text vectorization tool to outperform TFIDF for classification.. MIT
- Headliner (🥉 10 · ⭐ 230 · 💀) - Easy training and deployment of seq2seq models. ❗Unlicensed
Image Data
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
torchvision (🥇42 · ⭐ 14K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
PyTorch Image Models (🥇 37 · ⭐ 26K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
InsightFace (🥇 37 · ⭐ 16K) - State-of-the-art 2D and 3D Face Analysis Project. MIT
Face Recognition (🥈 35 · ⭐ 49K · 💤) - The world's simplest facial recognition api for Python.. MIT
MMDetection (🥈 35 · ⭐ 25K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
detectron2 (🥈34 · ⭐ 25K) - Detectron2 is a platform for object detection, segmentation.. Apache-2
Albumentations (🥈 34 · ⭐ 12K) - Fast image augmentation library and an easy-to-use wrapper.. MIT
PaddleDetection (🥈32 · ⭐ 11K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2
vit-pytorch (🥈29 · ⭐ 15K) - Implementation of Vision Transformer, a simple way to achieve.. MIT
imageai (🥈29 · ⭐ 7.9K) - A python library built to empower developers to build applications and.. MIT
Face Alignment (🥈 29 · ⭐ 6.4K) - 2D and 3D Face alignment library built using pytorch. BSD-3
facenet-pytorch (🥈29 · ⭐ 3.6K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT
sahi (🥈29 · ⭐ 2.8K) - Framework agnostic sliced/tiled inference + interactive ui + error analysis.. MIT
opencv-python (🥉28 · ⭐ 3.6K · 📉) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
vidgear (🥉 28 · ⭐ 2.8K) - A High-performance cross-platform Video Processing Python framework.. Apache-2
layout-parser (🥉26 · ⭐ 3.7K · 💤) - A Unified Toolkit for Deep Learning Based Document Image.. Apache-2
Norfair (🥉26 · ⭐ 1.9K) - Lightweight Python library for adding real-time multi-object tracking.. BSD-3
Image Deduplicator (🥉25 · ⭐ 4.6K) - Finding duplicate images made easy!. Apache-2
segmentation_models (🥉25 · ⭐ 4.4K · 💤) - Segmentation models with pretrained backbones. Keras.. MIT
pytorchvideo (🥉25 · ⭐ 2.9K) - A deep learning library for video understanding research. Apache-2
Classy Vision (🥉 24 · ⭐ 1.6K) - An end-to-end PyTorch framework for image and video.. MIT
icevision (🥉 24 · ⭐ 820) - An Agnostic Computer Vision Framework - Pluggable to any Training.. Apache-2
tensorflow-graphics (🥉 22 · ⭐ 2.7K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
PySlowFast (🥉21 · ⭐ 5.7K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
image-match (🥉 20 · ⭐ 2.9K) - Quickly search over billions of images. Apache-2
DE⫶TR (🥉 19 · ⭐ 11K) - End-to-End Object Detection with Transformers. Apache-2
- GitHub (👨💻 26 · 🔀 2K · 📋 490 - 42% open · ⏱️ 07.02.2023):
  git clone https://github.com/facebookresearch/detr
scenic (🥉19 · ⭐ 2.3K) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2
- GitHub (👨💻 65 · 🔀 330 · 📋 220 - 67% open · ⏱️ 27.06.2023):
  git clone https://github.com/google-research/scenic
Show 18 hidden projects...
- scikit-image (🥇 42 · ⭐ 5.5K) - Image processing in Python. ❗Unlicensed
- imgaug (🥈 35 · ⭐ 14K · 💀) - Image augmentation for machine learning experiments. MIT
- glfw (🥈 34 · ⭐ 11K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. ❗️Zlib
- imutils (🥈 31 · ⭐ 4.4K · 💀) - A series of convenience functions to make basic image processing.. MIT
- PyTorch3D (🥈 30 · ⭐ 7.4K) - PyTorch3D is FAIR's library of reusable components for.. ❗Unlicensed
- chainercv (🥉 27 · ⭐ 1.5K · 💀) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
- mtcnn (🥉 26 · ⭐ 2K · 💀) - MTCNN face detection implementation for TensorFlow, as a PIP package. MIT
- CellProfiler (🥉 24 · ⭐ 780) - An open-source application for biological image analysis. ❗Unlicensed
- Image Super-Resolution (🥉 23 · ⭐ 4.2K · 💀) - Super-scale your images and run experiments with.. Apache-2
- Luminoth (🥉 23 · ⭐ 2.4K · 💀) - Deep Learning toolkit for Computer Vision. BSD-3
- deep-daze (🥉 22 · ⭐ 4.4K · 💀) - Simple command line tool for text to image generation using.. MIT
- Pillow-SIMD (🥉 22 · ⭐ 2K) - The friendly PIL fork. ❗️PIL
- nude.py (🥉 21 · ⭐ 910 · 💀) - Nudity detection with Python. MIT
- detecto (🥉 21 · ⭐ 590 · 💀) - Build fully-functioning computer vision models with PyTorch. MIT
- Caer (🥉 18 · ⭐ 690 · 💀) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
- solt (🥉 18 · ⭐ 260 · 💤) - Streaming over lightweight data transformations. MIT
- HugsVision (🥉 16 · ⭐ 180) - HugsVision is an easy to use huggingface wrapper for state-of-the-.. MIT huggingface
- Torch Points 3D (🥉 16 · ⭐ 150 · 💀) - Pytorch framework for doing deep learning on point.. BSD-3
Graph Data
Libraries for graph processing, clustering, embedding, and machine learning tasks.
dgl (🥇 37 · ⭐ 12K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
PyTorch Geometric (🥇 35 · ⭐ 18K · 📉 ) - Graph Neural Network Library for PyTorch. MIT
ogb (🥈31 · ⭐ 1.7K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
pygraphistry (🥈 29 · ⭐ 1.9K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3
Paddle Graph Learning (🥈 28 · ⭐ 1.5K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
PyKEEN (🥈 25 · ⭐ 1.3K) - A Python library for learning and evaluating knowledge graph embeddings. MIT
pytorch_geometric_temporal (🥈 24 · ⭐ 2.1K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT
AmpliGraph (🥈24 · ⭐ 2K) - Python library for Representation Learning on Knowledge.. Apache-2
torch-cluster (🥉21 · ⭐ 660) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
graph-nets (🥉18 · ⭐ 5.3K) - Build Graph Nets in Tensorflow. Apache-2
GraphEmbedding (🥉 16 · ⭐ 3.3K · 💤 ) - Implementation and experiments of graph embedding.. MIT
- GitHub (👨💻 9 · 🔀 940 · 📦 28 · 📋 64 - 60% open · ⏱️ 21.06.2022):
  git clone https://github.com/shenweichen/GraphEmbedding
kglib (🥉 16 · ⭐ 540 · 💤 ) - TypeDB-ML is the Machine Learning integrations library for TypeDB. Apache-2
OpenNE (🥉 14 · ⭐ 1.6K · 💤 ) - An Open-Source Package for Network Embedding (NE). MIT
- GitHub (👨💻 11 · 🔀 480 · 📋 98 - 2% open · ⏱️ 02.11.2022):
  git clone https://github.com/thunlp/OpenNE
Show 16 hidden projects...
- networkx (🥇 42 · ⭐ 13K) - Network Analysis in Python. ❗Unlicensed
- igraph (🥇 33 · ⭐ 1.1K) - Python interface for igraph. ❗️GPL-2.0
- StellarGraph (🥈 28 · ⭐ 2.7K · 💀) - StellarGraph - Machine Learning on Graphs. Apache-2
- pygal (🥈 27 · ⭐ 2.5K) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
- DIG (🥈 24 · ⭐ 1.5K) - A library for graph deep learning research. ❗️GPL-3.0
- Karate Club (🥉 23 · ⭐ 1.9K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
- PyTorch-BigGraph (🥉 21 · ⭐ 3.3K) - Generate embeddings from large-scale graph-structured.. ❗Unlicensed
- pyRDF2Vec (🥉 21 · ⭐ 210) - Python Implementation and Extension of RDF2Vec. MIT
- DeepWalk (🥉 20 · ⭐ 2.6K · 💀) - DeepWalk - Deep Learning for Graphs. ❗Unlicensed
- DeepGraph (🥉 17 · ⭐ 270 · 💀) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
- Euler (🥉 15 · ⭐ 2.8K · 💀) - A distributed graph deep learning framework. Apache-2
- GraphVite (🥉 15 · ⭐ 1.1K · 💀) - GraphVite: A General and High-performance Graph Embedding.. Apache-2
- Sematch (🥉 15 · ⭐ 410 · 💀) - semantic similarity framework for knowledge graph. Apache-2
- GraphSAGE (🥉 14 · ⭐ 3.1K · 💀) - Representation learning on large graphs using stochastic.. MIT
- ptgnn (🥉 14 · ⭐ 370 · 💀) - A PyTorch Graph Neural Network Library. MIT
- OpenKE (🥉 13 · ⭐ 3.5K · 💤) - An Open-Source Package for Knowledge Embedding (KE). ❗Unlicensed
Audio Data
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
speechbrain (🥇35 · ⭐ 6.2K) - A PyTorch-based Speech Toolkit. Apache-2
torchaudio (🥇 35 · ⭐ 2.1K · 📈 ) - Data manipulation and transformation for audio signal.. BSD-2
SpeechRecognition (🥈 33 · ⭐ 7.3K) - Speech recognition module for Python, supporting several.. BSD-3
audiomentations (🥈 29 · ⭐ 1.4K) - A Python library for audio data augmentation. Inspired by.. MIT
pyAudioAnalysis (🥉28 · ⭐ 5.3K · 💤) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
tinytag (🥉 28 · ⭐ 610) - Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A,.. MIT
python-soundfile (🥉 26 · ⭐ 560 · 📉 ) - SoundFile is an audio library based on libsndfile, CFFI,.. BSD-3
Show 10 hidden projects...
- DeepSpeech (🥈 34 · ⭐ 22K · 💀) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0
- aubio (🥉 28 · ⭐ 3K · 💀) - a library for audio and music analysis. ❗️GPL-3.0
- Essentia (🥉 28 · ⭐ 2.4K) - C++ library for audio and music analysis, description and.. ❗️AGPL-3.0
- Madmom (🥉 26 · ⭐ 1.1K · 💀) - Python audio and music signal processing library. BSD-3
- python_speech_features (🥉 25 · ⭐ 2.3K · 💀) - This library provides common speech features for ASR.. MIT
- TTS (🥉 22 · ⭐ 7.6K · 💀) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
- TimeSide (🥉 22 · ⭐ 350) - scalable audio processing framework and server written in Python. ❗️AGPL-3.0
- Dejavu (🥉 21 · ⭐ 6.1K · 💀) - Audio fingerprinting and recognition in Python. MIT
- Muda (🥉 17 · ⭐ 220 · 💀) - A library for augmenting annotated audio data. ISC
- textlesslib (🥉 9 · ⭐ 410 · 💀) - Library for Textless Spoken Language Processing. MIT
Geospatial Data
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pydeck (🥇 41 · ⭐ 11K) - WebGL2 powered visualization framework. MIT
- GitHub (👨💻 230 · 🔀 2K · 📦 6.1K · 📋 2.7K - 8% open · ⏱️ 27.06.2023):
  git clone https://github.com/visgl/deck.gl
- PyPi (📥 1.4M / month · 📦 35 · ⏱️ 25.10.2021):
  pip install pydeck
- Conda (📥 400K · ⏱️ 16.06.2023):
  conda install -c conda-forge pydeck
- npm (📥 470K / month · 📦 450 · ⏱️ 14.06.2023):
  npm install deck.gl
ipyleaflet (🥉32 · ⭐ 1.4K) - A Jupyter - Leaflet.js bridge. MIT
ArcGIS API (🥉30 · ⭐ 1.6K) - Documentation and samples for ArcGIS API for Python. Apache-2
- GitHub (👨💻 85 · 🔀 1K · 📥 8.5K · 📋 590 - 5% open · ⏱️ 22.06.2023):
  git clone https://github.com/Esri/arcgis-python-api
- PyPi (📥 89K / month):
  pip install arcgis
- Docker Hub (📥 11K · ⭐ 40 · ⏱️ 17.06.2022):
  docker pull esridocker/arcgis-api-python-notebook
EarthPy (🥉28 · ⭐ 450) - A package built to support working with spatial data using open source.. BSD-3
pymap3d (🥉 25 · ⭐ 310) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
Show 8 hidden projects...
- Rasterio (🥈 35 · ⭐ 2K) - Rasterio reads and writes geospatial raster datasets. ❗Unlicensed
- Geocoder (🥈 33 · ⭐ 1.5K · 💀) - Python Geocoder. MIT
- Satpy (🥉 30 · ⭐ 930) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
- Sentinelsat (🥉 28 · ⭐ 900) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
- prettymaps (🥉 24 · ⭐ 9.8K) - A small set of Python functions to draw pretty maps from.. ❗️AGPL-3.0
- Mapbox GL (🥉 23 · ⭐ 640 · 💀) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
- gmaps (🥉 21 · ⭐ 750 · 💀) - Google maps for Jupyter notebooks. BSD-3
- geoplotlib (🥉 20 · ⭐ 990 · 💀) - python toolbox for visualizing geographical data and making maps. MIT
Financial Data
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
Alpha Vantage (🥈 29 · ⭐ 4K · 📈 ) - A python wrapper for Alpha Vantage API for financial data. MIT
stockstats (🥉 28 · ⭐ 1.1K) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
TensorTrade (🥉 27 · ⭐ 4.2K · 💤 ) - An open source reinforcement learning framework for.. Apache-2
PyAlgoTrade (🥉25 · ⭐ 4.1K) - Python Algorithmic Trading Library. Apache-2
Crypto Signals (🥉 23 · ⭐ 4.4K · 💤 ) - Github.com/CryptoSignal - Trading & Technical Analysis Bot -.. MIT
- GitHub (👨💻 28 · 🔀 1.1K · 📋 260 - 21% open · ⏱️ 09.08.2022):
  git clone https://github.com/CryptoSignal/crypto-signal
- Docker Hub (📥 140K · ⭐ 8 · ⏱️ 03.09.2020):
  docker pull shadowreaver/crypto-signal
finmarketpy (🥉 22 · ⭐ 3.1K) - Python library for backtesting trading strategies & analyzing.. Apache-2
tf-quant-finance (🥉 21 · ⭐ 3.9K) - High-performance TensorFlow library for quantitative.. Apache-2
Show 12 hidden projects...
- zipline (🥇 33 · ⭐ 16K · 💀) - Zipline, a Pythonic Algorithmic Trading Library. Apache-2
- arch (🥇 32 · ⭐ 1.1K) - ARCH models in Python. ❗️NCSA
- pyfolio (🥈 31 · ⭐ 5K · 💀) - Portfolio and risk analytics in Python. Apache-2
- backtrader (🥈 29 · ⭐ 11K) - Python Backtesting library for trading strategies. ❗️GPL-3.0
- Alphalens (🥉 27 · ⭐ 2.7K · 💀) - Performance analysis of predictive (alpha) stock factors. Apache-2
- empyrical (🥉 27 · ⭐ 1.1K · 💀) - Common financial risk and performance metrics. Used by.. Apache-2
- Enigma Catalyst (🥉 24 · ⭐ 2.4K · 💀) - An Algorithmic Trading Library for Crypto-Assets in.. Apache-2
- FinTA (🥉 23 · ⭐ 1.9K · 💤) - Common financial technical indicators implemented in Pandas. ❗️LGPL-3.0
- Backtesting.py (🥉 20 · ⭐ 3.8K) - Backtest trading strategies in Python. ❗️AGPL-3.0
- FinQuant (🥉 18 · ⭐ 980 · 💀) - A program for financial portfolio management, analysis and.. MIT
- surpriver (🥉 12 · ⭐ 1.6K · 💀) - Find big moving stocks before they move using machine.. ❗️GPL-3.0
- pyrtfolio (🥉 8 · ⭐ 130 · 💀) - Python package to generate stock portfolios. ❗️GPL-3.0
Time Series Data
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
Prophet (🥇33 · ⭐ 16K) - Tool for producing high quality forecasts for time series data that has.. MIT
NeuralProphet (🥇33 · ⭐ 3K) - NeuralProphet: A simple forecasting package. MIT
StatsForecast (🥇 33 · ⭐ 2.7K) - Lightning fast forecasting with statistical and econometric.. Apache-2
pmdarima (🥇 33 · ⭐ 1.4K) - A statistical library designed to fill the void in Python's time series.. MIT
Darts (🥈31 · ⭐ 6K) - A python library for user-friendly forecasting and anomaly detection on.. Apache-2
- GitHub (👨💻 93 · 🔀 680 · 📦 310 · 📋 1.1K - 21% open · ⏱️ 29.06.2023):
  git clone https://github.com/unit8co/darts
- PyPi (📥 13K / month):
  pip install u8darts
- Conda (📥 31K · ⏱️ 16.06.2023):
  conda install -c conda-forge u8darts-all
- Docker Hub (📥 380 · ⏱️ 12.04.2023):
  docker pull unit8/darts
STUMPY (🥈 31 · ⭐ 2.7K) - STUMPY is a powerful and scalable Python library for modern time series.. BSD-3
pytorch-forecasting (🥈 29 · ⭐ 3K) - Time series forecasting with PyTorch. MIT
NeuralForecast (🥈 29 · ⭐ 1.5K) - Scalable and user friendly neural forecasting algorithms. Apache-2
uber/orbit (🥉 25 · ⭐ 1.7K) - A Python package for Bayesian forecasting with object-oriented.. Apache-2
Show 7 hidden projects...
- PyFlux (🥉23 · ⭐ 2K · 💀) - Open source time series library for Python.
BSD-3
- ADTK (🥉20 · ⭐ 950 · 💀) - A Python toolkit for rule-based/unsupervised anomaly detection in time..
MPL-2.0
- matrixprofile-ts (🥉20 · ⭐ 710 · 💀) - A Python library for detecting patterns and anomalies..
Apache-2
- tsflex (🥉19 · ⭐ 300) - Flexible time series feature extraction & processing.
MIT
- tsaug (🥉14 · ⭐ 300 · 💀) - A Python package for time series augmentation.
Apache-2
- atspy (🥉12 · ⭐ 480 · 💀) - AtsPy: Automated Time Series Models in Python (by @firmai).
❗Unlicensed
- tslumen (🥉8 · ⭐ 48 · 💤) - A library for Time Series EDA (exploratory data analysis).
Apache-2
Medical Data
Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.
MNE (🥇38 · ⭐ 2.3K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
DIPY (🥈34 · ⭐ 600 · 📈) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic.. BSD-3
DeepVariant (🥉25 · ⭐ 2.8K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
Show 10 hidden projects...
- NiBabel (🥈31 · ⭐ 560) - Python package to access a cacophony of neuro-imaging file formats.
❗Unlicensed
- NiftyNet (🥉25 · ⭐ 1.3K · 💀) - [unmaintained] An open-source convolutional neural..
Apache-2
- MedPy (🥉23 · ⭐ 480 · 💤) - Medical image processing in Python.
❗️GPL-3.0
- Brainiak (🥉21 · ⭐ 300 · 💀) - Brain Imaging Analysis Kit.
Apache-2
- Glow (🥉21 · ⭐ 230) - An open-source toolkit for large-scale genomic analysis.
Apache-2
- DLTK (🥉20 · ⭐ 1.4K · 💀) - Deep Learning Toolkit for Medical Image Analysis.
Apache-2
- Medical Detection Toolkit (🥉14 · ⭐ 1.2K · 💀) - The Medical Detection Toolkit contains 2D + 3D..
Apache-2
- MedicalTorch (🥉14 · ⭐ 820 · 💀) - A medical imaging framework for Pytorch.
Apache-2
- MedicalNet (🥉13 · ⭐ 1.6K · 💀) - Many studies have shown that the performance on deep learning is..
MIT
- DeepNeuro (🥉11 · ⭐ 120 · 💀) - A deep learning python package for neuroimaging data. Made by:.
MIT
Tabular Data
Libraries for processing tabular and structured data.
carefree-learn (🥈21 · ⭐ 400) - Deep Learning PyTorch. MIT
pytorch_tabular (🥉 20 · ⭐ 960) - A standard framework for modelling Deep Learning Models.. MIT
Show 3 hidden projects...
Optical Character Recognition
Libraries for optical character recognition (OCR) and text extraction from images or videos.
EasyOCR (🥇35 · ⭐ 19K) - Ready-to-use OCR with 80+ supported languages and all popular writing.. Apache-2
OCRmyPDF (🥈 31 · ⭐ 9.2K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
Tesseract (🥉27 · ⭐ 4.9K) - Python-tesseract is an optical character recognition (OCR) tool.. Apache-2
pdftabextract (🥉21 · ⭐ 2.1K · 💤) - A set of tools for extracting tables from PDF files.. Apache-2
Mozart (🥉 10 · ⭐ 480 · 💤 ) - An optical music recognition (OMR) system. Converts sheet.. Apache-2
-
GitHub (👨💻 5 · 🔀 68 · 📋 14 - 14% open · ⏱️ 24.08.2022):
git clone https://github.com/aashrafh/Mozart
Show 2 hidden projects...
- attention-ocr (🥉21 · ⭐ 950 · 💀) - A Tensorflow model for text recognition (CNN + seq2seq..
MIT
- doc2text (🥉18 · ⭐ 1.3K · 💀) - Detect text blocks and OCR poorly scanned PDFs in bulk. Python..
MIT
Data Containers & Structures
General-purpose data containers & structures as well as utilities & extensions for pandas.
🔗 best-of-python - Data Containers ( ⭐ 3K) - Collection of data-container, dataframe, and pandas-..
Data Loading & Extraction
Libraries for loading, collecting, and extracting data from a variety of data sources and formats.
🔗 best-of-python - Data Extraction ( ⭐ 3K) - Collection of data-loading and -extraction libraries.
Web Scraping & Crawling
Libraries for web scraping, crawling, downloading, and mining.
Data Pipelines & Streaming
Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
Distributed Machine Learning
Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.
Ray (🥇44 · ⭐ 26K) - Ray is a unified framework for scaling AI and Python applications. Ray.. Apache-2
dask.distributed (🥇 40 · ⭐ 1.5K) - A distributed task scheduler for Dask. BSD-3
DeepSpeed (🥇 39 · ⭐ 27K) - DeepSpeed is a deep learning optimization library that makes.. Apache-2
-
GitHub (👨💻 210 · 🔀 3.2K · 📦 1.8K · 📋 1.9K - 46% open · ⏱️ 29.06.2023):
git clone https://github.com/microsoft/DeepSpeed
-
PyPi (📥 270K / month · 📦 51 · ⏱️ 17.01.2023):
pip install deepspeed
-
Docker Hub (📥 17K · ⭐ 3 · ⏱️ 02.09.2022):
docker pull deepspeed/deepspeed
horovod (🥈38 · ⭐ 13K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
BigDL (🥈37 · ⭐ 4.2K) - Fast, distributed, secure AI for Big Data. Apache-2
-
GitHub (👨💻 180 · 🔀 1.1K · 📦 44 · 📋 1.9K - 39% open · ⏱️ 29.06.2023):
git clone https://github.com/intel-analytics/BigDL
-
PyPi (📥 12K / month · 📦 1 · ⏱️ 07.07.2022):
pip install bigdl
-
Maven (📦 4 · ⏱️ 20.04.2021):
<dependency> <groupId>com.intel.analytics.bigdl</groupId> <artifactId>bigdl-SPARK_2.4</artifactId> <version>[VERSION]</version> </dependency>
H2O-3 (🥈 36 · ⭐ 6.3K) - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning.. Apache-2
ColossalAI (🥈32 · ⭐ 31K) - Making large AI models cheaper, faster and more accessible. Apache-2
-
GitHub (👨💻 140 · 🔀 3.5K · 📦 240 · 📋 1.2K - 26% open · ⏱️ 29.06.2023):
git clone https://github.com/hpcaitech/colossalai
petastorm (🥉27 · ⭐ 1.6K) - Petastorm library enables single machine or distributed training.. Apache-2
TensorFlowOnSpark (🥉 26 · ⭐ 3.8K) - TensorFlowOnSpark brings TensorFlow programs to.. Apache-2
analytics-zoo (🥉24 · ⭐ 2.6K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
Apache Singa (🥉21 · ⭐ 2.9K) - a distributed deep learning platform. Apache-2
-
GitHub (👨💻 81 · 🔀 1K · 📦 2 · 📋 120 - 44% open · ⏱️ 02.04.2023):
git clone https://github.com/apache/singa
-
Conda (📥 640 · ⏱️ 16.06.2023):
conda install -c nusdbsystem singa
-
Docker Hub (📥 2.9K · ⭐ 4 · ⏱️ 31.05.2022):
docker pull apache/singa
parallelformers (🥉 18 · ⭐ 680 · 💤 ) - Parallelformers: An Efficient Model Parallelization.. Apache-2
mesh-transformer-jax (🥉 16 · ⭐ 6K) - Model parallel transformers in JAX and Haiku. Apache-2
-
GitHub (👨💻 23 · 🔀 850 · 📋 200 - 20% open · ⏱️ 12.01.2023):
git clone https://github.com/kingoflolz/mesh-transformer-jax
moolib (🥉10 · ⭐ 360) - A library for distributed ML training with PyTorch. MIT
-
GitHub (👨💻 6 · 🔀 20 · 📦 3 · 📋 19 - 36% open · ⏱️ 12.12.2022):
git clone https://github.com/facebookresearch/moolib
Show 11 hidden projects...
- DEAP (🥈32 · ⭐ 5.2K) - Distributed Evolutionary Algorithms in Python.
❗️LGPL-3.0
- ipyparallel (🥈31 · ⭐ 2.4K) - IPython Parallel: Interactive Parallel Computing in..
❗Unlicensed
- somoclu (🥉22 · ⭐ 250 · 💤) - Massively parallel self-organizing maps: accelerate training on..
MIT
- launchpad (🥉21 · ⭐ 300) - Launchpad is a library that simplifies writing distributed..
Apache-2
- BytePS (🥉20 · ⭐ 3.4K · 💀) - A high performance and generic framework for distributed DNN..
Apache-2
- TensorFrames (🥉20 · ⭐ 750 · 💀) - [DEPRECATED] Tensorflow wrapper for DataFrames on..
Apache-2
- sk-dist (🥉20 · ⭐ 280) - Distributed scikit-learn meta-estimators in PySpark.
Apache-2
- Fiber (🥉17 · ⭐ 1K · 💀) - Distributed Computing for AI Made Simple.
Apache-2
- bluefog (🥉17 · ⭐ 290) - Distributed and decentralized training framework for PyTorch..
Apache-2
- LazyCluster (🥉13 · ⭐ 44 · 💀) - Distributed machine learning made simple.
Apache-2
- autodist (🥉11 · ⭐ 130 · 💀) - Simple Distributed Deep Learning on TensorFlow.
Apache-2
Hyperparameter Optimization & AutoML
Libraries for hyperparameter optimization, automl and neural architecture search.
Bayesian Optimization (🥇33 · ⭐ 6.9K) - A Python implementation of global optimization with.. MIT
featuretools (🥇 33 · ⭐ 6.7K) - An open source python library for automated feature engineering. BSD-3
Keras Tuner (🥈32 · ⭐ 2.7K) - A Hyperparameter Tuning Library for Keras. Apache-2
auto-sklearn (🥈 31 · ⭐ 7K) - Automated Machine Learning with scikit-learn. BSD-3
mljar-supervised (🥈 30 · ⭐ 2.7K) - Python package for AutoML on Tabular Data with Feature.. MIT
AutoGluon (🥈29 · ⭐ 5.9K) - AutoGluon: AutoML for Image, Text, Time Series, and.. Apache-2
-
GitHub (👨💻 100 · 🔀 730 · 📦 2 · 📋 1K - 19% open · ⏱️ 29.06.2023):
git clone https://github.com/autogluon/autogluon
-
PyPi (📥 66K / month):
pip install autogluon
-
Docker Hub (📥 3.4K · ⭐ 11 · ⏱️ 17.03.2023):
docker pull autogluon/autogluon
lazypredict (🥈26 · ⭐ 2K · 💤) - Lazy Predict helps build a lot of basic models without much.. MIT
Hyperactive (🥈23 · ⭐ 440) - An optimization and data collection toolbox for convenient and fast.. MIT
featurewiz (🥈 23 · ⭐ 440) - Use advanced feature engineering strategies and select best.. Apache-2
Neuraxle (🥉22 · ⭐ 570 · 💤) - The world's cleanest AutoML library - Do hyperparameter tuning.. Apache-2
Auto ViML (🥉22 · ⭐ 440) - Automatically Build Multiple ML Models with a Single Line of Code... Apache-2
igel (🥉20 · ⭐ 3.1K) - a delightful machine learning tool that allows you to train, test, and use.. MIT
AlphaPy (🥉 19 · ⭐ 940) - Automated Machine Learning [AutoML] with Python, scikit-learn, Keras,.. Apache-2
opytimizer (🥉 17 · ⭐ 570) - Opytimizer is a Python library consisting of meta-heuristic.. Apache-2
shap-hypetune (🥉17 · ⭐ 470 · 💤) - A python package for simultaneous Hyperparameters Tuning and.. MIT
Show 26 hidden projects...
- Optuna (🥇39 · ⭐ 8.3K) - A hyperparameter optimization framework.
❗Unlicensed
- TPOT (🥇34 · ⭐ 9.1K) - A Python Automated Machine Learning tool that optimizes machine..
❗️LGPL-3.0
- scikit-optimize (🥇33 · ⭐ 2.6K · 💀) - Sequential model-based optimization with a..
BSD-3
- SMAC3 (🥈24 · ⭐ 870) - SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter..
❗Unlicensed
- Orion (🥈24 · ⭐ 260) - Asynchronous Distributed Hyperparameter Optimization.
❗Unlicensed
- AdaNet (🥈23 · ⭐ 3.5K · 💀) - Fast and flexible AutoML with learning guarantees.
Apache-2
- HpBandSter (🥈23 · ⭐ 580 · 💀) - a distributed Hyperband implementation on Steroids.
BSD-3
- lightwood (🥈23 · ⭐ 370) - Lightwood is Legos for Machine Learning.
❗️GPL-3.0
- auto_ml (🥉21 · ⭐ 1.6K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production.
MIT
- optunity (🥉21 · ⭐ 410 · 💀) - optimization routines for hyperparameter tuning.
BSD-3
- MLBox (🥉20 · ⭐ 1.4K · 💀) - MLBox is a powerful Automated Machine Learning python library.
❗Unlicensed
- Test Tube (🥉19 · ⭐ 740 · 💀) - Python library to easily log experiments and parallelize..
MIT
- sklearn-deap (🥉19 · ⭐ 730 · 💀) - Use evolutionary algorithms instead of gridsearch in..
MIT
- Auto Tune Models (🥉18 · ⭐ 520 · 💀) - Auto Tune Models - A multi-tenant, multi-data system for..
MIT
- Sherpa (🥉18 · ⭐ 320 · 💀) - Hyperparameter optimization that enables researchers to..
❗️GPL-3.0
- Advisor (🥉17 · ⭐ 1.5K · 💀) - Open-source implementation of Google Vizier for hyper parameters..
Apache-2
- automl-gs (🥉16 · ⭐ 1.8K · 💀) - Provide an input CSV and a target field to predict, generate a..
MIT
- Xcessiv (🥉16 · ⭐ 1.3K · 💀) - A web-based application for quick, scalable, and automated..
Apache-2
- HyperparameterHunter (🥉16 · ⭐ 700 · 💀) - Easy hyperparameter optimization and automatic result..
MIT
- Parfit (🥉15 · ⭐ 200 · 💀) - A package for parallelizing the fit and flexibly scoring of..
MIT
- ENAS (🥉13 · ⭐ 2.6K · 💀) - PyTorch implementation of Efficient Neural Architecture Search via..
Apache-2
- Devol (🥉12 · ⭐ 950 · 💀) - Genetic neural architecture search with Keras.
MIT
- Auptimizer (🥉12 · ⭐ 200 · 💀) - An automatic ML model optimization tool.
❗️GPL-3.0
- Hypermax (🥉12 · ⭐ 110 · 💀) - Better, faster hyper-parameter optimization.
BSD-3
- model_search (🥉11 · ⭐ 3.2K · 💀) - AutoML algorithms for model architecture search at scale.
Apache-2
- Hypertunity (🥉9 · ⭐ 140 · 💀) - A toolset for black-box hyperparameter optimisation.
Apache-2
Reinforcement Learning
Libraries for building and evaluating reinforcement learning & agent-based systems.
OpenAI Gym (🥇 39 · ⭐ 32K) - A toolkit for developing and comparing reinforcement learning.. MIT
TensorLayer (🥈 27 · ⭐ 7.2K) - Deep Learning and Reinforcement Learning Library for.. Apache-2
TensorForce (🥈26 · ⭐ 3.2K · 📈) - Tensorforce: a TensorFlow library for applied.. Apache-2
Stable Baselines (🥉24 · ⭐ 3.8K · 💤) - A fork of OpenAI Baselines, implementations of.. MIT
rliable (🥉 13 · ⭐ 590) - [NeurIPS21 Outstanding Paper] Library for reliable evaluation on RL.. Apache-2
Show 7 hidden projects...
- baselines (🥈28 · ⭐ 15K · 💀) - OpenAI Baselines: high-quality implementations of reinforcement..
MIT
- keras-rl (🥈28 · ⭐ 5.4K · 💀) - Deep Reinforcement Learning for Keras.
MIT
- ChainerRL (🥉24 · ⭐ 1.1K · 💀) - ChainerRL is a deep reinforcement learning library built on top of..
MIT
- TRFL (🥉22 · ⭐ 3.1K · 💀) - TensorFlow Reinforcement Learning.
Apache-2
- SerpentAI (🥉18 · ⭐ 6.5K · 💀) - Game Agent Framework. Helping you create AIs / Bots that learn to..
MIT
- DeepMind Lab (🥉17 · ⭐ 6.9K) - A customisable 3D platform for agent-based AI research.
❗Unlicensed
- Maze (🥉11 · ⭐ 240 · 💤) - Maze Applied Reinforcement Learning Framework.
❗️Custom
Recommender Systems
Libraries for building and evaluating recommendation systems.
Recommenders (🥇 34 · ⭐ 16K) - Best Practices on Recommendation Systems. MIT
lightfm (🥈 30 · ⭐ 4.4K) - A Python implementation of LightFM, a hybrid recommendation algorithm. Apache-2
TF Ranking (🥈 29 · ⭐ 2.6K) - Learning to Rank in TensorFlow. Apache-2
TF Recommenders (🥈29 · ⭐ 1.6K) - TensorFlow Recommenders is a library for building.. Apache-2
scikit-surprise (🥈 28 · ⭐ 5.9K) - A Python scikit for building and analyzing recommender.. BSD-3
recmetrics (🥉19 · ⭐ 520) - A library of metrics for evaluating recommender systems. MIT
Show 7 hidden projects...
- tensorrec (🥉22 · ⭐ 1.2K · 💀) - A TensorFlow recommendation algorithm and framework in..
Apache-2
- lkpy (🥉22 · ⭐ 240) - Python recommendation toolkit.
MIT
- fastFM (🥉21 · ⭐ 1K · 💀) - fastFM: A Library for Factorization Machines.
BSD-3
- Spotlight (🥉19 · ⭐ 2.9K · 💀) - Deep recommender models using PyTorch.
MIT
- Case Recommender (🥉18 · ⭐ 440 · 💀) - Case Recommender: A Flexible and Extensible Python..
MIT
- Collie (🥉18 · ⭐ 100) - A library for preparing, training, and evaluating scalable deep..
BSD-3
- OpenRec (🥉15 · ⭐ 400 · 💀) - OpenRec is an open-source and modular library for neural network-..
Apache-2
Privacy Machine Learning
Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
TensorFlow Privacy (🥈 27 · ⭐ 1.8K) - Library for training machine learning models with.. Apache-2
TFEncrypted (🥉26 · ⭐ 1.1K) - A Framework for Encrypted Machine Learning in TensorFlow. Apache-2
Show 1 hidden projects...
- PipelineDP (🥉20 · ⭐ 260) - PipelineDP is a Python framework for applying differentially..
Apache-2
Workflow & Experiment Tracking
Libraries to organize, track, and visualize machine learning experiments.
Tensorboard (🥇43 · ⭐ 6.3K) - TensorFlow's Visualization Toolkit. Apache-2
DVC (🥇41 · ⭐ 12K) - Data Version Control | Git for Data & Models | ML Experiments Management. Apache-2
wandb client (🥇40 · ⭐ 6.3K) - A tool for visualizing and tracking your machine learning.. MIT
SageMaker SDK (🥈 38 · ⭐ 1.9K) - A library for training and deploying machine learning.. Apache-2
tensorboardX (🥈35 · ⭐ 7.6K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
AzureML SDK (🥈 34 · ⭐ 3.7K) - Python notebooks with ML and deep learning examples with Azure.. MIT
ClearML (🥈 33 · ⭐ 4.5K) - ClearML - Auto-Magical CI/CD to streamline your ML workflow... Apache-2
-
GitHub (👨💻 81 · 🔀 590 · 📥 1.3K · 📦 620 · 📋 830 - 42% open · ⏱️ 25.06.2023):
git clone https://github.com/allegroai/clearml
-
PyPi (📥 220K / month · 📦 12 · ⏱️ 04.07.2022):
pip install clearml
-
Docker Hub (📥 30K · ⏱️ 05.10.2020):
docker pull allegroai/trains
ml-metadata (🥉 28 · ⭐ 540) - For recording and retrieving metadata associated with ML.. Apache-2
Neptune.ai (🥉28 · ⭐ 390) - Experiment tracking tool and model registry. Apache-2
Labml (🥉24 · ⭐ 1.5K) - Monitor deep learning model training and hardware usage from your mobile.. MIT
TensorWatch (🥉 20 · ⭐ 3.3K) - Debugging, monitoring and visualization for Python Machine Learning.. MIT
Show 14 hidden projects...
- Catalyst (🥈29 · ⭐ 3.1K · 💀) - Accelerated deep learning R&D.
Apache-2
- SKLL (🥉26 · ⭐ 540) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine..
❗️BSD-1-Clause
- knockknock (🥉25 · ⭐ 2.6K · 💀) - Knock Knock: Get notified when your training ends with only two..
MIT
- livelossplot (🥉25 · ⭐ 1.3K · 💀) - Live training loss plot in Jupyter Notebook for Keras,..
MIT
- gokart (🥉24 · ⭐ 280) - Gokart solves reproducibility, task dependencies, constraints of good code,..
MIT
- hiddenlayer (🥉23 · ⭐ 1.7K · 💀) - Neural network graphs and training metrics for..
MIT
- TensorBoard Logger (🥉20 · ⭐ 620 · 💀) - Log TensorBoard events without touching TensorFlow.
MIT
- MXBoard (🥉20 · ⭐ 330 · 💀) - Logging MXNet data for visualization in TensorBoard.
Apache-2
- keepsake (🥉18 · ⭐ 1.6K · 💀) - Version control for machine learning.
Apache-2
- datmo (🥉17 · ⭐ 340 · 💀) - Open source production model management tool for data scientists.
MIT
- chitra (🥉16 · ⭐ 220) - A multi-functional library for full-stack Deep Learning. Simplifies..
Apache-2
- steppy (🥉16 · ⭐ 140 · 💀) - Lightweight, Python library for fast and reproducible experimentation.
MIT
- ModelChimp (🥉13 · ⭐ 120 · 💀) - Experiment tracking for machine and deep learning projects.
BSD-2
- traintool (🥉9 · ⭐ 12 · 💀) - Train off-the-shelf machine learning models in one..
Apache-2
Model Serialization & Deployment
Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.
Core ML Tools (🥈 33 · ⭐ 3.5K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
TorchServe (🥈 32 · ⭐ 3.5K) - Serve, optimize and scale PyTorch models in production. Apache-2
-
GitHub (👨💻 160 · 🔀 750 · 📥 4.3K · 📋 1.3K - 21% open · ⏱️ 28.06.2023):
git clone https://github.com/pytorch/serve
-
PyPi (📥 34K / month · 📦 10 · ⏱️ 13.05.2022):
pip install torchserve
-
Conda (📥 92K · ⏱️ 14.06.2023):
conda install -c pytorch torchserve
-
Docker Hub (📥 1.1M · ⭐ 17 · ⏱️ 14.06.2023):
docker pull pytorch/torchserve
huggingface_hub (🥈 29 · ⭐ 980) - All the open source things related to the Hugging Face Hub. Apache-2
Hummingbird (🥈 28 · ⭐ 3.2K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
nebullvm (🥈25 · ⭐ 8.2K) - The next-generation platform to monitor and optimize your AI costs.. Apache-2
mmdnn (🥈25 · ⭐ 5.7K · 💤) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
Show 8 hidden projects...
- sklearn-porter (🥉23 · ⭐ 1.2K · 💀) - Transpile trained scikit-learn estimators to C, Java,..
BSD-3
- pytorch2keras (🥉19 · ⭐ 840 · 💀) - PyTorch to Keras model converter.
MIT
- Larq Compute Engine (🥉19 · ⭐ 230) - Highly optimized inference engine for Binarized..
Apache-2
- OMLT (🥉18 · ⭐ 200) - Represent trained machine learning models as Pyomo optimization formulations.
BSD-3
- tfdeploy (🥉16 · ⭐ 350 · 💀) - Deploy tensorflow graphs for fast evaluation and export to..
BSD-3
- modelkit (🥉15 · ⭐ 140) - Toolkit for developing and maintaining ML models.
MIT
- ml-ane-transformers (🥉14 · ⭐ 2.3K · 💤) - Reference implementation of the Transformer..
❗Unlicensed
- backprop (🥉9 · ⭐ 240 · 💀) - Backprop makes it simple to use, finetune, and deploy state-..
❗Unlicensed
Model Interpretability
Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.
shap (🥇40 · ⭐ 20K) - A game theoretic approach to explain the output of any machine learning model. MIT
InterpretML (🥇34 · ⭐ 5.6K) - Fit interpretable models. Explain blackbox machine learning. MIT
evaluate (🥈30 · ⭐ 1.4K) - Evaluate: A library for easily evaluating machine learning models.. Apache-2
DoWhy (🥈 29 · ⭐ 6K) - DoWhy is a Python library for causal inference that supports explicit modeling.. MIT
dtreeviz (🥈29 · ⭐ 2.6K) - A python library for decision tree visualization and model interpretation. MIT
Model Analysis (🥈 29 · ⭐ 1.2K) - Model analysis tools for TensorFlow. Apache-2
responsible-ai-widgets (🥈 29 · ⭐ 880) - Responsible AI Toolbox is a suite of tools providing.. MIT
explainerdashboard (🥈 28 · ⭐ 1.8K) - Quickly build Explainable AI dashboards that show the inner.. MIT
Explainability 360 (🥈26 · ⭐ 1.3K) - Interpretability and explainability of data and.. Apache-2
iNNvestigate (🥈26 · ⭐ 1.1K) - A toolbox to iNNvestigate neural networks predictions!. BSD-2
Fairness 360 (🥉25 · ⭐ 2.1K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
yellowbrick (🥉 24 · ⭐ 4K) - Visual analysis and diagnostic tools to facilitate machine.. Apache-2
LIT (🥉 23 · ⭐ 3.2K) - The Learning Interpretability Tool: Interactively analyze ML models to.. Apache-2
tf-explain (🥉21 · ⭐ 980 · 💤) - Interpretability Methods for tf.keras models with Tensorflow.. MIT
What-If Tool (🥉21 · ⭐ 820) - Source code/webpage/demos for the What-If Tool. Apache-2
model-card-toolkit (🥉21 · ⭐ 380) - A toolkit that streamlines and automates the generation.. Apache-2
fairness-indicators (🥉19 · ⭐ 310) - TensorFlow's Fairness Evaluation and Visualization.. Apache-2
sklearn-evaluation (🥉 18 · ⭐ 360 · 🐣 ) - Machine learning model evaluation made easy: plots,.. MIT
ExplainX.ai (🥉16 · ⭐ 350 · 💤) - Explainable AI framework for data scientists. Explain & debug any.. MIT
Show 20 hidden projects...
- Lime (🥇33 · ⭐ 11K · 💀) - Lime: Explaining the predictions of any machine learning classifier.
BSD-2
- Deep Checks (🥈31 · ⭐ 2.9K) - Deepchecks: Tests for Continuous Validation of ML Models &..
❗️AGPL-3.0
- scikit-plot (🥈28 · ⭐ 2.3K · 💀) - An intuitive library to add plotting functionality to..
MIT
- DALEX (🥈27 · ⭐ 1.2K) - moDel Agnostic Language for Exploration and eXplanation.
❗️GPL-3.0
- Lucid (🥉25 · ⭐ 4.6K · 💀) - A collection of infrastructure and tools for research in..
Apache-2
- keras-vis (🥉24 · ⭐ 3K · 💀) - Neural network visualization toolkit for keras.
MIT
- eli5 (🥉23 · ⭐ 2.7K · 💀) - A library for debugging/inspecting machine learning classifiers and..
MIT
- random-forest-importances (🥉22 · ⭐ 560 · 💀) - Code to compute permutation and drop-column..
MIT
- Quantus (🥉22 · ⭐ 390) - Quantus is an eXplainable AI toolkit for responsible evaluation of..
❗️GPL-3.0
- ecco (🥉21 · ⭐ 1.7K · 💀) - Explain, analyze, and visualize NLP language models. Ecco creates..
BSD-3
- Skater (🥉21 · ⭐ 1.1K) - Python Library for Model Interpretation/Explanations.
❗️UPL-1.0
- deeplift (🥉21 · ⭐ 740 · 💀) - Public facing deeplift repo.
MIT
- TreeInterpreter (🥉21 · ⭐ 730 · 💀) - Package for interpreting scikit-learn's decision tree..
BSD-3
- tcav (🥉19 · ⭐ 580 · 💀) - Code for the TCAV ML interpretability project.
Apache-2
- FlashTorch (🥉17 · ⭐ 710 · 💀) - Visualization toolkit for neural networks in PyTorch! Demo --.
MIT
- XAI (🥉16 · ⭐ 920 · 💀) - XAI - An eXplainability toolbox for machine learning.
MIT
- interpret-text (🥉13 · ⭐ 370 · 💀) - A library that incorporates state-of-the-art explainers..
MIT
- contextual-ai (🥉12 · ⭐ 83 · 💀) - Contextual AI adds explainability to different stages of..
Apache-2
- Attribution Priors (🥉11 · ⭐ 110 · 💀) - Tools for training explainable models using..
MIT
- bias-detector (🥉10 · ⭐ 43 · 💤)
MIT
Vector Similarity Search (ANN)
Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.
🔗 ANN Benchmarks ( ⭐ 3.8K) - Benchmarks of approximate nearest neighbor libraries in Python.
Milvus (🥇38 · ⭐ 21K) - A cloud-native vector database, storage for next generation AI.. Apache-2
-
GitHub (👨💻 250 · 🔀 2.3K · 📥 30K · 📋 8.2K - 7% open · ⏱️ 29.06.2023):
git clone https://github.com/milvus-io/milvus
-
PyPi (📥 350K / month · 📦 27 · ⏱️ 02.04.2022):
pip install pymilvus
-
Docker Hub (📥 3.9M · ⭐ 30 · ⏱️ 29.06.2023):
docker pull milvusdb/milvus
Faiss (🥇 37 · ⭐ 23K) - A library for efficient similarity search and clustering of dense vectors. MIT
Annoy (🥈 36 · ⭐ 12K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage.. Apache-2
hnswlib (🥈 31 · ⭐ 3.2K) - Header-only C++/python library for fast approximate nearest neighbors. Apache-2
PyNNDescent (🥉28 · ⭐ 770) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
USearch (🥉 21 · ⭐ 330 · 🐣 ) - Smaller & Faster Vector Search Engine for C++, Python,.. Apache-2
-
GitHub (👨💻 12 · 🔀 16 · 📥 14 · 📦 6 · 📋 18 - 38% open · ⏱️ 24.06.2023):
git clone https://github.com/unum-cloud/usearch
-
PyPi (📥 7.8K / month · ⏱️ 24.06.2023):
pip install usearch
-
npm (📥 2.3K / month · ⏱️ 24.06.2023):
npm install usearch
-
Docker Hub (📥 2 · ⭐ 1 · ⏱️ 24.06.2023):
docker pull unum/usearch
NearPy (🥉20 · ⭐ 740) - Python framework for fast (approximated) nearest neighbour search in large,.. MIT
N2 (🥉 19 · ⭐ 550) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs fast.. Apache-2
Show 3 hidden projects...
- NMSLIB (🥈31 · ⭐ 3.1K · 💀) - Non-Metric Space Library (NMSLIB): An efficient similarity search..
Apache-2
- Magnitude (🥉23 · ⭐ 1.6K · 💀) - A fast, efficient universal vector embedding utility package.
MIT
- PySparNN (🥉11 · ⭐ 910 · 💀) - Approximate Nearest Neighbor Search for Sparse Data in Python!.
BSD-3
Probabilistics & Statistics
Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, gaussian processes, or statistics.
tensorflow-probability (🥇37 · ⭐ 3.9K) - Probabilistic reasoning and statistical analysis in.. Apache-2
pomegranate (🥈31 · ⭐ 3.1K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
filterpy (🥈31 · ⭐ 2.8K · 💤) - Python Kalman filtering and optimal estimation library. Implements.. MIT
SALib (🥈 31 · ⭐ 720) - Sensitivity Analysis Library in Python. Contains Sobol, Morris, FAST, and.. MIT
Orbit (🥉24 · ⭐ 1.7K) - A Python package for Bayesian forecasting with object-oriented design.. Apache-2
Baal (🥉21 · ⭐ 760) - Library to enable Bayesian active learning in your research or labeling.. Apache-2
Show 7 hidden projects...
- pingouin (🥉29 · ⭐ 1.4K) - Statistical package in Python based on Pandas.
❗️GPL-3.0
- Edward (🥉27 · ⭐ 4.8K · 💀) - A probabilistic programming language in TensorFlow. Deep..
Apache-2
- PyStan (🥉26 · ⭐ 270) - PyStan, a Python interface to Stan, a platform for statistical modeling...
ISC
- scikit-posthocs (🥉22 · ⭐ 290) - Multiple Pairwise Comparisons (Post Hoc) Tests in Python.
MIT
- pyhsmm (🥉20 · ⭐ 530 · 💀) - Bayesian inference in HSMMs and HMMs.
MIT
- Funsor (🥉20 · ⭐ 220) - Functional tensors for probabilistic programming.
Apache-2
- ZhuSuan (🥉15 · ⭐ 2.2K · 💀) - A probabilistic programming library for Bayesian deep learning,..
MIT
Adversarial Robustness
Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.
ART (🥇 33 · ⭐ 3.8K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
-
GitHub (👨💻 130 · 🔀 1K · 📦 390 · 📋 830 - 15% open · ⏱️ 20.06.2023):
git clone https://github.com/Trusted-AI/adversarial-robustness-toolbox
-
PyPi (📥 20K / month · 📦 8 · ⏱️ 21.04.2023):
pip install adversarial-robustness-toolbox
-
Conda (📥 28K · ⏱️ 16.06.2023):
conda install -c conda-forge adversarial-robustness-toolbox
CleverHans (🥈30 · ⭐ 5.8K) - An adversarial example library for constructing attacks,.. MIT
TextAttack (🥈28 · ⭐ 2.4K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
AdvBox (🥉18 · ⭐ 1.3K · 💤) - Advbox is a toolbox to generate adversarial examples that fool.. Apache-2
Show 5 hidden projects...
- Foolbox (🥈28 · ⭐ 2.5K · 💀) - A Python toolbox to create adversarial examples that fool neural..
MIT
- advertorch (🥉22 · ⭐ 1.2K · 💀) - A Toolbox for Adversarial Robustness Research.
❗️GPL-3.0
- robustness (🥉19 · ⭐ 800 · 💀) - A library for experimenting with, training and evaluating neural..
MIT
- textflint (🥉17 · ⭐ 610 · 💤) - Unified Multilingual Robustness Evaluation Toolkit for..
❗️GPL-3.0
- Adversary (🥉15 · ⭐ 380 · 💀) - Tool to generate adversarial text examples and test machine..
MIT
GPU & Accelerator Utilities
Libraries that require and make use of CUDA/GPU or other accelerator hardware capabilities to optimize machine learning tasks.
optimum (🥇 32 · ⭐ 1.3K) - Accelerate training and inference of Transformers and Diffusers with.. Apache-2
CuPy (🥉 25 · ⭐ 7K · 📉 ) - NumPy & SciPy for GPU. MIT
-
GitHub (👨💻 260 · 🔀 690):
git clone https://github.com/cupy/cupy
-
PyPi (📥 24K / month · 📦 180 · ⏱️ 19.01.2023):
pip install cupy
-
Conda (📥 2.9M · ⏱️ 28.06.2023):
conda install -c conda-forge cupy
-
Docker Hub (📥 57K · ⭐ 11 · ⏱️ 13.06.2023):
docker pull cupy/cupy
DALI (🥉 25 · ⭐ 4.5K) - A GPU-accelerated library containing highly optimized building blocks.. Apache-2
-
GitHub (👨💻 84 · 🔀 580 · 📋 1.5K - 18% open · ⏱️ 29.06.2023):
git clone https://github.com/NVIDIA/DALI
Vulkan Kompute (🥉20 · ⭐ 1.2K) - General purpose GPU compute framework built on Vulkan to.. Apache-2
cuSignal (🥉 19 · ⭐ 680) - GPU accelerated signal processing. Apache-2
-
GitHub (👨💻 46 · 🔀 120 · 📋 150 - 15% open · ⏱️ 07.06.2023):
git clone https://github.com/rapidsai/cusignal
Show 7 hidden projects...
- scikit-cuda (🥉24 · ⭐ 940 · 💀) - Python interface to GPU-powered libraries.
BSD-3
- GPUtil (🥉23 · ⭐ 980 · 💀) - A Python module for getting the GPU status from NVIDIA GPUs using..
MIT
- BlazingSQL (🥉22 · ⭐ 1.8K · 💀) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for..
Apache-2
- py3nvml (🥉22 · ⭐ 220 · 💀) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside..
BSD-3
- nvidia-ml-py3 (🥉19 · ⭐ 110 · 💀) - Python 3 Bindings for the NVIDIA Management Library.
BSD-3
- ipyexperiments (🥉16 · ⭐ 160 · 💀) - jupyter/ipython experiment containers for GPU and..
Apache-2
- SpeedTorch (🥉15 · ⭐ 660 · 💀) - Library for faster pinned CPU - GPU transfer in Pytorch.
MIT
Tensorflow Utilities
Libraries that extend TensorFlow with additional capabilities.
TensorFlow Datasets (🥇35 · ⭐ 3.9K) - TFDS is a collection of datasets ready to use with.. Apache-2
tensorflow-hub (🥈33 · ⭐ 3.4K) - A library for transfer learning by reusing parts of.. Apache-2
tensor2tensor (🥈 32 · ⭐ 14K) - Library of deep learning models and datasets designed to.. Apache-2
TF Model Optimization (🥈 32 · ⭐ 1.4K) - A toolkit to optimize ML models for deployment for.. Apache-2
TensorFlow Transform (🥈 32 · ⭐ 960) - Input pipeline framework. Apache-2
TensorFlow I/O (🥉 29 · ⭐ 650) - Dataset, streaming, and file system extensions.. Apache-2
Neural Structured Learning (🥉26 · ⭐ 970) - Training neural models with structured signals. Apache-2
TensorFlow Cloud (🥉26 · ⭐ 350) - The TensorFlow Cloud repository provides APIs that.. Apache-2
TF Compression (🥉22 · ⭐ 760) - Data compression in TensorFlow. Apache-2
Show 5 hidden projects...
- Keras-Preprocessing (🥈30 · ⭐ 1K · 💀) - Utilities for working with image data, text data, and..
MIT
- efficientnet (🥉27 · ⭐ 2K · 💀) - Implementation of EfficientNet model. Keras and..
Apache-2
- Saliency (🥉22 · ⭐ 890 · 💀) - Framework-agnostic implementation for state-of-the-art..
Apache-2
- TensorNets (🥉20 · ⭐ 1K · 💀) - High level network definitions with pre-trained weights in..
MIT
- tffm (🥉17 · ⭐ 780 · 💀) - TensorFlow implementation of an arbitrary order Factorization Machine.
MIT
Jax Utilities
Libraries that extend Jax with additional capabilities.
Show 1 hidden projects...
- jaxdf (🥉14 · ⭐ 71) - A JAX-based research framework for writing differentiable..
❗️LGPL-3.0
Sklearn Utilities
Libraries that extend scikit-learn with additional capabilities.
imbalanced-learn (🥇34 · ⭐ 6.4K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
scikit-learn-intelex (🥇34 · ⭐ 980) - Intel(R) Extension for Scikit-learn is a seamless way.. Apache-2
category_encoders (🥈 32 · ⭐ 2.2K) - A library of sklearn compatible categorical variable.. BSD-3
scikit-multilearn (🥈 27 · ⭐ 850) - A scikit-learn based module for multi-label et. al... BSD-2
scikit-opt (🥈 24 · ⭐ 4.2K) - Genetic Algorithm, Particle Swarm Optimization, Simulated.. MIT
scikit-lego (🥈24 · ⭐ 1K) - Extra blocks for scikit-learn pipelines. MIT
iterative-stratification (🥉 22 · ⭐ 780 · 💤 ) - scikit-learn cross validators for iterative.. BSD-3
combo (🥉 20 · ⭐ 620) - (AAAI 20) A Python Toolbox for Machine Learning Model Combination. BSD-2
- MLxtend (🥈33 · ⭐ 4.4K) - A library of extension and helper modules for Python's data.. ❗Unlicensed
- scikit-survival (🥈29 · ⭐ 950) - Survival analysis built on top of scikit-learn. ❗️GPL-3.0
- fancyimpute (🥈26 · ⭐ 1.2K · 💀) - Multivariate imputation and matrix completion.. Apache-2
- sklearn-contrib-lightning (🥉23 · ⭐ 1.7K · 💀) - Large-scale linear classification, regression and.. BSD-3
- sklearn-crfsuite (🥉23 · ⭐ 420 · 💀) - scikit-learn inspired API for CRFsuite. ❗Unlicensed
- skope-rules (🥉19 · ⭐ 540) - machine learning with logical rules in Python. ❗Unlicensed
- celer (🥉19 · ⭐ 170) - Fast solver for L1-type problems: Lasso, sparse Logistic regression,.. BSD-3
- skggm (🥉17 · ⭐ 220) - Scikit-learn compatible estimation of general graphical models. MIT
- scikit-tda (🥉16 · ⭐ 400 · 💀) - Topological Data Analysis for Python. ❗Unlicensed
- dabl (🥉16 · ⭐ 120) - Data Analysis Baseline Library. BSD-3
Pytorch Utilities
Libraries that extend Pytorch with additional capabilities.
accelerate (🥇 37 · ⭐ 5.1K) - A simple way to train and use PyTorch models with multi-.. Apache-2
PML (🥇33 · ⭐ 5.3K) - The easiest way to use deep metric learning in your application. Modular,.. MIT
tinygrad (🥇30 · ⭐ 15K) - You like pytorch? You like micrograd? You love tinygrad!. MIT
- GitHub (👨💻 150 · 🔀 1.5K · 📦 20 · 📋 240 - 11% open · ⏱️ 28.06.2023): git clone https://github.com/geohot/tinygrad
lightning-flash (🥇30 · ⭐ 1.7K) - Your PyTorch AI Factory - Flash enables you to easily.. Apache-2
torchdiffeq (🥈29 · ⭐ 4.7K) - Differentiable ODE solvers with full GPU support and.. MIT
pytorch-optimizer (🥈 29 · ⭐ 2.8K) - torch-optimizer -- collection of optimizers for.. Apache-2
torch-scatter (🥈 27 · ⭐ 1.3K) - PyTorch Extension Library of Optimized Scatter Operations. MIT
Pytorch Toolbelt (🥈26 · ⭐ 1.4K) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
PyTorch Sparse (🥈25 · ⭐ 840) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT
reformer-pytorch (🥉 21 · ⭐ 1.9K) - Reformer, the efficient Transformer, in Pytorch. MIT
- pretrainedmodels (🥇30 · ⭐ 8.8K · 💀) - Pretrained ConvNets for pytorch: NASNet, ResNeXt,.. BSD-3
- torchsde (🥈29 · ⭐ 1.3K · 💀) - Differentiable SDE solvers with GPU support and efficient.. Apache-2
- EfficientNet-PyTorch (🥈27 · ⭐ 7.4K · 💀) - A PyTorch implementation of EfficientNet and.. Apache-2
- pytorch-summary (🥈27 · ⭐ 3.8K · 💀) - Model summary in PyTorch similar to model.summary().. MIT
- SRU (🥉23 · ⭐ 2.1K · 💀) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755). MIT
- Torchmeta (🥉23 · ⭐ 1.8K · 💀) - A collection of extensions and data-loaders for few-shot.. MIT
- Higher (🥉23 · ⭐ 1.5K · 💀) - higher is a pytorch library allowing users to obtain higher.. Apache-2
- Antialiased CNNs (🥉22 · ⭐ 1.6K · 💀) - pip install antialiased-cnns to improve stability and.. ❗️CC BY-NC-SA 4.0
- EfficientNets (🥉22 · ⭐ 1.5K · 💀) - Pretrained EfficientNet, EfficientNet-Lite, MixNet,.. Apache-2
- AdaBound (🥉20 · ⭐ 2.9K · 💀) - An optimizer that trains as fast as Adam and as good as SGD. Apache-2
- Poutyne (🥉20 · ⭐ 550) - A simplified framework and utilities for PyTorch. ❗️LGPL-3.0
- pytorchviz (🥉19 · ⭐ 2.7K · 💀) - A small package to create visualizations of PyTorch execution.. MIT
- Lambda Networks (🥉19 · ⭐ 1.5K · 💀) - Implementation of LambdaNetworks, a new approach to.. MIT
- Performer Pytorch (🥉19 · ⭐ 960 · 💀) - An implementation of Performer, a linear attention-.. MIT
- micrograd (🥉18 · ⭐ 5.4K · 💀) - A tiny scalar-valued autograd engine and a neural net library.. MIT
- Tensor Sensor (🥉18 · ⭐ 730 · 💀) - The goal of this library is to generate more helpful.. MIT
- Torch-Struct (🥉17 · ⭐ 1.1K · 💀) - Fast, general, and tested differentiable structured.. MIT
- TorchDrift (🥉15 · ⭐ 290 · 💤) - Drift Detection for your PyTorch Models. Apache-2
- Pywick (🥉13 · ⭐ 390 · 💀) - High-level batteries-included neural network training.. ❗Unlicensed
Database Clients
Libraries for connecting to, operating, and querying databases.
🔗 best-of-python - DB Clients
Others
scipy (🥇49 · ⭐ 11K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
PyOD (🥇33 · ⭐ 7.2K) - A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly.. BSD-2
PennyLane (🥈32 · ⭐ 1.8K) - PennyLane is a cross-platform Python library for differentiable.. Apache-2
adapter-transformers (🥈 32 · ⭐ 1.7K) - Huggingface Transformers + Adapters =. Apache-2
alibi-detect (🥈 29 · ⭐ 1.9K) - Algorithms for outlier, adversarial and drift detection. Apache-2
TabPy (🥈29 · ⭐ 1.4K) - Execute Python code on the fly and display results in Tableau visualizations:. MIT
anomalib (🥈28 · ⭐ 2K) - An anomaly detection library comprising state-of-the-art algorithms.. Apache-2
metric-learn (🥉27 · ⭐ 1.3K · 💤) - Metric learning algorithms in Python. MIT
Feature Engine (🥉 25 · ⭐ 1.4K) - Feature engineering package with sklearn like functionality. BSD-3
MONAILabel (🥉24 · ⭐ 420) - MONAI Label is an intelligent open source image labeling and.. Apache-2
rrcf (🥉21 · ⭐ 440) - Implementation of the Robust Random Cut Forest algorithm for anomaly detection.. MIT
StreamAlert (🥉 20 · ⭐ 2.8K · 💤 ) - StreamAlert is a serverless, realtime data analysis.. Apache-2
- GitHub (👨💻 33 · 🔀 320 · 📋 340 - 24% open · ⏱️ 20.07.2022): git clone https://github.com/airbnb/streamalert
benchmark_VAE (🥉20 · ⭐ 1.4K) - Unifying Variational Autoencoder (VAE) implementations.. Apache-2
NeuralCompression (🥉17 · ⭐ 340) - A collection of tools for neural compression enthusiasts. MIT
traingenerator (🥉13 · ⭐ 1.3K · 💤) - A web app to generate template code for machine learning. MIT
- GitHub (👨💻 3 · 🔀 170 · 📋 16 - 81% open · ⏱️ 30.06.2022): git clone https://github.com/jrieke/traingenerator
- cleanlab (🥈30 · ⭐ 6.1K) - The standard data-centric AI package for data quality and machine.. ❗️AGPL-3.0
- pyclustering (🥈28 · ⭐ 1.1K · 💀) - pyclustering is a Python, C++ data mining library. BSD-3
- Cython BLIS (🥈28 · ⭐ 200) - Fast matrix-multiplication as a self-contained Python library no.. BSD-3
- minisom (🥉26 · ⭐ 1.3K) - MiniSom is a minimalistic implementation of the Self Organizing.. ❗️CC-BY-3.0
- metricflow (🥉24 · ⭐ 910) - MetricFlow allows you to define, build, and maintain metrics.. ❗Unlicensed
- findspark (🥉24 · ⭐ 480 · 💀) - Find pyspark to make it importable. BSD-3
- mlens (🥉23 · ⭐ 790 · 💀) - ML-Ensemble high performance ensemble learning. MIT
- vecstack (🥉22 · ⭐ 680 · 💀) - Python package for stacking (machine learning technique). ❗Unlicensed
- opyrator (🥉21 · ⭐ 2.9K · 💀) - Turns your machine learning code into microservices with web API,.. MIT
- scikit-rebate (🥉21 · ⭐ 380 · 💀) - A scikit-learn-compatible Python implementation of.. MIT
- impyute (🥉21 · ⭐ 340 · 💀) - Data imputations library to preprocess datasets with missing data. MIT
- apricot (🥉20 · ⭐ 480 · 💀) - apricot implements submodular optimization for the purpose of.. MIT
- pymdp (🥉20 · ⭐ 270) - A Python implementation of active inference for Markov Decision Processes. MIT
- baikal (🥉18 · ⭐ 600 · 💀) - A graph-based functional API for building complex scikit-learn.. BSD-3
- pandas-ml (🥉16 · ⭐ 310 · 💀) - pandas, scikit-learn, xgboost and seaborn integration. BSD-3
- nylon (🥉12 · ⭐ 82 · 💀) - An intelligent, flexible grammar of machine learning. MIT
Related Resources
- Papers With Code: Discover ML papers, code, and evaluation tables.
- Sotabench: Discover & compare open-source ML models.
- Google Dataset Search: Dataset search engine by Google.
- Dataset List: List of the biggest ML datasets from across the web.
- Awesome Public Datasets: A topic-centric list of open datasets.
- Best-of lists: Discover other best-of lists with awesome open-source projects on all kinds of topics.
- best-of-python-dev: A ranked list of awesome python developer tools and libraries.
- best-of-web-python: A ranked list of awesome python libraries for web development.
Contribution
Contributions are encouraged and always welcome! If you would like to add or update projects, choose one of the following ways:
- Open an issue: select one of the provided categories on the issue page and fill in the requested information.
- Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the GitHub UI.
If you would like to contribute to, or share suggestions regarding, the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you would like to create your own best-of list, we recommend following this guide.
For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.