Awesome colab notebooks collection for ML experiments

Research

name	description	authors	links	update
TabPFN	Neural network that learned to do tabular data prediction	Noah Hollmann Samuel Müller Katharina Eggensperger Frank Hutter	, , , , , blog post	31.05.2023
AudioLDM	Text-to-audio system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining latents	Haohe Liu Zehua Chen Yi Yuan others Xinhao Mei Xubo Liu Danilo Mandic Wenwu Wang Mark Plumbley	, , project	31.05.2023
AlphaFold	Highly accurate protein structure prediction	John Jumper Richard Evans Alexander Pritzel others Tim Green Michael Figurnov Olaf Ronneberger Kathryn Tunyasuvunakool Russ Bates Augustin Žídek Anna Potapenko Alex Bridgland Clemens Meyer Simon Kohl Andrew Ballard Bernardino Romera-Paredes Stanislav Nikolov Rishub Jain	blog post, blog post , paper, paper ,	03.05.2023
DFL-Colab	This project provides you IPython Notebook to use DeepFaceLab	chervonij	guide	30.04.2023
MiniGPT-4	Enhancing Vision-language Understanding with Advanced Large Language Models	Deyao Zhu Jun Chen Xiaoqian Shen others Xiang Li Mohamed Elhoseiny	, project , , ,	23.04.2023
YOLOv5	You Only Look Once	Glenn Jocher	data ,	21.04.2023
YOLOv3	You Only Look Once	Glenn Jocher	data ,	21.04.2023
Segment Anything	The Segment Anything Model produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image	Alexander Kirillov Eric Mintun Nikhila Ravi others Hanzi Mao Chloé Rolland Laura Gustafson Tete Xiao Spencer Whitehead Alex Berg Wan-Yen Lo Piotr Dollar Ross Girshick	blog post, blog post data website , ,	10.04.2023
EVA3D	High-quality unconditional 3D human generative model that only requires 2D image collections for training	Fangzhou Hong Zhaoxi Chen Yushi Lan others Liang Pan Ziwei Liu	project ,	06.04.2023
Stable Dreamfusion	Using a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis	Jiaxiang Tang Ben Poole Ajay Jain others Jon Barron Ben Mildenhall	, project , , ,	04.04.2023
Parallel WaveGAN	State-of-the-art non-autoregressive models to build your own great vocoder	Tomoki Hayashi	, , demo ,	03.04.2023
Wav2Lip	A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild	Prajwal Renukanand Rudrabha Mukhopadhyay Vinay Namboodiri C. V. Jawahar	data demo project	31.03.2023
UniFormer	Unified Transformer for Efficient Spatiotemporal Representation Learning	Kunchang Li Yali Wang Peng Gao others Guanglu Song Yu Liu Hongsheng Li Yu Qiao	, , , , , , , , , , , , , ,	31.03.2023
PIFuHD	Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization	Shunsuke Saito Tomas Simon Jason Saragih Hanbyul Joo	,	26.03.2023
Visual ChatGPT	Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting	Chenfei Wu Shengming Yin Weizhen Qi others Xiaodong Wang Zecheng Tang Nan Duan	, , , ,	15.03.2023
LaMa	Resolution-robust Large Mask Inpainting with Fourier Convolutions	Roman Suvorov Elizaveta Logacheva Anton Mashikhin others Anastasia Remizova Arsenii Ashukha Aleksei Silvestrov Naejin Kong Harshith Goka Kiwoong Park Victor Lempitsky	, , , project	15.02.2023
GPEN	GAN Prior Embedded Network for Blind Face Restoration in the Wild	Tao Yang Peiran Ren Xuansong Xie Lei Zhang	demo ,	15.02.2023
YOLOv6	Single-stage object detection framework dedicated to industrial applications	Kaiheng Weng Meng Cheng Yiduo Li others Xiangxiang Chu Xiaolin Wei	, blog post data , , , , , ,	14.02.2023
CutLER	Simple approach for training unsupervised object detection and segmentation models	Xudong Wang Rohit Girdhar Stella Yu Ishan Misra	, project	11.02.2023
Disco Diffusion	A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations	Max Ingham Adam Letts Daniel Russell Chigozie Nri	, ,	11.02.2023
DALL·E Mini	Generate images from a text prompt	Boris Dayma Suraj Patil Pedro Cuenca others Khalid Saifullah Tanishq Abraham Phúc H. Lê Khắc Luke Melas Ritobrata Ghosh	, , , , , blog post data ,	10.02.2023
Open-Unmix	A deep neural network reference implementation for music source separation, applicable for researchers, audio engineers and artists	Fabian-Robert Stöter Antoine Liutkus	data paper project	09.02.2023
OWL-ViT	Simple Open-Vocabulary Object Detection with Vision Transformers	Matthias Minderer Alexey Gritsenko Austin Stone others Maxim Neumann Dirk Weissenborn Alexey Dosovitskiy Aravindh Mahendran Anurag Arnab Mostafa Dehghani Zhuoran Shen Xiao Wang Xiaohua Zhai Thomas Kipf Neil Houlsby		08.02.2023
GrooVAE	Some applications of machine learning for generating and manipulating beats and drum performances	Jon Gillick Adam Roberts Jesse Engel	blog post data web app	01.02.2023
Multitrack MusicVAE	The models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord	Ian Simon Adam Roberts Colin Raffel others Jesse Engel Curtis Hawthorne Douglas Eck	blog post	01.02.2023
MusicVAE	A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music	Adam Roberts Jesse Engel Colin Raffel others Curtis Hawthorne Douglas Eck	blog post project	01.02.2023
Learning to Paint	Learning to Paint With Model-based Deep Reinforcement Learning	Manuel Romero		01.02.2023
VALL-E	Language modeling approach for text to speech synthesis	Chengyi Wang Sanyuan Chen Yu Wu others Ziqiang Zhang Long Zhou Shujie Liu Zhuo Chen Yanqing Liu Huaming Wang Jinyu Li Lei He Sheng Zhao Furu Wei	, project , , ,	18.01.2023
Instant-NGP	Instant Neural Graphics Primitives with a Multiresolution Hash Encoding	Thomas Müller Alex Evans Christoph Schied Alexander Keller	blog post , , , , project tutorial , , ,	18.01.2023
Fourier Feature Networks	Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains	Matthew Tancik Pratul Srinivasan Ben Mildenhall others Sara Fridovich-Keil Nithin Raghavan Utkarsh Singhal Ravi Ramamoorthi Jon Barron Ren Ng	, project	17.01.2023
HybrIK	Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation	Jiefeng Li Chao Xu Zhicun Chen others Siyuan Bian Lixin Yang Cewu Lu	project supp	01.01.2023
First Order Motion Model for Image Animation	Transferring facial movements from video to image	Aliaksandr Siarohin	project	08.12.2022
FILM	A frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion	Fitsum Reda Janne Kontkanen Eric Tabellion others Deqing Sun Caroline Pantofaru Brian Curless	data, data, data project , ,	26.11.2022
Demucs	Hybrid Spectrogram and Waveform Source Separation	Alexandre Défossez	, , , , , ,	21.11.2022
ESM	Evolutionary Scale Modeling: Pretrained language models for proteins	Zeming Lin Roshan Rao Brian Hie others Zhongkai Zhu Allan dos Santos Costa Maryam Fazel-Zarandi Tom Sercu Salvatore Candido Alexander Rives Joshua Meier Robert Verkuil Jason Liu Chloe Hsu Adam Lerer	ESM Atlas FSDP ICML data paper, paper, paper, paper pubmed ,	02.11.2022
Musika	Music generation system that can be trained on hundreds of hours of music using a single consumer GPU, and that allows for much faster than real-time generation of music of arbitrary length on a consumer CPU	Marco Pasini Jan Schlüter	, data , project ,	29.10.2022
ICON	Given a set of images, method estimates a detailed 3D surface from each image and then combines these into an animatable avatar	Yuliang Xiu Jinlong Yang Dimitrios Tzionas Michael Black	, , , , , , , project	25.10.2022
MotionDiffuse	The first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods	Mingyuan Zhang Zhongang Cai Liang Pan others Fangzhou Hong Xinying Guo Lei Yang Ziwei Liu	project	13.10.2022
VToonify	Leverages the mid- and high-resolution layers of StyleGAN to render high-quality artistic portraits based on the multi-scale content features extracted by an encoder to better preserve the frame details	Shuai Yang Liming Jiang Ziwei Liu Chen Change Loy	, , , , project	07.10.2022
PyMAF	Pyramidal Mesh Alignment Feedback loop in regression network for well-aligned body mesh recovery and extend it for the recovery of expressive full-body models	Hongwen Zhang Yating Tian Yuxiang Zhang others Mengcheng Li Liang An Zhenan Sun Yebin Liu	, , , , project ,	06.10.2022
AlphaTensor	Discovering faster matrix multiplication algorithms with reinforcement learning	Alhussein Fawzi Matej Balog Aja Huang others Thomas Hubert Bernardino Romera-Paredes Mohammadamin Barekatain Alexander Novikov Francisco Ruiz Julian Schrittwieser Grzegorz Swirszcz David Silver Demis Hassabis Pushmeet Kohli	blog post paper , , ,	04.10.2022
Swin2SR	Novel Swin Transformer V2, to improve SwinIR for image super-resolution, and in particular, the compressed input scenario	Marcos Conde Ui-Jin Choi Maxime Burchi Radu Timofte	, , , , , , ,	03.10.2022
Thin-Plate Spline Motion Model	End-to-end unsupervised motion transfer framework	Jian Zhao Hui Zhang	, , , supp	30.09.2022
Functa	From data to functa: Your data point is a function and you can treat it like one	Emilien Dupont Hyunjik Kim Ali Eslami others Danilo Rezende Dan Rosenbaum	,	24.09.2022
Whisper	Automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web	Alec Radford Jong Wook Kim Tao Xu others Greg Brockman Christine McLeavey Ilya Sutskever	blog post , ,	21.09.2022
DeOldify (video)	Colorize your own videos!	Jason Antic	, model , ,	19.09.2022
DeOldify (photo)	Colorize your own photos!	Jason Antic Matt Robinson María Benavente	, model	19.09.2022
Real-ESRGAN	Extend the powerful ESRGAN to a practical restoration application, which is trained with pure synthetic data	Xintao Wang Liangbin Xie Chao Dong Ying Shan	, , , ,	18.09.2022
IDE-3D	Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis	Jingxiang Sun Xuan Wang Yichun Shi others Lizhen Wang Jue Wang Yebin Liu	, , ,	08.09.2022
Decision Transformers	An architecture that casts the problem of RL as conditional sequence modeling	Lili Chen Kevin Lu Aravind Rajeswaran others Kimin Lee Aditya Grover Michael Laskin Pieter Abbeel Aravind Srinivas Igor Mordatch	, , project , , ,	06.09.2022
Dream Fields	Zero-Shot Text-Guided Object Generation	Ajay Jain Ben Mildenhall Jon Barron others Pieter Abbeel Ben Poole	, , , project	05.09.2022
GANgealing	Framework for learning discriminative models and their GAN-generated training data jointly end-to-end	William Peebles Jun-Yan Zhu Richard Zhang others Antonio Torralba Alexei Efros Eli Shechtman	, , project ,	01.09.2022
textual-inversion	An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion	Rinon Gal Yuval Alaluf Yuval Atzmon others Or Patashnik Amit Bermano Gal Chechik Daniel Cohen-Or	project ,	21.08.2022
StyleGAN-Human	A Data-Centric Odyssey of Human Generation	Jianglin Fu Shikai Li Yuming Jiang others Kwan-Yee Lin Chen Qian Chen Change Loy Wayne Wu Ziwei Liu	, , project , , ,	19.08.2022
Make-A-Scene	Scene-Based Text-to-Image Generation with Human Priors	Oran Gafni Adam Polyak Oron Ashual others Shelly Sheynin Devi Parikh Yaniv Taigman		12.08.2022
StyleGAN-NADA	Zero-Shot non-adversarial domain adaptation of pre-trained generators	Rinon Gal Or Patashnik Haggai Maron others Gal Chechik Daniel Cohen-Or	, , project	09.08.2022
YOLOv7	Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors	Chien-Yao Wang Alexey Bochkovskiy Mark Liao	data, data, data, data , , , , , , ,	09.08.2022
Anycost GAN	Interactive natural image editing	Ji Lin Richard Zhang Frieder Ganz others Song Han Jun-Yan Zhu	, , , , project	20.07.2022
GFPGAN	Towards Real-World Blind Face Restoration with Generative Facial Prior	Xintao Wang Yu Li Honglun Zhang Ying Shan	, , project	13.07.2022
EPro-PnP	Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation	Hansheng Chen Pichao Wang Fan Wang others Wei Tian Lu Xiong Hao Li	, , , , nuScenes	12.07.2022
VQ-Diffusion	Based on a VQ-VAE whose latent space is modeled by a conditional variant of the recently developed Denoising Diffusion Probabilistic Model	Shuyang Gu Dong Chen Jianmin Bao others Fang Wen Bo Zhang Dongdong Chen Lu Yuan Baining Guo Shuyang Gu Zhicong Tang	, ,	30.06.2022
OPT	Open Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet	Susan Zhang Stephen Roller Naman Goyal others Mikel Artetxe Moya Chen Christopher Dewan Mona Diab Xi Victoria Lin Todor Mihaylov Myle Ott Sam Shleifer Kurt Shuster Daniel Simig Punit Singh Koura Anjali Sridhar Tianlu Wang Luke Zettlemoyer	, , , blog post	29.06.2022
Customizing a Transformer Encoder	We will learn how to customize the encoder to employ new network architectures	Chen Chen		22.06.2022
MTTR	End-to-End Referring Video Object Segmentation with Multimodal Transformers	Adam Botach Evgenii Zheltonozhskii Chaim Baskin	, ,	20.06.2022
SwinIR	Image Restoration Using Swin Transformer	Jingyun Liang Jiezhang Cao Guolei Sun others Kai Zhang Luc Van Gool Radu Timofte	, , ,	17.06.2022
VRT	A Video Restoration Transformer	Jingyun Liang Jiezhang Cao Yuchen Fan others Kai Zhang Yawei Li Radu Timofte Luc Van Gool	, ,	15.06.2022
Omnivore	A single model which excels at classifying images, videos, and single-view 3D data using exactly the same model parameters	Rohit Girdhar Mannat Singh Nikhila Ravi others Laurens Maaten Armand Joulin Ishan Misra	, project	14.06.2022
Detic	Detecting Twenty-thousand Classes using Image-level Supervision	Xingyi Zhou Rohit Girdhar Armand Joulin others Philipp Krähenbühl Ishan Misra		07.06.2022
AMARETTO	Multiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease	Nathalie Pochet Olivier Gevaert Mohsen Nabian others Jayendra Shinde Celine Everaert Thorin Tabor	bioconductor project	01.06.2022
T0	Multitask Prompted Training Enables Zero-Shot Task Generalization	Victor Sanh Albert Webson Colin Raffel others Stephen Bach Lintang Sutawika Zaid Alyafeai Antoine Chaffin Arnaud Stiegler Teven Le Scao Arun Raja Manan Dey M Saiful Bari Canwen Xu Urmish Thakker Shanya Sharma Eliza Szczechla Taewoon Kim Gunjan Chhablani Nihal Nayak Debajyoti Datta Jonathan Chang Mike Tian-Jian Jiang Matteo Manica Sheng Shen Zheng Xin Yong Harshit Pandey Rachel Bawden Trishala Neeraj Jos Rozen Abheesht Sharma Andrea Santilli Thibault Fevry Jason Alan Fries Ryan Teehan Stella Biderman Leo Gao Tali Bers Thomas Wolf Alexander M. Rush	,	29.05.2022
AvatarCLIP	A zero-shot text-driven framework for 3D avatar generation and animation	Fangzhou Hong Mingyuan Zhang Liang Pan others Zhongang Cai Lei Yang Ziwei Liu	, , , , data , , , , project	15.05.2022
Text2Mesh	Text-Driven Neural Stylization for Meshes	Oscar Michel Roi Bar-On Richard Liu others Sagie Benaim Rana Hanocka	CLIP project	14.05.2022
T5	Text-To-Text Transfer Transformer	Colin Raffel Noam Shazeer Adam Roberts others Katherine Lee Sharan Narang Michael Matena Yanqi Zhou Wei Li Peter J. Liu		11.05.2022
XLS-R	Self-supervised Cross-lingual Speech Representation Learning at Scale	Arun Babu Changhan Wang Andros Tjandra others Kushal Lakhotia Qiantong Xu Naman Goyal Kritika Singh Patrick von Platen Yatharth Saraf Juan Pino Alexei Baevski Alexis Conneau Michael Auli	blog post	10.05.2022
DiffCSE	Unsupervised contrastive learning framework for learning sentence embeddings	Yung-Sung Chuang Rumen Dangovski Hongyin Luo others Yang Zhang Shiyu Chang Marin Soljačić Shang-Wen Li Scott Wen-tau Yih Yoon Kim James Glass	, ,	24.04.2022
ViDT+	An Extendable, Efficient and Effective Transformer-based Object Detector	Hwanjun Song Deqing Sun Sanghyuk Chun others Varun Jampani Dongyoon Han Byeongho Heo Wonjae Kim Ming-Hsuan Yang	, ,	20.04.2022
NAFNet	Nonlinear Activation Free Network for Image Restoration	Liangyu Chen Xiaojie Chu Xiangyu Zhang Jian Sun	, ,	15.04.2022
Panini-Net	GAN Prior based Degradation-Aware Feature Interpolation for Face Restoration	Yinhuai Wang Yujie Hu Jian Zhang	,	13.04.2022
Deep Painterly Harmonization	Algorithm produces significantly better results than photo compositing or global stylization techniques and that it enables creative painterly edits that would be otherwise difficult to achieve	Fujun Luan Sylvain Paris Eli Shechtman Kavita Bala	, , ,	07.04.2022
E2FGVI	An End-to-End framework for Flow-Guided Video Inpainting through elaborately designed three trainable modules, namely, flow completion, feature propagation, and content hallucination modules	Zhen Li Cheng-Ze Lu Jianhua Qin others Chun-Le Guo Ming-Ming Cheng	data, data , , , ,	06.04.2022
LDM	High-Resolution Image Synthesis with Latent Diffusion Models	Robin Rombach Andreas Blattmann Dominik Lorenz others Patrick Esser Björn Ommer	, , , , ,	04.04.2022
GP-UNIT	Novel framework, Generative Prior-guided UNsupervised Image-to-image Translation, to improve the overall quality and applicability of the translation algorithm	Shuai Yang Liming Jiang Ziwei Liu Chen Change Loy	ImageNet , , , , , project	02.04.2022
DualStyleGAN	More challenging exemplar-based high-resolution portrait style transfer by introducing a novel DualStyleGAN with flexible control of dual styles of the original face domain and the extended artistic portrait domain	Shuai Yang Liming Jiang Ziwei Liu Chen Change Loy	data, data , , , project	24.03.2022
CLIPasso	Semantically-Aware Object Sketching	Yael Vinker Ehsan Pajouheshgar Jessica Y. Bo others Roman Bachmann Amit Bermano Daniel Cohen-Or Amir Zamir Ariel Shamir	, demo project	21.03.2022
StyleSDF	A high resolution, 3D-consistent image and shape generation technique	Roy Or-El Xuan Luo Mengyi Shan others Eli Shechtman Jeong Joon Park Ira Kemelmacher-Shlizerman	, project	05.03.2022
VideoGPT	A conceptually simple architecture for scaling likelihood based generative modeling to natural videos	Wilson Yan Yunzhi Zhang Pieter Abbeel Aravind Srinivas	, data project	02.03.2022
Disentangled Lifespan Face Synthesis	LFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively	Sen He Wentong Liao Michael Yang others Yi-Zhe Song Bodo Rosenhahn Tao Xiang	project	22.02.2022
Mask2Former	Masked-attention Mask Transformer for Universal Image Segmentation	Bowen Cheng Ishan Misra Alexander Schwing others Alexander Kirillov Rohit Girdhar	, demo project	09.02.2022
SpecVQGAN	Taming the visually guided sound generation by shrinking a training dataset to a set of representative vectors	Vladimir Iashin Esa Rahtu	, , , , , , , , project , ,	03.02.2022
JoJoGAN	One Shot Face Stylization	Min Jin Chong David Forsyth	,	02.02.2022
Pose with Style	Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN	Badour AlBahar Jingwan Lu Jimei Yang others Zhixin Shu Eli Shechtman Jia-Bin Huang	project	19.01.2022
ConvNeXt	A pure ConvNet model constructed entirely from standard ConvNet modules	Zhuang Liu Hanzi Mao Chao-Yuan Wu others Christoph Feichtenhofer Trevor Darrell Saining Xie	, , , ,	19.01.2022
diffsort	Differentiable Sorting Networks	Felix Petersen Christian Borgelt Hilde Kuehne Oliver Deussen	,	17.01.2022
Taming Transformers for High-Resolution Image Synthesis	We combine the efficiancy of convolutional approaches with the expressivity of transformers by introducing a convolutional VQGAN, which learns a codebook of context-rich visual parts, whose composition is modeled with an autoregressive transformer	Patrick Esser Robin Rombach Björn Ommer	project	13.01.2022
FuseDream	Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization	Xingchao Liu Chengyue Gong Lemeng Wu others Hao Su Qiang Liu		02.01.2022
GLIDE	Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models	Alex Nichol Prafulla Dhariwal Aditya Ramesh others Pranav Shyam Pamela Mishkin Bob McGrew Ilya Sutskever Mark Chen		22.12.2021
Music Composer	Synthesizing symbolic music in MIDI format using the Music Transformer model	bazanovvanya	blog post data, data , ,	20.12.2021
PoolFormer	MetaFormer Is Actually What You Need for Vision	Weihao Yu Mi Luo Pan Zhou others Chenyang Si Yichen Zhou Xinchao Wang Jiashi Feng Shuicheng Yan	, ,	05.12.2021
HyperStyle	A hypernetwork that learns to modulate StyleGAN's weights to faithfully express a given image in editable regions of the latent space	Yuval Alaluf Omer Tov Ron Mokady others Rinon Gal Amit Bermano	, , , data , , , , , , , project	03.12.2021
encoder4editing	Designing an Encoder for StyleGAN Image Manipulation	Omer Tov Yuval Alaluf Yotam Nitzan others Or Patashnik Daniel Cohen-Or		02.12.2021
StyleCariGAN	Caricature Generation via StyleGAN Feature Map Modulation	Wonjong Jang Gwangjin Ju Yucheol Jung others Jiaolong Yang Xin Tong Seungyong Lee	, project	30.11.2021
CartoonGAN	The implementation of the cartoon GAN model with PyTorch	Tobias Sunderdiek	CVPR project	24.11.2021
SimSwap	An efficient framework, called Simple Swap, aiming for generalized and high fidelity face swapping	Xuanhong Chen Bingbing Ni Yanhao Ge		24.11.2021
RVM	Robust High-Resolution Video Matting with Temporal Guidance	Peter Lin Linjie Yang Imran Saleemi Soumyadip Sengupta	, project ,	24.11.2021
AnimeGANv2	An improved version of AnimeGAN - it prevents the generation of high-frequency artifacts by simply changing the normalization of features in the network	Xin Chen Gang Liu bryandlee	, project	17.11.2021
SOAT	StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN	Min Jin Chong Hsin-Ying Lee David Forsyth	,	13.11.2021
Arnheim	Generative Art Using Neural Visual Grammars and Dual Encoders	Chrisantha Fernando Ali Eslami Jean-Baptiste Alayrac others Piotr Mirowski Dylan Banarse Simon Osindero	, , , , , , ,	11.11.2021
StyleGAN 2	Generation of faces, cars, etc.	Mikael Christensen		05.11.2021
ruDALL·E	Generate images from texts in Russian	Alex Shonenkov	, project	03.11.2021
ByteTrack	Multi-Object Tracking by Associating Every Detection Box	Yifu Zhang Peize Sun Yi Jiang others Dongdong Yu Ping Luo Xinggang Wang	data, data , , ,	30.10.2021
StyleGAN3	Alias-Free Generative Adversarial Networks	Tero Karras Miika Aittala Samuli Laine others Erik Härkönen Janne Hellsten Jaakko Lehtinen Timo Aila	, , , , , , , , , project	19.10.2021
GPT-2	Retrain an advanced text generating neural network on any text dataset using gpt-2-simple!	Max Woolf	blog post	18.10.2021
ConvMixer	An extremely simple model that is similar in spirit to the ViT and the even-more-basic MLP-Mixer in that it operates directly on patches as input, separates the mixing of spatial and channel dimensions, and maintains equal size and resolution throughout the network	Asher Trockman Zico Kolter	,	05.10.2021
IC-GAN	Instance-Conditioned GAN	Arantxa Casanova Marlène Careil Jakob Verbeek others Michał Drożdżal Adriana Romero-Soriano	blog post , , , ,	01.10.2021
Skillful Precipitation Nowcasting Using Deep Generative Models of Radar	Open-sourced dataset and model snapshot for precipitation nowcasting	Suman Ravuri Karel Lenc Matthew Willson others Dmitry Kangin Rémi Lam Piotr Mirowski Maria Athanassiadou Sheleem Kashem Rachel Prudden Amol Mandhane Aidan Clark Andrew Brock Karen Simonyan Raia Hadsell Niall Robinson Ellen Clancy Shakir Mohamed	blog post local kernel paper	29.09.2021
Live Speech Portraits	Real-Time Photorealistic Talking-Head Animation	Yuanxun Lu Jinxiang Chai Xun Cao	, , , project	26.09.2021
StylEx	Training a GAN to explain a classifier in StyleSpace	Oran Lang Yossi Gandelsman Michal Yarom others Yoav Wald Gal Elidan Avinatan Hassidim William Freeman Phillip Isola Amir Globerso Michal Irani Inbar Mosseri	, , , , blog post project supplementary	25.08.2021
Bringing Old Photo Back to Life	Restoring old photos that suffer from severe degradation through a deep learning approach	Ziyu Wan Bo Zhang Dongdong Chen others Pan Zhang Dong Chen Jing Liao Fang Wen	demo project	13.07.2021
PTI	Pivotal Tuning Inversion enables employing off-the-shelf latent based semantic editing techniques on real images using StyleGAN	Daniel Roich Ron Mokady Amit Bermano Daniel Cohen-Or	,	01.07.2021
TediGAN	Framework for multi-modal image generation and manipulation with textual descriptions	Weihao Xia Yujiu Yang Jing-Hao Xue Baoyuan Wu	, , , ,	30.06.2021
GANs N' Roses	Stable, Controllable, Diverse Image to Image Translation	Min Jin Chong David Forsyth	, ,	19.06.2021
Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes	A method to stylize images by optimizing parameterized brushstrokes instead of pixels	Dmytro Kotovenko Matthias Wright Arthur Heimbrecht Björn Ommer	project	02.06.2021
Pixel2Style2Pixel	Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation	Elad Richardson Yuval Alaluf Yotam Nitzan Daniel Cohen-Or	, project	01.06.2021
Fine-tuning a BERT	We will work through fine-tuning a BERT model using the tensorflow-models PIP package	Chen Chen Claire Yao		24.05.2021
ReStyle	A Residual-Based StyleGAN Encoder via Iterative Refinement	Yuval Alaluf Or Patashnik Daniel Cohen-Or	, , , project	21.05.2021
Motion Representations for Articulated Animation	Novel motion representations for animating articulated objects consisting of distinct parts	Aliaksandr Siarohin Oliver Woodford Jian Ren others Menglei Chai Sergey Tulyakov	project	29.04.2021
SAM	Age Transformation Using a Style-Based Regression Model	Yuval Alaluf Or Patashnik Daniel Cohen-Or	, project	26.04.2021
SkinDeep	Remove Body Tattoo Using Deep Learning	Vijish Madhavan	, , ,	24.04.2021
Geometry-Free View Synthesis	Is a geometric model required to synthesize novel views from a single image?	Robin Rombach Patrick Esser Björn Ommer	data	22.04.2021
NeRViS	An algorithm for full-frame video stabilization by first estimating dense warp fields	Yu-Lun Liu Wei-Sheng Lai Ming-Hsuan Yang others Yung-Yu Chuang Jia-Bin Huang	data , project	11.04.2021
NeX	View synthesis based on enhancements of multiplane image that can reproduce NeXt-level view-dependent effects in real time	Suttisak Wizadwongsa Pakkapon Phongthawee Jiraphon Yenphraphai Supasorn Suwajanakorn	data, data project vistec	25.03.2021
Score SDE	Score-Based Generative Modeling through Stochastic Differential Equations	Yang Song Jascha Sohl-Dickstein Diederik Kingma others Abhishek Kumar Stefano Ermon Ben Poole	, , , ,	18.03.2021
Big Sleep	Text to image generation, using OpenAI's CLIP and a BigGAN	Phil Wang	, ,	17.03.2021
Deep Daze	Text to image generation using OpenAI's CLIP and Siren	Phil Wang	,	17.03.2021
Talking Head Anime from a Single Image	The network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given pose	Pramook Khungurn	project , , ,	23.02.2021
NFNet	An adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets	Andrew Brock Soham De Samuel L. Smith Karen Simonyan	,	17.02.2021
CLIP	A neural network which efficiently learns visual concepts from natural language supervision	Jong Wook Alec Radford Ilya Sutskever	data paper project	29.01.2021
Adversarial Patch	A method to create universal, robust, targeted adversarial image patches in the real world	Tom Brown		27.01.2021
MSG-Net	Multi-style Generative Network with a novel Inspiration Layer, which retains the functionality of optimization-based approaches and has the fast speed of feed-forward networks	Hang Zhang Kristin Dana	project	25.01.2021
HiDT	A generative image-to-image model and a new upsampling scheme that allows to apply image translation at high resolution	Denis Korzhenkov Gleb Sterkin Sergey Nikolenko Victor Lempitsky	project ,	24.01.2021
Neural Style Transfer	Implementation of Neural Style Transfer in Keras 2.0+	Somshubra Majumdar	, ,	22.01.2021
SkyAR	A vision-based method for video sky replacement and harmonization, which can automatically generate realistic and dramatic sky backgrounds in videos with controllable styles	Zhengxia Zou	project	18.01.2021
Big GAN	Large Scale GAN Training for High Fidelity Natural Image Synthesis	Google		12.01.2021
MusicXML Documentation	The goal of this notebook is to explore one of the magenta libraries for music	Prakruti Joshi Falak Shah Twisha Naik	magenta music theory musicXML	08.01.2021
SVG VAE	A colab demo for the SVG VAE model	Raphael Gontijo Lopes	blog post	08.01.2021
Neural Magic Eye	Learning to See and Understand the Scene Behind an Autostereogram	Zhengxia Zou Tianyang Shi Yi Yuan Zhenwei Shi	project	01.01.2021
Flow-edge Guided Video Completion	Method first extracts and completes motion edges, and then uses them to guide piecewise-smooth flow completion with sharp edges	Chen Gao Ayush Saraf Johannes Kopf Jia-Bin Huang	project	30.12.2020
ArtLine	A Deep Learning based project for creating line art portraits	Vijish Madhavan	, , , data ,	24.12.2020
WikiArt (stylegan2-ada)	Generation of paintings of different styles and genres	Doron Adler		08.12.2020
GANSpace	A simple technique to analyze GANs and create interpretable controls for image synthesis, such as change of viewpoint, aging, lighting, and time of day	Erik Härkönen Aaron Hertzmann Jaakko Lehtinen Sylvain Paris	,	06.12.2020
SeFa	A closed-form approach for unsupervised latent semantic factorization in GANs	Yujun Shen Bolei Zhou	project	06.12.2020
Stylized Neural Painting	An image-to-painting translation method that generates vivid and realistic painting artworks with controllable styles	Zhengxia Zou Tianyang Shi Yi Yuan Zhenwei Shi	project	01.12.2020
MakeItTalk	A method that generates expressive talking-head videos from a single facial image with audio as the only input	Yang Zhou Xintong Han Eli Shechtman others Jose Echevarria Evangelos Kalogerakis Dingzeyu Li	data project	10.11.2020
LaSAFT	Latent Source Attentive Frequency Transformation for Conditioned Source Separation	Woosung Choi	data project	01.11.2020
Lifespan Age Transformation Synthesis	Multi-domain image-to-image generative adversarial network architecture, whose learned latent space models a continuous bi-directional aging process	Roy Or-El Soumyadip Sengupta Ohad Fried others Eli Shechtman Ira Kemelmacher-Shlizerman	, , project ,	31.10.2020
HiGAN	Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis	Ceyuan Yang Yujun Shen Bolei Zhou	, , project	14.10.2020
InterFaceGAN	Interpreting the Latent Space of GANs for Semantic Face Editing	Yujun Shen Jinjin Gu Xiaoou Tang Bolei Zhou	, , , project	13.10.2020
Faceswap-GAN	A minimum demo for faceswap-GAN v2.2	shaoanlu		12.09.2020
Instance-aware Image Colorization	Novel deep learning framework to achieve instance-aware colorization	Jheng-Wei Su	project	30.08.2020
MoCo	Momentum Contrast for unsupervised visual representation learning	Kaiming He Haoqi Fan Yuxin Wu others Saining Xie Ross Girshick	, , , ,	20.08.2020
Rewriting a Deep Generative Model	We ask if a deep network can be reprogrammed to follow different rules, by enabling a user to directly change the weights, instead of training with a data set	David Bau Steven Liu Tongzhou Wang others Jun-Yan Zhu Antonio Torralba	, , project ,	31.07.2020
BERT score	An automatic evaluation metric for text generation	Tianyi Zhang		17.07.2020
SIREN	Implicit Neural Representations with Periodic Activation Functions	Vincent Sitzmann Julien Martel	data project	24.06.2020
PIFu	Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization	Ryota Natsume Shunsuke Saito Zeng Huang others Angjoo Kanazawa Hao Li		17.06.2020
3D Ken Burns	A reference implementation of 3D Ken Burns Effect from a Single Image using PyTorch - given a single input image, it animates this still image with a virtual camera scan and zoom subject to motion parallax	Manuel Romero		13.06.2020
HRFAE	An encoder-decoder architecture for face age editing	Xu Yao Gilles Puy Alasdair Newson others Yann Gousseau Pierre Hellier	data ,	14.05.2020
Jukebox	A neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles	Christine Payne	blog post explorer	04.05.2020
3D Photo Inpainting	Method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view	Meng-Li Shih Shih-Yang Su Johannes Kopf Jia-Bin Huang	project	04.05.2020
Global Flow Local Attention	Differentiable global-flow local-attention framework to reassemble the inputs at the feature level	Yurui Ren Xiaoming Yu Junming Chen others Thomas Li Ge Li	, data , , project	30.04.2020
Motion Supervised co-part Segmentation	A self-supervised deep learning method for co-part segmentation	Aliaksandr Siarohin Subhankar Roy		07.04.2020
Onsets and Frames	Onsets and Frames is an automatic music transcription framework with piano and drums models	Curtis Hawthorne Erich Elsen	, , blog post data, data	02.04.2020
WikiArt (stylegan2)	Generation of paintings of different styles and genres	Doron Adler		27.01.2020
Siamese NN	Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task	Tomasz Latkowski	data	19.12.2019
Generating Piano Music with Transformer	This Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer	Ian Simon Anna Huang Jesse Engel Curtis Hawthorne	, blog post	16.09.2019
GMCNN	Generative Multi-column Convolutional Neural Networks inpainting model in Keras	Tomasz Latkowski	data, data	09.08.2019
BERT with TPU	Using a free Colab Cloud TPU to fine-tune sentence and sentence-pair classification tasks built on top of pretrained BERT models and run predictions on tuned model	Sourabh Bajaj	TPU quickstart	29.03.2019
GANSynth	This notebook is a demo GANSynth, which generates audio with Generative Adversarial Networks	Jesse Engel	project	25.02.2019
Latent Constraints	Conditional Generation from Unconditional Generative Models	Jesse Engel Matthew Hoffman Adam Roberts	data	27.11.2017
Performance RNN	This notebook shows you how to generate new performed compositions from a trained model	Ian Simon Sageev Oore Curtis Hawthorne	blog post data	11.07.2017
NSynth	This colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them	Jesse Engel Cinjon Resnick Adam Roberts others Sander Dieleman Karen Simonyan Mohammad Norouzi Douglas Eck	blog post data tutorial ,	06.04.2017

Tutorials

name	description	authors	links	update
Nerfstudio	API that allows for a simplified end-to-end process of creating, training, and testing NeRFs	Matthew Tancik Ethan Weber Evonne Ng others Ruilong Li Brent Yi Justin Kerr Terrance Wang Alexander Kristoffersen Jake Austin Kamyar Salahi Abhik Ahuja David McAllister Angjoo Kanazawa	Viewer , , ,	31.05.2023
Stable Diffusion 2	New stable diffusion model at 768x768 resolution. Same number of parameters in the U-Net as 1.5, but uses OpenCLIP-ViT/H as the text encoder and is trained from scratch	Robin Rombach Andreas Blattmann Dominik Lorenz others Patrick Esser Björn Ommer qunash	, , , , , , , , , , , ,	26.05.2023
SoftVC VITS	Singing Voice Conversion	svc develop team	, , , , ,	26.05.2023
Detectron2	FAIR's next-generation platform for object detection and segmentation	Yuxin Wu	blog post	26.05.2023
Anomalib	Deep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets	Samet Akcay Dick Ameln Ashwin Vaidya others Barath Lakshmanan Nilesh Ahuja Utku Genc	data ,	24.05.2023
Deforum Stable Diffusion	Open source project is designed to be free to use and easy to modify for custom needs and pipelines	EnzymeZoo Артем Храпов Forest Star Walz pharmapsychotic	project , ,	18.05.2023
Building Your Own Federated Learning Algorithm	We discuss how to implement federated learning algorithms without deferring to the tff.learning API	Zachary Charles	blog post	18.05.2023
Federated Learning for Image Classification	We use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlow	Krzysztof Ostrowski	data ,	18.05.2023
Federated Learning for Text Generation	We start with a RNN that generates ASCII characters, and refine it via federated learning	Krzysztof Ostrowski	, data, data	18.05.2023
Custom Federated Algorithms, Part 1: Introduction to the Federated Core	This tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layer	Krzysztof Ostrowski	,	18.05.2023
Custom Federated Algorithms, Part 2: Implementing Federated Averaging	This tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layer	Krzysztof Ostrowski	,	18.05.2023
TFF for Federated Learning Research: Model and Update Compression	We use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithm	Weikang Song	tensor encoding ,	18.05.2023
High-performance simulations with TFF	This tutorial will describe how to setup high-performance simulations with TFF in a variety of common scenarios	Krzysztof Ostrowski		18.05.2023
Deep RL Course	The Hugging Face Deep Reinforcement Learning Course	Thomas Simonini Omar Sanseviero Sayak Paul	, syllabus , ,	16.05.2023
MMAction2	An open-source toolbox for video understanding based on PyTorch	MMAction2 Contributors	, , , , , data, data, data, data , , , ,	15.05.2023
Ray	Unified framework for scaling AI and Python applications	Philipp Moritz Robert Nishihara Stephanie Wang others Alexey Tumanov Richard Liaw Eric Liang Melih Elibol Zongheng Yang William Paul Michael Jordan Ion Stoica	, , , , website , ,	08.05.2023
Python Data Science Handbook	Jupyter notebook version of the Python Data Science Handbook by Jake VanderPlas	Jake Vanderplas	project	05.05.2023
PGMax	General factor graphs for discrete probabilistic graphical models, and hardware-accelerated differentiable loopy belief propagation in JAX	Guangyao Zhou Nishanth Kumar Antoine Dedieu others Miguel Lázaro-Gredilla Shrinu Kushagra Dileep George		05.05.2023
MyoSuite	A collection of musculoskeletal environments and tasks simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API to enable the application of Machine Learning to bio-mechanic control problems	Vittorio Caggiano Huawei Wang Guillaume Durandau others Massimo Sartori Vikash Kumar		29.04.2023
StableLM	Stability AI Language Models	Stability AI	blog post , , , , , , , , , ,	27.04.2023
DeepFloyd IF	State-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding	Alex Shonenkov Misha Konstantinov Daria Bakshandaeva others Christoph Schuhman Ksenia Ivanova Nadiia Klokova	, , , , , website , ,	27.04.2023
TTS	A library for advanced Text-to-Speech generation, built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality	Eren Gölge Aya-AlJafari Edresson Casanova others Josh Meyer Kelly Davis Reuben Morais	blog post samples website , ,	26.04.2023
highway-env	A collection of environments for autonomous driving and tactical decision-making tasks	Edouard Leurent	, , , ,	22.04.2023
dm_control	DeepMind Infrastructure for Physics-Based Simulation	Saran Tunyasuvunakool Alistair Muldal Yotam Doron others Siqi Liu Steven Bohez Josh Merel Tom Erez Timothy Lillicrap Nicolas Heess Yuval Tassa	, , , , , blog post , ,	20.04.2023
MuJoCo	A general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment	Emo Todorov Tom Erez Yuval Tassa	blog post, blog post website , , , ,	20.04.2023
Composer	PyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracy	The Mosaic ML Team	app , blog post website , ,	19.04.2023
OpenCLIP	An open source implementation of CLIP	Ross Wightman Cade Gordon Vaishaal Shankar	, , , , , data, data, data , , , , , , ,	16.04.2023
Stable Baselines3	Set of reliable implementations of reinforcement learning algorithms in PyTorch	Antonin Raffin Ashley Hill Adam Gleave others Anssi Kanervisto Maximilian Ernestus Noah Dormann	, , paper	14.04.2023
RL Baselines3 Zoo	Training Framework for Stable Baselines3 Reinforcement Learning Agents	Antonin Raffin	, ,	14.04.2023
Petals	Run 100B+ language models at home, BitTorrent-style	BigScience	, , project	12.04.2023
SentencePiece	An unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training	Taku Kudo John Richardson	, , , , , , ,	08.04.2023
Transformer	This tutorial trains a Transformer model to translate Portuguese to English	Billy Lamberta	, link	07.04.2023
Brax	A differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators	Daniel Freeman Erik Frey Anton Raichuk others Sertan Girgin Igor Mordatch Olivier Bachem		30.03.2023
TorchGeo	PyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data	Adam Stewart Caleb Robinson Isaac Corley others Anthony Ortiz Juan Lavista Ferres Arindam Banerjee	NDBI NDVI NDWI data, data	29.03.2023
LAVIS	Python deep learning library for LAnguage-and-VISion intelligence research and applications	Dongxu Li Junnan Li Hung Le others Guangsen Wang Silvio Savarese Steven Hoi	, , , , blog post	24.03.2023
Hello, many worlds	This tutorial shows how a classical neural network can learn to correct qubit calibration errors	Michael Broughton	, ,	20.03.2023
Image segmentation	This tutorial focuses on the task of image segmentation, using a modified U-Net	Billy Lamberta	data u-net	17.03.2023
Tzer	Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation	Jiawei Liu Yuxiang Wei Sen Yang others Yinlin Deng Lingming Zhang	docker	09.03.2023
Pix2Pix	This notebook demonstrates image to image translation using conditional GAN's	Billy Lamberta	data	09.03.2023
Image classification	This tutorial shows how to classify images of flowers	Billy Lamberta		05.03.2023
Haiku	A library built on top of JAX designed to provide simple, composable abstractions for machine learning research	Tom Hennigan Trevor Cai Tamara Norman Igor Babuschkin	website	02.03.2023
Data augmentation	This tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotation	Billy Lamberta		02.03.2023
The Autodiff Cookbook	You'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics	Alex Wiltschko Matthew Johnson	, , , book, book , tutorial , , ,	01.03.2023
Simple audio recognition	This tutorial will show you how to build a basic speech recognition network that recognizes ten different words	Google	coursera tf.js	25.02.2023
normflows	PyTorch implementation of discrete normalizing flows	Vincent Stimper David Liu Andrew Campbell others Vincent Berenz Lukas Ryll Bernhard Schölkopf José Miguel Hernández-Lobato	,	24.02.2023
SAHI	A lightweight vision library for performing large scale object detection & instance segmentation	Fatih Cagatay Akyon Sinan Onur ALTINUÇ Alptekin Temizel others Cemil Cengiz Devrim Çavuşoğlu Kadir Şahin Oğulcan Eryüksel	,	23.02.2023
AmpliGraph	A suite of neural machine learning models for relational Learning, a branch of machine learning that deals with supervised learning on knowledge graphs	Luca Costabello Adrianna Janik Chan Le Van others Nicholas McCarthy Rory McGrath Sumit Pai	, , , , , ,	23.02.2023
Classify text with BERT	This tutorial contains complete code to fine-tune BERT to perform sentiment analysis on a dataset of plain-text IMDB movie reviews	Google	, data	15.02.2023
NMT with attention	This notebook trains a seq2seq model for Spanish to English translation	Billy Lamberta	, data	15.02.2023
GLUE using BERT on TPU	This tutorial contains complete end-to-end code to train models on a TPU	Google	GLUE	15.02.2023
Kornia	Library is composed by a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection that operate directly on tensors	Edgar Riba Dmytro Mishkin Daniel Ponsa others Ethan Rublee Gary Bradski	blog post website , ,	11.02.2023
Feast	An open source feature store for machine learning	Willem Pienaar Danny Chiao Achal Shah others Terence Lim Ches Martin Judah Rand Matt Delacour Miguel Trejo Marrufo Francisco Javier Arceo	, , , website ,	07.02.2023
High-performance Simulation with Kubernetes	This tutorial will describe how to set up high-performance simulation using a TFF runtime running on Kubernetes	Jason Roselander	GKE shell	31.01.2023
DALL·E Flow	An interactive workflow for generating high-definition images from text prompt	Han Xiao Delgermurun Purevkhuu Alex Cureton-Griffiths	, ,	26.01.2023
Home Robot	Low-level API for controlling various home robots	Chris Paxton	, , , , , , , ,	25.01.2023
Diffusers	Provides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion models	Hugging Face	, , , , , , , , , , ,	17.01.2023
Sample Factory	One of the fastest RL libraries focused on very efficient synchronous and asynchronous implementations of policy gradients	Aleksei Petrenko Zhehui Huang Tushar Kumar others Gaurav Sukhatme Vladlen Koltun	ICML	17.01.2023
Open-Assistant	Chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so	Andreas Köpf Yannic Kilcher Huu Nguyen others Christoph Schuhmann Keith Stevens Abdullah Barhoum Nguyen Minh Duc Oliver Stanley James Melvin Ebenezer	website , ,	14.01.2023
CleanRL	Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features	Shengyi Huang Rousslan Dossa Chang Ye others Jeff Braga Dipam Chakraborty Kinal Mehta João Araújo	, , , , , , , , , paper ,	12.01.2023
NeMo	A conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis	Oleksii Kuchaiev Jason Li Chip Huyen others Oleksii Hrinchuk Ryan Leary Boris Ginsburg Samuel Kriman Stanislav Beliaev Vitaly Lavrukhin Jack Cook	project	05.01.2023
BANMo	Given multiple casual videos capturing a deformable object, BANMo reconstructs an animatable 3D model, including an implicit canonical 3D shape, appearance, skinning weights, and time-varying articulations, without pre-defined shape templates or registered cameras	Gengshan Yang Minh Vo Natalia Neverova others Deva Ramanan Andrea Vedaldi Hanbyul Joo	, , , project ,	30.12.2022
Actor-Critic	This tutorial demonstrates how to implement the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment	Mark Daoust	gym ,	15.12.2022
PyG	Library built upon PyTorch to easily write and train Graph Neural Networks for a wide range of applications related to structured data	Matthias Fey Jan Eric Lenssen	, , , , , , , , , , , , , , , ,	08.12.2022
ruGPT3	Example of inference of RuGPT3XL	Anton Emelyanov	cristofari sparse attention	07.12.2022
Stable Diffusion Videos	Create videos with Stable Diffusion by exploring the latent space and morphing between text prompts	Nathan Raw	,	05.12.2022
PyTerrier	A Python framework for performing information retrieval experiments	Craig Macdonald Nicola Tonellotto	, , , , , ,	02.11.2022
Image captioning	Given an image our goal is to generate a caption	Billy Lamberta	data	26.10.2022
DSP theory	Theory of digital signal processing: signals, filtration (IIR, FIR, CIC, MAF), transforms (FFT, DFT, Hilbert, Z-transform) etc	Alexander Kapitanov Vladimir Fadeev Karina Kvanchiani others Elizaveta Petrova Andrei Makhliarchuk	blog post	18.10.2022
Mubert	Prompt-based music generation via Mubert API	Ilya Belikov	project , ,	18.10.2022
Neural style transfer	This tutorial uses deep learning to compose one image in the style of another image	Billy Lamberta		26.09.2022
ACME	A library of reinforcement learning components and agents	Matt Hoffman Bobak Shahriari John Aslanides others Gabriel Barth-Maron Feryal Behbahani Tamara Norman Abbas Abdolmaleki Albin Cassirer Fan Yang Kate Baumli Sarah Henderson Alex Novikov Sergio Gómez Colmenarejo Serkan Cabi Caglar Gulcehre Tom Le Paine Andrew Cowie Ziyu Wang Bilal Piot Nando de Freitas	blog post , ,	26.09.2022
Word2Vec	Word2Vec is not a singular algorithm, rather, it is a family of model architectures and optimizations that can be used to learn word embeddings from large datasets	Google	link , projector	23.09.2022
NetKet	Open-source project delivering cutting-edge methods for the study of many-body quantum systems with artificial neural networks and machine learning techniques	Filippo Vicentini Damian Hofmann Attila Szabó others Dian Wu Christopher Roth Clemens Giuliani Gabriel Pescia Jannes Nys Vladimir Vargas-Calderón Nikita Astrakhantsev Giuseppe Carleo Kenny Choo James Smith Tom Westerhout Fabien Alet Emily Davis Stavros Efthymiou Ivan Glasser Sheng-Hsuan Lin Marta Mauri Mazzola Guglielmo Christian Mendl Evert Nieuwenburg Ossian O'Reilly Hugo Théveniaut Giacomo Torlai Alexander Wietek	, website	15.09.2022
pymdp	Package for simulating Active Inference agents in Markov Decision Process environments	Conor Heins Alec Tschantz Beren Millidge others Brennan Klein Arun Niranjan Daphne Demekas		24.08.2022
Stable Diffusion	A latent text-to-image diffusion model	Robin Rombach Andreas Blattmann Dominik Lorenz others Patrick Esser Björn Ommer	, , , , , , , , , ,	10.08.2022
Deep-MAC	Welcome to the Novel class segmentation demo	Vighnesh Birodkar		09.08.2022
NL-Augmenter	A collaborative effort intended to add transformations of datasets dealing with natural language	Aadesh Gupta Timothy Sum Hon Mun Aditya Srivatsa others Xudong Shen Juan Diego Rodriguez Ashish Shrivastava Nagender Aneja Zijie Wang Yiwen Shi Afnan Mir William Soto Chandan Singh Claude Roux Abinaya Mahendiran Anna Shvets Kaustubh Dhole Bryan Wilie Jamie Simon Mukund Varma Sang Han Denis Kleyko Samuel Cahyawijaya Filip Cornell Tanay Dixit Connor Boyle Genta Indra Winata Seungjae Ryan Lee Marcin Namysl Roman Sitelew Zhenhao Li Fiona Tan	website	06.08.2022
CycleGAN	This notebook demonstrates unpaired image to image translation using conditional GAN's	Billy Lamberta		02.08.2022
Accelerate	A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision	Hugging Face		27.07.2022
YOLOv5 on Custom Objects	This notebook shows training on your own custom objects	Jacob Solawetz	blog post data	20.07.2022
Epistemic Neural Networks	A library for neural networks that know what they don't know	Ian Osband Zheng Wen Seyed Mohammad Asghari others Vikranth Dwaracherla Morteza Ibrahimi Xiuyuan Lu Benjamin Van Roy		12.07.2022
MindsEye	Graphical user interface built to run multimodal ai art models for free from a Google Colab, without needing edit a single line of code or know any programming	multimodal.art João Paulo Apolinário Passos	project	06.07.2022
py-irt	Fitting Item Response Theory models using variational inference	John Lalor Hong Yu Pedro Rodriguez others Joe Barrow Alexander Hoyle Robin Jia Jordan Boyd-Graber	paper	30.06.2022
Integrated gradients	This tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique	Google	visualizing , ,	30.06.2022
SberSwap	A new face swap method for image and video domains	Daniil Chesakov Anastasia Maltseva Alexander Groshev others Andrey Kuznetsov Denis Dimitrov	, , , , , blog post data	29.06.2022
BIG-bench	A collaborative benchmark intended to probe large language models and extrapolate their future capabilities	Jaehoon Lee Jascha Sohl-Dickstein Vinay Ramasesh others Sajant Anand Alicia Parrish Ethan Dyer Liam Dugan Dieuwke Hupkes Daniel Freeman Guy Gur-Ari Aitor Lewkowycz	API	27.06.2022
HuggingArtists	Choose your favorite Artist and train a language model to write new lyrics based on their unique voice	Aleksey Korshuk	,	25.06.2022
Introduction to the TensorFlow Models NLP library	You will learn how to build transformer-based models for common NLP tasks including pretraining, span labelling and classification using the building blocks from NLP modeling library	Chen Chen		22.06.2022
Cirq	A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum circuits	Balint Pato Matthew Harrigan Animesh Sinha others Matthew Neeley Dave Bacon Matteo Pompili Michael Broughton		21.06.2022
CLIP-as-service	A low-latency high-scalability service for embedding images and text	Han Xiao	data website ,	19.06.2022
Jina	MLOps framework that empowers anyone to build cross-modal and multi-modal applications on the cloud	Han Xiao	data hub ,	11.06.2022
Transfer learning and fine-tuning	You will learn how to classify images of cats and dogs by using transfer learning from a pre-trained network	François Chollet		07.06.2022
Evidently	An open-source framework to evaluate, test and monitor ML models in production	Elena Samuylova Emeli Dral Olga Filippova	website ,	30.05.2022
Tortoise	A multi-voice TTS system trained with an emphasis on quality	James Betker	, , examples ,	03.05.2022
Text generation with RNN	This tutorial demonstrates how to generate text using a character-based RNN	Billy Lamberta	link	02.05.2022
CLIPDraw	Synthesize drawings to match a text prompt	Kevin Frans Lisa Soros Olaf Witkowski	, , blog post	28.04.2022
deep-significance	Easy-to-use package containing different significance tests and utility functions specifically tailored towards research needs and usability	Dennis Ulmer Christian Hardmeier Jes Frellsen	blog post , ,	12.04.2022
Autoencoders	This tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detection	Google	blog post book data examples	05.04.2022
Text classification with RNN	This text classification tutorial trains a recurrent neural network on the IMDB large movie review dataset for sentiment analysis	Billy Lamberta	data link	17.03.2022
Real-Time Voice Cloning	SV2TTS with a vocoder that works in real-time	Corentin Jemine Erdene-Ochir Tuguldur	, , , , ,	07.03.2022
BLIP	VLP framework which transfers flexibly to both vision-language understanding and generation tasks	Junnan Li Dongxu Li Caiming Xiong Steven Hoi	blog post , , , ,	02.03.2022
Silero Models	Pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple	Silero team	STT, STT, STT TTS, TTS Text Enhancement VAD, VAD website	27.02.2022
ArcaneGAN	Process video in the style of the Arcane animated series	Alexander Spirin	,	17.02.2022
textlesslib	A library aimed to facilitate research in Textless NLP	Eugene Kharitonov Jade Copet Kushal Lakhotia others Nguyễn Tú Anh Paden Tomasello Ann Lee Ali Elkahky Wei-Ning Hsu Abdelrahman Mohamed Emmanuel Dupoux Yossi Adi	, , ,	15.02.2022
AV-HuBERT	Self-supervised representation learning framework for audio-visual speech	Bowen Shi Wei-Ning Hsu Kushal Lakhotia Abdelrahman Mohamed	, , , blog post	12.02.2022
Word embeddings	This tutorial contains an introduction to word embeddings	Billy Lamberta	data projector	15.01.2022
RuDOLPH	A fast and light text-image-text transformer designed for a quick and easy fine-tuning setup for the solution of various tasks: from generating images by text description and image classification to visual question answering and more	Alex Shonenkov Michael Konstantinov	, ,	14.01.2022
DeepDream	This tutorial contains a minimal implementation of DeepDream: an experiment that visualizes the patterns learned by a neural network	Billy Lamberta	blog post	13.01.2022
MLP	The most basic neural network architectures, a multilayer perceptron, also known as a feedforward network	Ben Trevett	NN and DL , , optimization ,	26.12.2021
AlexNet	A neural network model that uses convolutional neural network layers and was designed for the ImageNet challenge	Ben Trevett	ILSVRC LR PMLR cifar-10 dropout ,	26.12.2021
VGG	Very Deep Convolutional Networks for Large-Scale Image Recognition	Ben Trevett	ILSVRC , , , , cifar-10 , ,	26.12.2021
LeNet	A neural network model that uses convolutional neural network layers and was designed for classifying handwritten characters	Ben Trevett	CNN LeNet-5 guide paper , ,	26.12.2021
FLAML	Lightweight Python library that finds accurate machine learning models automatically, efficiently and economically	Chi Wang Qingyun Wu	, paper ,	17.12.2021
CompilerGym	A reinforcement learning toolkit for compiler optimizations	Chris Cummins Bram Wasti Jiadong Guo others Brandon Cui Jason Ansel Sahir Gomez Olivier Teytaud Benoit Steiner Yuandong Tian Hugh Leather		16.11.2021
DeepStyle	The Neural Style algorithm synthesizes a pastiche by separating and combining the content of one image with the style of another image using convolutional neural networks	Cameron Smith Alexander Spirin	, , cvpr , , , , , ,	01.10.2021
Text2Animation	Generate images from text phrases with VQGAN and CLIP with animation and keyframes	Katherine Crowson Ryan Murdock Chigozie Nri Denis Malimonov	,	29.09.2021
EfficientNetV2	A family of image classification models, which achieve better parameter efficiency and faster training speed than prior arts	Mingxing Tan Quoc Le	,	24.09.2021
Droidlet	A modular embodied agent architecture and platform for building embodied agents	Anurag Pratik Soumith Chintala Kavya Srinet others Dhiraj Gandhi Rebecca Qian Yuxuan Sun Ryan Drew Sara Elkafrawy Anoushka Tiwari Tucker Hart Mary Williamson Abhinav Gupta Arthur Szlam	,	15.09.2021
GPT-J-6B	A 6 billion parameter, autoregressive text generation model trained on The Pile	Ben Wang Aran Komatsuzaki Janko Prester	The Pile blog post , web demo	15.09.2021
Sentence Transformers	Multilingual Sentence, Paragraph, and Image Embeddings using BERT & Co	Nils Reimers Iryna Gurevych	, ,	13.09.2021
Machine learning course	This course is broad and shallow, but author will provide additional links so that you can deepen your understanding of the ML method you need	Тимчишин Віталій	blog post , , , , , , , , , ,	02.09.2021
Lucid Sonic Dreams	Syncs GAN-generated visuals to music	Mikael Alafriz	,	24.08.2021
textgenrnn	Generate text using a pretrained neural network with a few lines of code, or easily train your own text-generating neural network of any size and complexity	Max Woolf	blog post	13.07.2021
BasicSR	Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc.	Xintao Wang Liangbin Xie Ke Yu others Kelvin Chan Chen Change Loy Chao Dong	, , , , , , , , , , ,	07.06.2021
Hyperopt	Python library for serial and parallel optimization over awkward search spaces, which may include real-valued, discrete, and conditional dimensions	James Bergstra Dan Yamins David Cox	ICML , , , , , ,	01.06.2021
CNN	This tutorial demonstrates training a simple Convolutional Neural Network to classify CIFAR images	Billy Lamberta	cifar link	21.05.2021
Custom GPT-2 + Tokenizer	Train a custom GPT-2 model for free on a GPU using aitextgen!	Max Woolf	data	17.05.2021
Train a GPT-2 Text-Generating Model	Retrain an advanced text generating neural network on any text dataset for free on a GPU using Colaboratory using aitextgen!	Max Woolf	data	17.05.2021
EasyNMT	Easy to use, state-of-the-art machine translation for more than 100+ languages	Nils Reimers	, demo ,	26.04.2021
OCTIS	Framework for training, analyzing, and comparing Topic Models, whose optimal hyper-parameters are estimated using a Bayesian Optimization approach	Silvia Terragni Elisabetta Fersini Antonio Candelieri others Pietro Tropeano Bruno Galuzzi Lorenzo Famiglini Davide Pietrasanta	data, data , paper	19.04.2021
PyTorchVideo	Deeplearning library with a focus on video understanding work	Haoqi Fan Tullie Murrell Heng Wang others Kalyan Vasudev Alwala Yanghao Li Yilei Li Bo Xiong Nikhila Ravi Meng Li Haichuan Yang Jitendra Malik Ross Girshick Matt Feiszli Aaron Adcock Wan-Yen Lo Christoph Feichtenhofer	, blog post website	13.04.2021
GPT Neo	An implementation of model & data parallel GPT2 & GPT3 -like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library	EleutherAI	GPT-2 , , , pretrained	28.03.2021
CVAE	This notebook demonstrates how train a Variational Autoencoder on the MNIST dataset	Billy Lamberta	,	22.03.2021
DCGAN	This tutorial demonstrates how to generate images of handwritten digits using a Deep Convolutional Generative Adversarial Network	Billy Lamberta	,	12.03.2021
Adversarial FGSM	This tutorial creates an adversarial example using the Fast Gradient Signed Method attack. This was one of the first and most popular attacks to fool a neural network.	Billy Lamberta	imagenet	12.03.2021
GAN steerability	We will navigate in GAN latent space to simulate various camera transformations	Ali Jahanian Lucy Chai Phillip Isola	, project	04.03.2021
bsuite	A collection of carefully-designed experiments that investigate core capabilities of an RL agent with two main objectives	Ian Osband Yotam Doron Matteo Hessel others John Aslanides Eren Sezener Andre Saraiva Katrina McKinney Tor Lattimore Csaba Szepesvari Satinder Singh Benjamin Van Roy Richard Sutton David Silver Hado Van Hasselt	paper	13.02.2021
TF-Ranking	End-to-end walkthrough of training a TensorFlow Ranking neural network model which incorporates sparse textual features	Rama Kumar	, , , data ,	04.02.2021
Toon-Me	A fun project to toon portrait images	Vijish Madhavan	, ,	22.01.2021
TensorNetwork	A library for easy and efficient manipulation of tensor networks	Chase Roberts	,	21.01.2021
Spleeter	Deezer source separation library including pretrained models	Romain Hennequin Anis Khlif Félix Voituret Manuel Moussallam	blog post data project	10.01.2021
Person Remover	Project that combines Pix2Pix and YOLO arhitectures in order to remove people or other objects from photos	Javier Gamazo Daryl Autar	,	22.08.2020
Semantic Segmentation	Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset	Bolei Zhou Hang Zhao Xavier Puig others Sanja Fidler Antonio Torralba	, , , , , project	21.08.2020
CoVoST	A Large-Scale Multilingual Speech-To-Text Translation Corpus	Changhan Wang Juan Pino Jiatao Gu	, , , ,	07.08.2020
Analyzing Tennis Serve	We'll use the Video Intelligence API to analyze a tennis serve, including the angle of the arms and legs during the serve	Dale Markowitz	blog post	14.07.2020
Eager Few Shot Object Detection	Fine tuning of a RetinaNet architecture on very few examples of a novel class after initializing from a pre-trained COCO checkpoint	kmindspark	data	11.07.2020
Optuna	An automatic hyperparameter optimization software framework, particularly designed for machine learning	Takuya Akiba Shotaro Sano Toshihiko Yanase others Takeru Ohta Masanori Koyama	docker website , , ,	08.07.2020
YOLOv4	This tutorial will help you build YOLOv4 easily in the cloud with GPU enabled so that you can run object detections in milliseconds!	Alexey Bochkovskiy	, , project ,	25.06.2020
Context R-CNN Demo	This notebook will walk you step by step through the process of using a pre-trained model to build up a contextual memory bank for a set of images, and then detect objects in those images+context using Context R-CNN	pkulzc	data	17.06.2020
Background Matting	The notebook is split into three parts: required setup, running the algorithm on photos, and running it on videos	Andrey Ryabtsev	blog post data	18.05.2020
GAN Dissection	Visualizing and Understanding Generative Adversarial Networks	David Bau Jun-Yan Zhu Hendrik Strobelt others Bolei Zhou Joshua Tenenbaum William Freeman Antonio Torralba	, , demo , project	04.05.2020
Sonnet	Library built on top of TensorFlow 2 designed to provide simple, composable abstractions for machine learning research	Malcolm Reynolds Jack Rae Andreas Fidjeland others Fabio Viola Adrià Puigdomènech Frederic Besse Tim Green Sébastien Racanière Gabriel Barth-Maron Diego de Las Casas	blog post ,	17.04.2020
Classification of chest vs. adominal X-rays	The goal of this tutorial is to build a deep learning classifier to accurately differentiate between chest and abdominal X-rays	tmoneyx01	annotator	07.03.2020
Lung X-Rays Semantic Segmentation	This lesson applies a U-Net for Semantic Segmentation of the lung fields on chest x-rays	tmoneyx01	annotator data	07.03.2020
Earth Engine Python API and Folium Interactive Mapping	This notebook demonstrates how to setup the Earth Engine and provides several examples for visualizing Earth Engine processed data interactively using the folium library	Qiusheng Wu	api	20.01.2020
Train a GPT-2 Model on Tweets	Train the model on your downloaded tweets, and generate massive amounts of Tweets from it	Max Woolf	GPT-2	16.01.2020
Traffic counting	Making Road Traffic Counting App based on Computer Vision and OpenCV	Andrey Nikishaev		10.01.2020
Imagededup	This package provides functionality to make use of hashing algorithms that are particularly good at finding exact duplicates as well as convolutional neural networks which are also adept at finding near duplicates	Tanuj Jain Christopher Lennan Dat Tran	project	03.10.2019
automl-gs on a TPU	Give an input CSV file and a target field you want to predict to automl-gs, and get a trained high-performing machine learning or deep learning model plus native Python code pipelines allowing you to integrate that model into any prediction workflow	Max Woolf		26.03.2019
RSNA Pneumonia Detection Challenge (Kaggel API)	The basics of parsing the competition dataset, training using a detector basd on the Mask-RCNN algorithm for object detection and instance segmentation	tmoneyx01	annotator	03.09.2018
RSNA Pneumonia Detection Challenge (MD.ai API)	The basics of parsing the competition dataset, training using a detector basd on the Mask-RCNN algorithm for object detection and instance segmentation	tmoneyx01	annotator	29.08.2018
HoF	This notebook will walk you step by step through the process of using a pre-trained model to detect faces in an image	Lucas Persona	data yolo	15.03.2018

Best of the best

authors	repositories
Billy Lamberta Daniel Cohen-Or Ziwei Liu Jesse Engel Max Woolf Adam Roberts Eli Shechtman Björn Ommer Yuval Alaluf Google Chen Change Loy Patrick Esser Robin Rombach Curtis Hawthorne Or Patashnik Bolei Zhou Krzysztof Ostrowski	tensorflow/models CompVis/stable-diffusion CorentinJ/Real-Time-Voice-Cloning iperov/DeepFaceLab ultralytics/yolov5 jakevdp/PythonDataScienceHandbook openai/whisper facebookresearch/segment-anything LAION-AI/Open-Assistant microsoft/visual-chatgpt google-research/google-research TencentARC/GFPGAN pytorch/fairseq ray-project/ray facebookresearch/detectron2 Stability-AI/stablediffusion google/jax

authors

repositories

tensorflow/models
CompVis/stable-diffusion
CorentinJ/Real-Time-Voice-Cloning
iperov/DeepFaceLab
ultralytics/yolov5
jakevdp/PythonDataScienceHandbook
openai/whisper
facebookresearch/segment-anything
LAION-AI/Open-Assistant
microsoft/visual-chatgpt
google-research/google-research
TencentARC/GFPGAN
pytorch/fairseq
ray-project/ray
facebookresearch/detectron2
Stability-AI/stablediffusion
google/jax

(generated by generate_markdown.py based on research.json and tutorials.json)

amrzv/awesome-colab-notebooks

amrzv

Reviews

Repository Details

Awesome colab notebooks collection for ML experiments

Research

Tutorials

Best of the best

More Repositories