Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

CSS

MATLAB

R

Go

TypeScript

Groovy

Perl

Shell

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Elm

C#

Objective-C

Elixir

Ruby

R

PowerShell

Python

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇳🇵 Nepal

🇦🇬 Antigua and Barbuda

🇨🇿 Czechia

🇳🇿 New Zealand

🇯🇵 Japan

🇲🇶 Martinique

🇬🇺 Guam

🇬🇹 Guatemala

All Countries Compare Countries

Lyken17/pytorch-memonger

Stars
587
Rank 76,145 (Top 2 %)
Language
Python
License
MIT License
Created over 5 years ago
Updated almost 5 years ago

Lyken17/pytorch-memonger

Lyken17

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Sublinear memory optimization for deep learning. https://arxiv.org/abs/1604.06174

pytorch-memonger

This is a re-implementation of Training Deep Nets with Sublinear Memory Cost. You may also want to have a look at the original mxnet implementation and OpenAI's tensorflow implementation.

Speed / Memory Comparision

Model (Batch size 16)	Memory	Speed
original resnet152	5459MiB	2.9258 iter/s
Checkpoint (Sublinear)	2455MiB	2.6273 iter/s

How to use

Different from TensorFlow and mxnet where the computation graph is static and known before actual computing, pytorch's philosophy is define-by-run and the graph details are not known until forward is finished. This implemention only supports Sequential models. By replacing nn.Sequential with memonger.SublinearSequential, the memory required for backward is reduced from O(N) to O(sqrt(N)).

# previous, O(N) memory footprint
import torch.nn as nn
net1 = nn.Sequential(
    nn.Conv2d(3, 16, kernel=3, padding=1),
    nn.BatchNorm2d(16),
    nn.ReLU(),
    nn.Conv2d(16, 16, kernel=3, padding=1),
    nn.BatchNorm2d(16),
    nn.ReLU(),
    nn.Conv2d(16, 16, kernel=3, padding=1),
    nn.BatchNorm2d(16),
    nn.ReLU(),
    ...
)

# optimized, O(sqrt(N)) memory footprint
from memonger import SublinearSequential
net2 = SublinearSequential(
    *list(net1.children())  
)

Caution

Since sublinear memory optimization requires re-forwarding, if your model contains layer with non-derministic behavior (e.g, BatchNorm, Dropout), you need to be careful when using the module. I have supported BatchNorm by re-scaling momentum , dropout by memorizing the random number generator (RNG).

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Efficient-PyTorch

My best practice of training large dataset using PyTorch.

SparseNet

[ECCV 2018] Sparsely Aggreagated Convolutional Networks https://arxiv.org/abs/1801.05895

arXiv-stats

hf-torrent

mxbox

Simple, efficient and flexible vision toolbox for mxnet framework.

Bayesian-Compression-for-Deep-Learning

Remplementation of paper https://arxiv.org/abs/1705.08665

PyTorch-Template

A template for PyTorch projects.

Colorize-Your-World

Let there be color!

Jupyter Notebook

Machine-Learning-for-Image-Colorization

(Torch + Tensorflow) A deep magic brings color to your monochrome image!

GroupNorm.pytorch

PyTorch implementation of Group Normalization https://arxiv.org/abs/1803.08494

Colorizing-Color-Images

[HVEI 2018] Colorizing Color Images

Jupyter Notebook

Project-Page-Render

Echoo

Let your program echo to you.

arch-viz

hf-torrent-store

Deep-Learning-Live

From linear regression to multi-layer perceptron, an introductive tutorial for deep learning beginners.

MNasNet-TensorFlow

Implementation of MnasNet: Platform-Aware Neural Architecture Search for Mobile

tvm-notes

PyTorch-via-PyTorch

FlashATM

HW-for-COMP

edge-cloud-train

ffmpeg-cuda-docker

A docker container to launch GPU accelerated FFmpeg

pi-tools

A repo includes some useful tools for raspberry pi farm setup. https://hub.docker.com/repository/docker/lyken/pi-tools

EIE-pytorch

PyTorch implementation for EIE https://arxiv.org/abs/1602.01528

Jupyter Notebook

Colorize.PyTorch

torch-mps-benchmark

sample-video

micro23

GPU-Speed-Benchmark

tvm-issue-07-12

Docker-Horovod

Deep-Learning-Framework-Popularity

pythonLearn

BeihangData

tiny-whisper

Neurips19-Statistics

gluon-multiple-gpu

tvm-hack

lith

Ligeng's extensions for PyTorch

ubuntun-research

Common scripts I've used for setting up my ubuntu server

wandb-example