Implementation of LAMB (https://arxiv.org/abs/1904.00962) for large batch, large learning rate training.
The paper doesn't specify clamp values for ϕ, so this implementation clamps the weight norm at 10.
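For orientation, here is a minimal sketch of the LAMB step with that clamp. It is illustrative only: the class name `LambSketch` is made up, and the shipped `Lamb` class differs in details such as bias correction.

```python
import torch
from torch.optim import Optimizer

class LambSketch(Optimizer):
    """Illustrative LAMB step; not the exact code shipped in pytorch_lamb."""

    def __init__(self, params, lr=1e-3, betas=(0.9, 0.999), eps=1e-6, weight_decay=0.0):
        defaults = dict(lr=lr, betas=betas, eps=eps, weight_decay=weight_decay)
        super().__init__(params, defaults)

    @torch.no_grad()
    def step(self):
        for group in self.param_groups:
            beta1, beta2 = group['betas']
            for p in group['params']:
                if p.grad is None:
                    continue
                state = self.state[p]
                if not state:
                    state['exp_avg'] = torch.zeros_like(p)
                    state['exp_avg_sq'] = torch.zeros_like(p)
                exp_avg, exp_avg_sq = state['exp_avg'], state['exp_avg_sq']
                # Adam-style first and second moments (bias correction omitted for brevity)
                exp_avg.mul_(beta1).add_(p.grad, alpha=1 - beta1)
                exp_avg_sq.mul_(beta2).addcmul_(p.grad, p.grad, value=1 - beta2)
                update = exp_avg / (exp_avg_sq.sqrt() + group['eps'])
                if group['weight_decay'] != 0:
                    update = update + group['weight_decay'] * p
                # phi(||w||): clamp the weight norm at 10, since the paper leaves phi unspecified
                w_norm = min(p.norm().item(), 10.0)
                u_norm = update.norm().item()
                trust_ratio = w_norm / u_norm if w_norm > 0 and u_norm > 0 else 1.0
                p.add_(update, alpha=-group['lr'] * trust_ratio)
```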
Bonus: TensorboardX logging (example below).
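A hedged sketch of what the TensorboardX side can look like; the writer setup and tag names here are illustrative, not necessarily what test_lamb.py uses.

```python
from tensorboardX import SummaryWriter

writer = SummaryWriter('runs/lamb_example')
for step, loss in enumerate([1.0, 0.8, 0.5]):  # stand-in for a real training loop
    writer.add_scalar('train/loss', loss, step)
    writer.add_scalar('hyper/lr', 0.02, step)
writer.close()
```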
Try the sample
git clone [email protected]:cybertronai/pytorch-lamb.git
cd pytorch-lamb
pip install -e .
python test_lamb.py
tensorboard --logdir=runs
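After installing with pip install -e ., the optimizer can be dropped into your own training loop. The snippet below is an assumed usage example: the import path follows the package and class name in this repo, and the hyperparameters mirror the sample commands under "Sample results".

```python
import torch
import torch.nn.functional as F
from pytorch_lamb import Lamb  # assumed import path for this package

model = torch.nn.Linear(10, 2)
optimizer = Lamb(model.parameters(), lr=0.02, weight_decay=0.01)

x, y = torch.randn(512, 10), torch.randint(0, 2, (512,))
loss = F.cross_entropy(model(x), y)
loss.backward()
optimizer.step()
optimizer.zero_grad()
```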
Sample results
At --lr=.02, the Adam optimizer is unable to train.
Red: python test_lamb.py --batch-size=512 --lr=.02 --wd=.01 --log-interval=30 --optimizer=adam
Blue: python test_lamb.py --batch-size=512 --lr=.02 --wd=.01 --log-interval=30 --optimizer=lamb