# autograd-hacks

Extract useful quantities from PyTorch autograd.

## Per-example gradients
```python
import torch
import autograd_hacks

autograd_hacks.add_hooks(model)
output = model(data)
loss_fn(output, targets).backward()
autograd_hacks.compute_grad1(model)

# param.grad: gradient averaged over the batch
# param.grad1[i]: gradient with respect to example i
for param in model.parameters():
    assert torch.allclose(param.grad1.mean(dim=0), param.grad)
```
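A common use of `param.grad1` is computing per-example gradient norms, e.g. for per-example gradient clipping. Below is a minimal self-contained sketch; the toy model, `data`, and `targets` are illustrative stand-ins, not part of the library:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
import autograd_hacks

# Toy setup (illustrative names; any model built from supported
# layer types such as Linear and Conv2d should work).
model = nn.Sequential(nn.Linear(10, 20), nn.ReLU(), nn.Linear(20, 3))
data = torch.randn(8, 10)              # batch of 8 examples
targets = torch.randint(0, 3, (8,))

autograd_hacks.add_hooks(model)
loss = F.cross_entropy(model(data), targets)
loss.backward()
autograd_hacks.compute_grad1(model)

# Flatten each parameter's per-example gradients and take the norm
# across all parameters; the hasattr guard skips any parameter that
# belongs to an unsupported layer type.
flat = torch.cat([p.grad1.reshape(len(data), -1)
                  for p in model.parameters() if hasattr(p, 'grad1')], dim=1)
per_example_norms = flat.norm(dim=1)   # shape: (8,)
print(per_example_norms)
```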
## Hessians

(assuming ReLU activations; otherwise produces the Gauss-Newton matrix)
```python
autograd_hacks.backprop_hess(model(data), hess_type='CrossEntropy')
autograd_hacks.compute_hess(model)

for param in model.parameters():
    print(param.hess)  # Hessian of the loss with respect to param
```
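As an end-to-end illustration, here is a sketch on a toy ReLU network (the model and `data` below are stand-ins, not part of the library). With ReLU activations the computed quantity is the exact Hessian rather than the Gauss-Newton approximation, and the `hasattr` check skips parameters of layer types the hooks do not cover:

```python
import torch
import torch.nn as nn
import autograd_hacks

# Toy setup (illustrative names).
model = nn.Sequential(nn.Linear(10, 20), nn.ReLU(), nn.Linear(20, 3))
data = torch.randn(8, 10)

autograd_hacks.add_hooks(model)
autograd_hacks.backprop_hess(model(data), hess_type='CrossEntropy')
autograd_hacks.compute_hess(model)

for name, param in model.named_parameters():
    if hasattr(param, 'hess'):
        print(name, param.hess.shape)  # Hessian block computed for this parameter
```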