mit-han-lab/dlg

[NeurIPS 2019] Deep Leakage From Gradients

Deep Leakage From Gradients [arXiv] [Website]

@inproceedings{zhu19deep,
  title={Deep Leakage from Gradients},
  author={Zhu, Ligeng and Liu, Zhijian and Han, Song},
  booktitle={Advances in Neural Information Processing Systems},
  year={2019}
}

Gradient exchange is widely used in modern multi-node learning systems. It was long believed that numerical gradients are safe to share. We show that it is actually possible to recover the training data from shared gradients, and that the leakage is pixel-wise accurate for images and token-wise matching for text.

Overview

The core algorithm optimizes dummy data and dummy labels so that the gradients they produce match the gradients shared from the real data.
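Written as an objective, the gradient matching can be sketched as follows. The symbols are assumptions introduced here for illustration: F is the model with weights W, \ell is the training loss, \nabla W is the shared gradient, and (x', y') are the dummy input and label.

```latex
x'^{*},\, y'^{*}
  = \arg\min_{x',\,y'} \left\lVert \nabla W' - \nabla W \right\rVert^{2}
  = \arg\min_{x',\,y'} \left\lVert
      \frac{\partial\, \ell\!\left(F(x', W),\, y'\right)}{\partial W} - \nabla W
    \right\rVert^{2}
```

The distance is differentiable with respect to x' and y', so a standard optimizer can minimize it directly over the dummy data.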

It can be implemented in less than 20 lines with PyTorch!

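import torch
import torch.nn.functional as F
from torch.autograd import grad

def criterion(pred, target):
  # Not defined in the original snippet; a soft-label cross-entropy is assumed here.
  return torch.mean(torch.sum(-target * F.log_softmax(pred, dim=-1), dim=-1))

def deep_leakage_from_gradients(model, origin_grad, data_size, label_size):
  # data_size / label_size: shapes of the input and one-hot label to recover.
  dummy_data = torch.randn(data_size).requires_grad_(True)
  dummy_label = torch.randn(label_size).requires_grad_(True)
  optimizer = torch.optim.LBFGS([dummy_data, dummy_label])

  for iters in range(300):
    def closure():
      optimizer.zero_grad()
      dummy_pred = model(dummy_data)
      dummy_loss = criterion(dummy_pred, F.softmax(dummy_label, dim=-1))
      dummy_grad = grad(dummy_loss, model.parameters(), create_graph=True)

      # Squared L2 distance between the dummy gradients and the shared gradients.
      grad_diff = sum(((dummy_g - origin_g) ** 2).sum()
        for dummy_g, origin_g in zip(dummy_grad, origin_grad))

      grad_diff.backward()
      return grad_diff

    optimizer.step(closure)

  return dummy_data, dummy_label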
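
To make the inputs of the function above concrete, here is a minimal, self-contained sketch of how the shared gradients might be produced and then matched. Everything in it is an assumption for illustration and is not taken from the repository: the tiny MLP, the random "real" sample, the helper names `soft_cross_entropy`, `shared_gradients`, and `gradient_distance`, and the use of the `strong_wolfe` line search in LBFGS (chosen here only to keep the toy run stable).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def soft_cross_entropy(pred, target):
    # Cross-entropy that accepts a probability vector as the target.
    return torch.mean(torch.sum(-target * F.log_softmax(pred, dim=-1), dim=-1))


def shared_gradients(model, data, onehot_label):
    # Gradients a participant would share for one training sample.
    loss = soft_cross_entropy(model(data), onehot_label)
    grads = torch.autograd.grad(loss, model.parameters())
    return [g.detach().clone() for g in grads]


def gradient_distance(model, dummy_data, dummy_label, origin_grad):
    # Squared L2 distance between dummy gradients and shared gradients.
    loss = soft_cross_entropy(model(dummy_data), F.softmax(dummy_label, dim=-1))
    dummy_grad = torch.autograd.grad(loss, model.parameters(), create_graph=True)
    return sum(((dg - og) ** 2).sum() for dg, og in zip(dummy_grad, origin_grad))


torch.manual_seed(0)

# Hypothetical toy model and "private" sample (8 features, 4 classes).
model = nn.Sequential(nn.Linear(8, 16), nn.Sigmoid(), nn.Linear(16, 4))
real_data = torch.rand(1, 8)
real_label = F.one_hot(torch.tensor([2]), num_classes=4).float()
origin_grad = shared_gradients(model, real_data, real_label)

# Dummy data and label are the only variables being optimized.
dummy_data = torch.randn(1, 8).requires_grad_(True)
dummy_label = torch.randn(1, 4).requires_grad_(True)
initial = gradient_distance(model, dummy_data, dummy_label, origin_grad).item()

optimizer = torch.optim.LBFGS([dummy_data, dummy_label], line_search_fn="strong_wolfe")
for _ in range(20):
    def closure():
        optimizer.zero_grad()
        diff = gradient_distance(model, dummy_data, dummy_label, origin_grad)
        diff.backward()
        return diff

    optimizer.step(closure)

final = gradient_distance(model, dummy_data, dummy_label, origin_grad).item()
print(f"gradient distance: {initial:.4f} -> {final:.6f}")
```

The sketch only checks that the gradient distance shrinks; how closely `dummy_data` ends up matching `real_data` depends on the model, the initialization, and the number of iterations.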

Prerequisites

To run the code, the following libraries are required:

Python >= 3.6
PyTorch >= 1.0
torchvision >= 0.4

Code

Note: We provide for quick reproduction.

# Single image on CIFAR
python main.py --index 25

# Deep Leakage on your own Image
python main.py --image yours.jpg

Deep Leakage on Batched Images

Deep Leakage on Language Model

License

This repository is released under the MIT license. See LICENSE for additional details.
