ModelTC/MQBench

Stars
742
Rank 61,120 (Top 2 %)
Language
Shell
License
Apache License 2.0
Created over 3 years ago
Updated 6 months ago

Model Quantization Benchmark

Introduction

MQBench is an open-source model quantization toolkit based on PyTorch fx.

MQBench aims to provide:

SOTA Algorithms. With MQBench, hardware vendors and researchers can benefit from the latest research progress in academia.
Powerful Toolkits. With the toolkit, quantization nodes can be inserted into the original PyTorch module automatically, according to the target hardware. After training, the quantized model can be converted smoothly to a format that runs inference on the real device.
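
To make the idea of an inserted quantization node concrete, here is a minimal sketch of the quantize-then-dequantize ("fake quantization") arithmetic such a node typically performs. It is plain Python and does not use the MQBench API; the function name fake_quantize, the symmetric per-tensor scheme, and the 8-bit default are all assumptions chosen for illustration.

```python
def fake_quantize(values, num_bits=8):
    """Quantize-dequantize a list of floats with one symmetric scale.

    Hypothetical helper for illustration only, not part of MQBench.
    """
    qmax = 2 ** (num_bits - 1) - 1   # largest integer level, 127 for 8 bits
    qmin = -qmax - 1                 # smallest integer level, -128 for 8 bits
    max_abs = max(abs(v) for v in values)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero input.
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer level, then clamp into the valid range.
    quantized = [min(qmax, max(qmin, round(v / scale))) for v in values]
    # Map the integers back to floats so downstream layers still see floats.
    dequantized = [q * scale for q in quantized]
    return quantized, dequantized, scale


weights = [0.5, -1.27, 0.0, 1.27]
quantized, dequantized, scale = fake_quantize(weights)
for w, q, d in zip(weights, quantized, dequantized):
    print(f"{w:+.4f} -> level {q:+4d} -> {d:+.4f}")
```

During quantization-aware training, a node like this sits after a weight or activation so the model learns under the rounding error it will see on the device. A real toolkit also has to handle gradients through the rounding step and hardware-specific choices such as per-channel scales, which this sketch leaves out.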

Installation

git clone git@github.com:ModelTC/MQBench.git
cd MQBench
python setup.py install

Documentation

MQBench aims to support (1) various deployable quantization algorithms and (2) hardware backend libraries to facilitate the development of the community.

For detailed information, please refer to the MQBench documentation.

Citation

If you use this toolkit or benchmark in your research, please cite this project.

@article{MQBench,
  title   = {MQBench: Towards Reproducible and Deployable Model Quantization Benchmark},
  author  = {Yuhang Li* and Mingzhu Shen* and Jian Ma* and Yan Ren* and Mingxin Zhao* and
             Qi Zhang* and Ruihao Gong* and Fengwei Yu and Junjie Yan},
  journal = {Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks},
  year    = {2021}
}

License

This project is released under the Apache 2.0 license.

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

United-Perception

United Perception

llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Dipoorlet

Offline Quantization Tools for Deployment.

awesome-lm-system

Summary of system papers/frameworks/code/tools on training or serving large models

TFMQ-DM

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".

Jupyter Notebook

mqbench-paper

rank_dataset

PyTorch Dataset Rank Dataset

NART

NART = NART is not A RunTime, a deep learning inference framework.

EasyLLM

Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.

Outlier_Suppression_Plus

Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling

NNLQP

QLLM

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"

LPCV2021_Winner_Solution

pyvlova

Yet another Polyhedra Compiler for DeepLearning

LPCV_2023_solution

AAAI2023_EAMPD

AAAI2023 Efficient and Accurate Models towards Practical Deep Learning Baseline

Prototype

L2_Compression

OmniBal

msbench

A tool for model sparsification based on torch.fx

Imagenet-S

Robustness for real-world system noise

mtc-token-healing

Token healing implementation in Rust

FCPTS

general-sam

A general suffix automaton implementation in Rust with Python bindings

statecs

general-sam-py

Python bindings for general-sam and some utilities

pyrotom

Python Code Hotfix and Refactor on the fly