@ModelTC
  • Stars
    star
    4,126
  • Global Org. Rank 5,424 (Top 2 %)
  • Registered about 4 years ago
  • Most used languages
    Python
    81.5 %
    Rust
    11.1 %
    Shell
    3.7 %

Top repositories

1

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Python
2,304
star
2

MQBench

Model Quantization Benchmark
Shell
742
star
3

United-Perception

United Perception
Python
427
star
4

llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Python
184
star
5

Dipoorlet

Offline Quantization Tools for Deploy.
Python
109
star
6

awesome-lm-system

Summary of system papers/frameworks/codes/tools on training or serving large model
56
star
7

TFMQ-DM

[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".
Jupyter Notebook
50
star
8

mqbench-paper

Python
44
star
9

rank_dataset

PyTorch Dataset Rank Dataset
Python
37
star
10

NART

NART = NART is not A RunTime, a deep learning inference framework.
Python
37
star
11

EasyLLM

Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing usability, it also ensures training efficiency.
Python
35
star
12

Outlier_Suppression_Plus

Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Python
35
star
13

NNLQP

Python
33
star
14

QLLM

[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models"
Python
33
star
15

LPCV2021_Winner_Solution

Python
29
star
16

pyvlova

Yet another Polyhedra Compiler for DeepLearning
Python
19
star
17

LPCV_2023_solution

Python
18
star
18

AAAI2023_EAMPD

AAAI2023 Efficient and Accurate Models towards Practical Deep Learning Baseline
13
star
19

Prototype

Python
12
star
20

L2_Compression

Python
11
star
21

OmniBal

Python
9
star
22

msbench

A tool for model sparse based on torch.fx
Python
7
star
23

Imagenet-S

Robustness for real-world system noise
Python
4
star
24

mtc-token-healing

Token healing implementation in Rust
Rust
3
star
25

FCPTS

Python
2
star
26

general-sam

A general suffix automaton implementation in Rust with Python bindings
Rust
2
star
27

statecs

Rust
1
star
28

general-sam-py

Python bindings for general-sam and some utilities
Python
1
star
29

pyrotom

Python Code Hotfix and Refactor on the fly
Python
1
star