zhihu/TLLM_QMM

Stars: 10
Rank: 1,807,489 (Top 36 %)
Language: C++
License: Apache License 2.0
Created: 4 months ago
Updated: 4 months ago

TLLM_QMM extracts the quantized kernel implementations from NVIDIA's TensorRT-LLM, removes the NVInfer dependency, and exposes an easy-to-use PyTorch module. We modified the dequantization and weight preprocessing to align with popular quantization algorithms such as AWQ and GPTQ, and combined them with new FP8 quantization.
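
To make the idea of weight-only quantization and dequantization concrete, here is a minimal pure-Python sketch of group-wise 4-bit quantization with a scale and an integer zero point, the general form used by schemes in the AWQ and GPTQ family. It is an assumption-laden illustration, not TLLM_QMM's kernels or API: the function names `quantize_group`, `dequantize_group`, and `quantized_dot` are hypothetical, and the group size and rounding choices are picked only for readability.

```python
# Illustrative sketch only: generic group-wise 4-bit weight-only quantization.
# Function names are hypothetical and are not part of TLLM_QMM or TensorRT-LLM.

def quantize_group(weights, bits=4):
    """Map one group of floats to unsigned integers plus a scale and zero point."""
    qmax = (1 << bits) - 1
    # Extend the range to include 0.0 so that zero stays exactly representable.
    lo = min(min(weights), 0.0)
    hi = max(max(weights), 0.0)
    scale = (hi - lo) / qmax if hi > lo else 1.0
    zero = round(-lo / scale)
    # Round to the nearest level and clamp into the representable range.
    q = [min(qmax, max(0, round(w / scale) + zero)) for w in weights]
    return q, scale, zero


def dequantize_group(q, scale, zero):
    """Recover approximate floats: w is roughly (q - zero) * scale."""
    return [(v - zero) * scale for v in q]


def quantized_dot(x, weights, group_size=4):
    """Dot product of x with weights that are quantized group by group."""
    total = 0.0
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        q, scale, zero = quantize_group(group)
        w_hat = dequantize_group(q, scale, zero)
        total += sum(a * b for a, b in zip(x[start:start + group_size], w_hat))
    return total


if __name__ == "__main__":
    w = [0.1, -0.2, 0.3, 0.4, 1.0, -1.0, 0.5, 0.0]
    x = [1.0] * len(w)
    print("exact:    ", sum(a * b for a, b in zip(x, w)))
    print("quantized:", quantized_dot(x, w))
```

A real kernel would pack two 4-bit values per byte and fuse the dequantization into the matrix multiply on the GPU; the sketch keeps everything as plain lists to show only the arithmetic.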

Matisse

🎆 A well-designed local image and video selector for Android

griffith

A React-based web video player

kids

Kids Is Data Stream

rucene

Rust port of Lucene

cuBERT

Fast implementation of BERT inference directly on NVIDIA (CUDA, cuBLAS) and Intel MKL

RxLifecycle

Bind observables to the lifecycle of Activity or Fragment in a non-invasive way.

redis-shard

Redis sharding client library

zhihu-rxjava-meetup

Zhihu x RxJava Meetup

mirror

Yet another Sketch Mirror App for Android.

SugarAdapter

Make RecyclerView.Adapter Great Again!

zetta

Zetta Table Store

norm

An ORM library supporting nGQL for Golang

tache

A tag based invalidation caching library

promate

Graphite On VictoriaMetrics

SERank

An efficient and effective learning-to-rank algorithm that mines information across ranking candidates. This repository contains the TensorFlow implementation of the SERank model. The code is based on TF-Ranking.

cmdb

Programmable CMDB

chaika

Elastic cache solution on Kubernetes

zetta-proto

Protobuf files for Zetta Table Store

presto-connectors

Presto Connectors project has been moved to TiBigData at PingCAP Incubator

zetta-client-go

Go client for Zetta Table Store