Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

HTML

Swift

Haskell

Assembly

Lua

Groovy

PowerShell

Zig

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Nix

Java

F#

C

Go

Clojure

PHP

MATLAB

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇵🇭 Philippines

🇵🇪 Peru

🇨🇿 Czechia

🇬🇹 Guatemala

🇱🇻 Latvia

🇲🇹 Malta

🇧🇪 Belgium

🇧🇸 The Bahamas

All Countries Compare Countries

abacaj/mpt-30B-inference

Stars
576
Rank 77,502 (Top 2 %)
Language
Python
License
MIT License
Created over 1 year ago
Updated over 1 year ago

abacaj

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Run inference on MPT-30B using CPU

MPT 30B inference code using CPU

Run inference on the latest MPT-30B model using your CPU. This inference code uses a ggml quantized model. To run the model we'll use a library called ctransformers that has bindings to ggml in python.

Turn style with history on latest commit:

Video of initial demo:

2023-06-25.20-13-24.mp4

Requirements

I recommend you use docker for this model, it will make everything easier for you. Minimum specs system with 32GB of ram. Recommend to use python 3.10.

Tested working on

Will post some numbers for these two later.

AMD Epyc 7003 series CPU
AMD Ryzen 5950x CPU

Setup

First create a venv.

python -m venv env && source env/bin/activate

Next install dependencies.

pip install -r requirements.txt

Next download the quantized model weights (about 19GB).

python download_model.py

Ready to rock, run inference.

python inference.py

Next modify inference script prompt and generation parameters.

fine-tune-mistral

Fine-tune mistral-7B on 3090s, a100s, h100s

Python

698

awesome-transformers

A curated list of awesome transformer models.

601

code-eval

Run evaluation on LLMs using human-eval benchmark

Python

373

chatgpt-backup

Single client side script to backup your entire ChatGPT conversation history

HTML

254

replit-3B-inference

Run inference on replit-3B code instruct model using CPU

Python

155

openhermes-function-calling

Jupyter Notebook

132

transformers

Understanding large language models

116

train-with-fsdp

Python

unofficial-chatgpt-api

Node.js client for the chatgpt API. No third party dependencies.

TypeScript

react-simple-filter-sort-ecommerce

Repo used in this video: https://youtu.be/c3WSziz_u_o

TypeScript

transformers-docker

Run, build, test transformer models using docker

Dockerfile

resolutejs

Finally get to retry during a Promise operation (works in modern browsers as well as nodejs), zero dependencies.

JavaScript

node-selenium-starter

A starter project for automated browser testing using Node.js.

JavaScript

alpaca-trainer

Python

electron-easy-ts

Build electron apps with typescript the easy way.

TypeScript

abacaj/mpt-30B-inference

abacaj

Reviews

Repository Details

MPT 30B inference code using CPU

Requirements

Tested working on

Setup

More Repositories