Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

HTML

Scala

Lua

Java

Swift

Julia

Dart

Kotlin

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Kotlin

Groovy

Python

Ada

Dart

Jupyter Notebook

Swift

Clojure

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇵🇭 Philippines

🇵🇪 Peru

🇨🇿 Czechia

🇬🇹 Guatemala

🇱🇻 Latvia

🇲🇹 Malta

🇧🇪 Belgium

🇧🇸 The Bahamas

All Countries Compare Countries

abacaj/replit-3B-inference

Stars
155
Rank 240,864 (Top 5 %)
Language
Python
License
MIT License
Created over 1 year ago
Updated over 1 year ago

abacaj

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Run inference on replit-3B code instruct model using CPU

Replit Code Instruct inference using CPU

Run inference on the replit code instruct model using your CPU. This inference code uses a ggml quantized model. To run the model we'll use a library called ctransformers that has bindings to ggml in python.

Demo:

2023-06-27.14-46-07.mp4

Requirements

Using docker should make all of this easier for you. Minimum specs, system with 8GB of ram. Recommend to use python 3.10.

Tested working on

Will post some numbers for these two later.

AMD Epyc 7003 series CPU
AMD Ryzen 5950x CPU

Setup

First create a venv.

python -m venv env && source env/bin/activate

Next install dependencies.

pip install -r requirements.txt

Next download the quantized model weights (about 1.5GB).

python download_model.py

Ready to rock, run inference.

python inference.py

Next modify inference script prompt and generation parameters.

fine-tune-mistral

Fine-tune mistral-7B on 3090s, a100s, h100s

Python

698

awesome-transformers

A curated list of awesome transformer models.

601

mpt-30B-inference

Run inference on MPT-30B using CPU

Python

576

code-eval

Run evaluation on LLMs using human-eval benchmark

Python

373

chatgpt-backup

Single client side script to backup your entire ChatGPT conversation history

HTML

254

openhermes-function-calling

Jupyter Notebook

132

transformers

Understanding large language models

116

train-with-fsdp

Python

unofficial-chatgpt-api

Node.js client for the chatgpt API. No third party dependencies.

TypeScript

react-simple-filter-sort-ecommerce

Repo used in this video: https://youtu.be/c3WSziz_u_o

TypeScript

transformers-docker

Run, build, test transformer models using docker

Dockerfile

resolutejs

Finally get to retry during a Promise operation (works in modern browsers as well as nodejs), zero dependencies.

JavaScript

node-selenium-starter

A starter project for automated browser testing using Node.js.

JavaScript

alpaca-trainer

Python

electron-easy-ts

Build electron apps with typescript the easy way.

TypeScript

abacaj/replit-3B-inference

abacaj

Reviews

Repository Details

Replit Code Instruct inference using CPU

Requirements

Tested working on

Setup

More Repositories