Ryan-yang125/ChatLLM-Web

Stars
618
Rank 72,605 (Top 2 %)
Language
JavaScript
License
MIT License
Created over 1 year ago
Updated 4 months ago

Ryan-yang125/ChatLLM-Web

Ryan-yang125

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

🗣️ Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered by web llm.

ChatLLM Web

English / 简体中文 / 日本語

🗣️ Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered By web-llm.

Features

🤖 Everything runs inside the browser with no server support and is accelerated with WebGPU.
⚙️ Model runs in a web worker, ensuring that it doesn't block the user interface and providing a seamless experience.
🚀 Easy to deploy for free with one-click on Vercel in under 1 minute, then you get your own ChatLLM Web.
💾 Model caching is supported, so you only need to download the model once.
💬 Multi-conversation chat, with all data stored locally in the browser for privacy.
📝 Markdown and streaming response support: math, code highlighting, etc.
🎨 responsive and well-designed UI, including dark mode.
💻 PWA supported, download it and run totally offline.

Instructions

🌐 To use this app, you need a browser that supports WebGPU, such as Chrome 113 or Chrome Canary. Chrome versions ≤ 112 are not supported.
💻 You will need a GPU with about 6.4GB of memory. If your GPU has less memory, the app will still run, but the response time will be slower.
📥 The first time you use the app, you will need to download the model. For the Vicuna-7b model that we are currently using, the download size is about 4GB. After the initial download, the model will be loaded from the browser cache for faster usage.
ℹ️ For more details, please visit mlc.ai/web-llm

Roadmap

[✅] LLM: using web worker to create an LLM instance and generate answers.
[✅] Conversations: Multi-conversation support is available.
[✅] PWA
[] Settings:
- ui: dark/light theme
- device:
  - gpu device choose
  - cache usage and manage
- model:
  - support multi models: vicuna-7b✅ RedPajama-INCITE-Chat-3B []
  - params config: temperature, max-length, etc.
  - export & import model

Deploy to Vercel

Click , follow the instructions, and finish in just 1 minute.
Enjoy it 😊

Development

git clone https://github.com/Ryan-yang125/ChatLLM-Web.git
cd ChatLLM-Web
npm i
npm run dev

Screenshots

LICENSE

Computer-Graphics

Zhejiang University 2019-2020 Spring and Summer Computer Graphics Course 6 Experiments + Programs + OpenGL Learning Material

Computer-Vision

浙江大学2020-2021秋冬计算机视觉课程6次实验+复习资料+OpenCV学习资料

nestjs-prisma-postgresql-starter-kit

An easy-to-use starter kit for NestJS, integrating Prisma, PostgreSQL, and Swagger, perfect for jump-starting your backend development.

Ryan-yang125.github.io

CKEditor5-CodeMirrorNext-Plugin

a codemirror6 plugin for CKEditor5

ChatDocs-Next

chat with your own data locally, safely, privately

eol-cloth

cpp implement of Paper Eulerian-on-Lagrangian Cloth Simulation

LeetcodeTags

Leetcode分类刷题

bio.ruiy.sh

personal portfolio website built with astro

srtp

HCI-Emotion

vis-project

vis-server

jsNotes

关于js的一些知识点

D3

A Data Visualization component by D3

prompt-gpt

This is an application built using langchainjs and nextjs. It automatically uses the best prompt based on your needs during chat.

mini-react

a react toy in typescript

WebTools

CA-Particle-System

计算机动画粒子系统

ShakeTea

mern-todo-app

keepapp

face-capture

krisryang-sites

TImage

a chrome extension used for translating text in image in web page.

CA

DAM-website

A DAM website about class sculpture