• Stars
    star
    510
  • Rank 83,970 (Top 2 %)
  • Language
    Python
  • Created about 2 years ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Live Training for Open-source Big Models

CPM-Live

Live Training for Open-source Big Models

WebsitePlanDiscussion简体中文

What's New

  • 2023/05/27 CPM-Bee is released!
  • 2023/04/12 CPM-Ant has been integrated into HuggingFace Transformers!
  • 2022/10/12 CPM-Ant+, a bilingual model, is released! In addition to generating Chinese/English text, you can now use our model for QA, summarization and translation tasks!
  • 2022/09/16 CPM-Ant is released!
  • 2022/05/29 The training of CPM-Live has launched today! See training dynamics.
  • 2022/05/25 The training plan for CPM-Live is now published. Look forward to the training!

Milestones

Training Plan

Considering the scale of data and computing resources, CPM-Live will start with a 10B model training.

During training we will do:

  • Real-time: Display model training metrics
  • Every day: Release the model training log
  • Every week: Deal with discussions and feedback from the community
  • Irregularly: Release checkpoints during model training which everyone can download

During training you can:

  • Raise your model proposal: Have better ideas on model architecture, training methods, or data sources? You can put forward your model proposal in the community. If the proposal receives more support and is practically feasible, we will add it to the model we are training, so that CPM-Live can learn continuously and progress with the help of everyone.

  • Develop your application: You can submit your initial ideas, prototypes, development code, or finished apps, which are based on CPM-Live, to the community. We will exhibit the most popular apps on the website.

  • Chat on the forum: You can talk about anything related to big models in our forums, such as academic research, engineering implementation, tool use, application design, etc. No matter whether you are experienced or not, we believe everyone can benefit from positive and open discussions.

  • Download the resource: Once the model training is complete, you are free to download the model parameters under an open use license. CPM-Live uses an open license that includes permission for commercialization. With model compression and inference acceleration tools, you can experience the power of big models on your own PC!

Community

Our community is based on GitHub Discussions.

Read the first post and start your exploration on CPM-Live!

More Repositories

1

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Shell
23,711
star
2

XAgent

An Autonomous LLM Agent for Complex Task Solving
Python
7,663
star
3

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Python
4,493
star
4

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.
Jupyter Notebook
4,055
star
5

AgentVerse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
JavaScript
3,695
star
6

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Python
2,854
star
7

CPM-Bee

百亿参数的中英文双语基座大模型
Python
2,673
star
8

MiniCPM-V

MiniCPM-V 2.0: An Efficient End-side MLLM with Strong OCR and Understanding Capabilities
Python
1,406
star
9

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
Python
984
star
10

ProAgent

An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
Python
674
star
11

BMInf

Efficient Inference for Big Models
Python
565
star
12

BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models
Python
519
star
13

BMList

A List of Big Models
Python
339
star
14

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).
Python
247
star
15

BMPrinciples

A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
222
star
16

ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
Python
218
star
17

UltraEval

An open source framework for evaluating foundation models.
Python
166
star
18

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.
Python
148
star
19

InfiniteBench

100k+ Long-Context Benchmark for Large Language Models (paper upcoming)
Python
105
star
20

OlympiadBench

An Olympiad-level bilingual multimodal scientific benchmark, featuring 8,952 questions from Olympiad-level mathematics and physics competitions, including the Chinese college entrance exam.
Python
62
star
21

DecT

Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding
Python
42
star
22

XAgent-doc

Document for XAgent.
19
star
23

BMInf-demos

BMInf demos.
JavaScript
14
star
24

UltraLink

An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Python
11
star
25

General-Model-License

6
star