• Stars
    star
    512
  • Rank 86,323 (Top 2 %)
  • Language
    Python
  • Created over 2 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Live Training for Open-source Big Models

CPM-Live

Live Training for Open-source Big Models

WebsitePlanDiscussion简体中文

What's New

  • 2023/05/27 CPM-Bee is released!
  • 2023/04/12 CPM-Ant has been integrated into HuggingFace Transformers!
  • 2022/10/12 CPM-Ant+, a bilingual model, is released! In addition to generating Chinese/English text, you can now use our model for QA, summarization and translation tasks!
  • 2022/09/16 CPM-Ant is released!
  • 2022/05/29 The training of CPM-Live has launched today! See training dynamics.
  • 2022/05/25 The training plan for CPM-Live is now published. Look forward to the training!

Milestones

Training Plan

Considering the scale of data and computing resources, CPM-Live will start with a 10B model training.

During training we will do:

  • Real-time: Display model training metrics
  • Every day: Release the model training log
  • Every week: Deal with discussions and feedback from the community
  • Irregularly: Release checkpoints during model training which everyone can download

During training you can:

  • Raise your model proposal: Have better ideas on model architecture, training methods, or data sources? You can put forward your model proposal in the community. If the proposal receives more support and is practically feasible, we will add it to the model we are training, so that CPM-Live can learn continuously and progress with the help of everyone.

  • Develop your application: You can submit your initial ideas, prototypes, development code, or finished apps, which are based on CPM-Live, to the community. We will exhibit the most popular apps on the website.

  • Chat on the forum: You can talk about anything related to big models in our forums, such as academic research, engineering implementation, tool use, application design, etc. No matter whether you are experienced or not, we believe everyone can benefit from positive and open discussions.

  • Download the resource: Once the model training is complete, you are free to download the model parameters under an open use license. CPM-Live uses an open license that includes permission for commercialization. With model compression and inference acceleration tools, you can experience the power of big models on your own PC!

Community

Our community is based on GitHub Discussions.

Read the first post and start your exploration on CPM-Live!

More Repositories

1

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Shell
24,842
star
2

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Python
12,088
star
3

XAgent

An Autonomous LLM Agent for Complex Task Solving
Python
8,102
star
4

MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Jupyter Notebook
7,009
star
5

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Python
4,789
star
6

AgentVerse

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation
JavaScript
4,095
star
7

BMTools

Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Python
2,884
star
8

CPM-Bee

百亿参数的中英文双语基座大模型
Python
2,686
star
9

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
Python
1,075
star
10

ProAgent

An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
Python
754
star
11

BMInf

Efficient Inference for Big Models
Python
573
star
12

IoA

An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through internet-like connectivity.
Python
556
star
13

BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models
Python
554
star
14

BMList

A List of Big Models
Python
339
star
15

RepoAgent

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.
Python
336
star
16

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).
Python
302
star
17

ModelCenter

Efficient, Low-Resource, Distributed transformer implementation based on BMTrain
Python
234
star
18

BMPrinciples

A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
222
star
19

UltraEval

[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.
Python
215
star
20

InfiniteBench

100k+ Long-Context Benchmark for Large Language Models (paper upcoming)
Python
105
star
21

OlympiadBench

[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems.
Python
89
star
22

MobileCPM

A Toolkit for Running On-device Large Language Models (LLMs) in APP
C++
53
star
23

RAGEval

Python
47
star
24

DecT

Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding
Python
42
star
25

XAgent-doc

Document for XAgent.
19
star
26

UltraLink

An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Python
17
star
27

BMInf-demos

BMInf demos.
JavaScript
13
star
28

General-Model-License

6
star
29

VisRAG

Python
1
star