trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

OpenELM
Evolution Through Large Models

cheese
Used for adaptive human in the loop evaluation of language and embedding models.

DRLX
Diffusion Reinforcement Learning Library

Code-Pile
This repository contains all the code for collecting large scale amounts of code from GitHub.

autocrit
A repository for transformer critique learning and generation

InstructGPT
For experiments involving instruct gpt. Currently used for documenting open research questions.

squeakily
A library for squeakily cleaning and filtering language datasets.

Algorithm-Distillation-RLHF

decontamination
This repository contains code for cleaning your training data of benchmark data to help combat data snooping.

treasure_trove

CodeReviewSE
Stuff related to scraping the Code Review StackExchange

ArchitextRL

Polygraph
RLHF Mechanistic Interpretability and Deception

magicarp-v2
magiCARP is an API used for crossencoder training.

AutoPaperclipMaximizer
👀

goosebox
sandboxed eval server for running code snippets