openai-cookbook
Examples and guides for using the OpenAI APIwhisper
Robust Speech Recognition via Large-Scale Weak Supervisiongym
A toolkit for developing and comparing reinforcement learning algorithms.gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imagegpt-3
GPT-3: Language Models are Few-Shot Learnersopenai-python
The official Python library for the OpenAI APIbaselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithmsevals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.DALL-E
PyTorch package for the discrete VAE used for DALL·E.shap-e
Generate 3D objects conditioned on text or imagestriton
Development repository for the Triton language and compilerspinningup
An educational resource to help anyone learn deep reinforcement learning.tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.universe
Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.jukebox
Code for the paper "Jukebox: A Generative Model for Music"point-e
Point cloud diffusion for 3D model synthesisconsistency_models
Official repo for consistency models.openai-node
The official Node.js / Typescript library for the OpenAI APIguided-diffusion
plugins-quickstart
Get a ChatGPT plugin up and running in under 5 minutes!glide-text2im
GLIDE: a diffusion-based text-conditional image synthesis modelretro
Retro Games in Gymglow
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.openai-quickstart-node
Node.js example app from the OpenAI API quickstart tutorialimproved-gan
Code for the paper "Improved Techniques for Training GANs"improved-diffusion
Release for Improved Denoising Diffusion Probabilistic Modelsroboschool
DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.image-gpt
finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"pixel-cnn
Code for the paper "PixelCNN++: A PixelCNN Implementation with Discretized Logistic Mixture Likelihood and Other Modifications"gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and morerequests-for-research
A living collection of deep learning problemsgpt-discord-bot
Example Discord bot written in Python that uses the completions API to have conversations with the `text-davinci-003` model, and the moderations API to filter the messages.human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"openai-quickstart-python
Python example app from the OpenAI API quickstart tutorialevolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"generating-reviews-discovering-sentiment
Code for "Learning to Generate Reviews and Discovering Sentiment"neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"universe-starter-agent
A starter agent that can solve a number of universe environments.following-instructions-human-feedback
Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videosdalle-2-preview
InfoGAN
Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"supervised-reptile
Code for the paper "On First-Order Meta-Learning Algorithms"prm800k
800,000 step-level correctness labels on LLM solutions to MATH problemsblocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolutionprocgen
Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environmentslm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferenceskubernetes-ec2-autoscaler
A batch-optimized scaling manager for Kubernetessummarize-from-feedback
Code for "Learning to summarize from human feedback"random-network-distillation
Code for the paper "Exploration by Random Network Distillation"large-scale-curiosity
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"multiagent-competition
Code for the paper "Emergent Complexity via Multi-agent Competition"automated-interpretability
imitation
Code for the paper "Generative Adversarial Imitation Learning"deeptype
Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"mlsh
Code for the paper "Meta-Learning Shared Hierarchies"grade-school-math
openai-openapi
OpenAPI specification for the OpenAI APIiaf
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"mujoco-worldgen
Automatic object XML generation for Mujocosafety-gym
Tools for accelerating safe exploration research.vdvae
Repository for the paper "Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images"coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"weightnorm
Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"atari-py
A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interfacerobogym
Robotics Gym Environmentsopenai-gemm
Open single and half precision gemm implementationsvime
Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.ebm_code_release
Code for Implicit Generation and Generalization with Energy Based ModelsCLIP-featurevis
code for reproducing some of the diagrams in the paper "Multimodal Neurons in Artificial Neural Networks"gym-soccer
gym-http-api
API to access OpenAI Gym from other languages via HTTProbosumo
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"EPG
Code for the paper "Evolved Policy Gradients"orrb
Code for the paper "OpenAI Remote Rendering Backend"phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"miniF2F
Formal to Formal Mathematics Benchmarkatari-reset
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"spinningup-workshop
For educational materials related to the spinning up workshops.train-procgen
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"dallify-discord-bot
Example code for using OpenAI’s NodeJS SDK with discord.js SDK to create a Discord Bot that uses Slash Commands.gym3
Vectorized interface for reinforcement learning environmentsretro-baselines
Publicly releasable baselines for the Retro contestlean-gym
neural-gpu
Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"baselines-results
go-vncdriver
Fast VNC driverhuman-eval-infilling
Code for the paper "Efficient Training of Language Models to Fill in the Middle"tabulate
public release of Excel / OpenAI API integrationdistribution_augmentation
Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020.consistency_models_cifar10
Consistency models trained on CIFAR-10, in JAX.Love Open Source and this site? Check out how you can help us