• Stars: 7
• Rank: 2,294,772 (Top 46 %)
• Language: Jupyter Notebook
• License: MIT License
• Created over 2 years ago
• Updated about 2 years ago

More Repositories

1

CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Python
944
star
2

etna

ETNA – Time-Series Library
Python
857
star
3

katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
Python
64
star
4

ReBRAC

Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
Jupyter Notebook
50
star
5

sac-rnd

Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
Python
40
star
6

palbert

Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight
Python
34
star
7

probabilistic-embeddings

"Probabilistic Embeddings Revisited" paper official repository
Python
26
star
8

eop

Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022
Jupyter Notebook
26
star
9

open-tlab

Example proposals for applying to Open.TLab
23
star
10

lb-sac

Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop
Python
17
star
11

cnf

Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, Offline RL Workshop
Python
10
star
12

exact

The original PyTorch implementation of the paper "EXACT: How to Train Your Accuracy"
Python
10
star
13

dl-course

Jupyter Notebook
5
star
14

pycon-chit-chat

Jupyter Notebook
4
star
15

sigir-2021

4th place solution for the SIGIR 2021 challenge.
Python
4
star