• Stars
    star
    1
  • Language
    Jupyter Notebook
  • Created over 3 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Maximum Entropy Inverse Reinforcement Learning - notes and tutorial for IRL using the principle of maximum entropy

More Repositories

1

intro_continual_learning

This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of artificial intelligence
Jupyter Notebook
345
star
2

RL-Chat-pytorch

reinforcement learning on a encoder-decoder GRU for chatbot dialogue generation
Jupyter Notebook
17
star
3

minichatgpt

annotated tutorial of the huggingface TRL repo for reinforcement learning from human feedback connecting equations from PPO and GAE to the lines of code in the pytorch implementation
Jupyter Notebook
12
star
4

chat-transformer

A chatbot using the Vaswani transformer as it's sequence-to-sequence module
Jupyter Notebook
11
star
5

unsupervised-speech-representation-learning

This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that uses CPC to learn representations of sound files for the purpose of speech recognition
Jupyter Notebook
10
star
6

triton-ft-api

tutorial on how to deploy a scalable autoregressive causal language model transformer using nvidia triton server
Python
3
star
7

va-irl

Variational Adversarial Inverse Reinforcement Learning
Jupyter Notebook
2
star
8

proximalpolicyoptimization

basic implementation of PPO reinforcement learning algorithm on lunar lander
Jupyter Notebook
2
star
9

adaptive-computation-time

The notebook connects the formulas used in the paper to the code that implements those formulas by implementing a training pipeline on a small but meaningful dataset
HTML
1
star
10

General-Deep-Learning-NLP-Classifier

Template for multi-class classification with variable length sequences using Gated Recurrent Units
Jupyter Notebook
1
star
11

xgboost-tutorial

detailed tutorial on xgboost
Jupyter Notebook
1
star