• Stars
    star
    5,900
  • Rank 6,855 (Top 0.2 %)
  • Language
    Jupyter Notebook
  • License
    The Unlicense
  • Created almost 8 years ago
  • Updated 23 days ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A course in reinforcement learning in the wild

Practical_RL

An open course on reinforcement learning in the wild. Taught on-campus at HSE and YSDA and maintained to be friendly to online students (both english and russian).

Manifesto:

  • Optimize for the curious. For all the materials that aren’t covered in detail there are links to more information and related materials (D.Silver/Sutton/blogs/whatever). Assignments will have bonus sections if you want to dig deeper.
  • Practicality first. Everything essential to solving reinforcement learning problems is worth mentioning. We won't shun away from covering tricks and heuristics. For every major idea there should be a lab that makes you to “feel” it on a practical problem.
  • Git-course. Know a way to make the course better? Noticed a typo in a formula? Found a useful link? Made the code more readable? Made a version for alternative framework? You're awesome! Pull-request it!

Github contributors

Course info

Additional materials

Syllabus

The syllabus is approximate: the lectures may occur in a slightly different order and some topics may end up taking two weeks.

  • week01_intro Introduction

    • Lecture: RL problems around us. Decision processes. Stochastic optimization, Crossentropy method. Parameter space search vs action space search.
    • Seminar: Welcome into openai gym. Tabular CEM for Taxi-v0, deep CEM for box2d environments.
    • Homework description - see week1/README.md.
  • week02_value_based Value-based methods

    • Lecture: Discounted reward MDP. Value-based approach. Value iteration. Policy iteration. Discounted reward fails.
    • Seminar: Value iteration.
    • Homework description - see week2/README.md.
  • week03_model_free Model-free reinforcement learning

    • Lecture: Q-learning. SARSA. Off-policy Vs on-policy algorithms. N-step algorithms. TD(Lambda).
    • Seminar: Qlearning Vs SARSA Vs Expected Value SARSA
    • Homework description - see week3/README.md.
  • recap_deep_learning - deep learning recap

    • Lecture: Deep learning 101
    • Seminar: Intro to pytorch/tensorflow, simple image classification with convnets
  • week04_approx_rl Approximate (deep) RL

    • Lecture: Infinite/continuous state space. Value function approximation. Convergence conditions. Multiple agents trick; experience replay, target networks, double/dueling/bootstrap DQN, etc.
    • Seminar: Approximate Q-learning with experience replay. (CartPole, Atari)
  • week05_explore Exploration

    • Lecture: Contextual bandits. Thompson Sampling, UCB, bayesian UCB. Exploration in model-based RL, MCTS. "Deep" heuristics for exploration.
    • Seminar: bayesian exploration for contextual bandits. UCB for MCTS.
  • week06_policy_based Policy Gradient methods

    • Lecture: Motivation for policy-based, policy gradient, logderivative trick, REINFORCE/crossentropy method, variance reduction(baseline), advantage actor-critic (incl. GAE)
    • Seminar: REINFORCE, advantage actor-critic
  • week07_seq2seq Reinforcement Learning for Sequence Models

    • Lecture: Problems with sequential data. Recurrent neural networks. Backprop through time. Vanishing & exploding gradients. LSTM, GRU. Gradient clipping
    • Seminar: character-level RNN language model
  • week08_pomdp Partially Observed MDP

    • Lecture: POMDP intro. POMDP learning (agents with memory). POMDP planning (POMCP, etc)
    • Seminar: Deep kung-fu & doom with recurrent A3C and DRQN
  • week09_policy_II Advanced policy-based methods

    • Lecture: Trust region policy optimization. NPO/PPO. Deterministic policy gradient. DDPG
    • Seminar: Approximate TRPO for simple robot control.
  • week10_planning Model-based RL & Co

    • Lecture: Model-Based RL, Planning in General, Imitation Learning and Inverse Reinforcement Learning
    • Seminar: MCTS for toy tasks
  • yet_another_week Inverse RL and Imitation Learning

    • All that cool RL stuff that you won't learn from this course :)

Course staff

Course materials and teaching by: [unordered]

Contributions

More Repositories

1

nlp_course

YSDA course in Natural Language Processing
Jupyter Notebook
9,774
star
2

Practical_DL

DL course co-developed by YSDA, HSE and Skoltech
Jupyter Notebook
1,559
star
3

AgentNet

Deep Reinforcement Learning library for humans
Python
301
star
4

deep_vision_and_graphics

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.
Jupyter Notebook
300
star
5

speech_course

YSDA course in Speech Processing.
Jupyter Notebook
200
star
6

sdc_course

Short course about self-driving cars
JavaScript
157
star
7

roc_comparison

The fast version of DeLong's method for computing the covariance of unadjusted AUC.
Python
144
star
8

YSDA_deeplearning17

Yandex SDA classes on deep learning. Version of year 2017
Jupyter Notebook
116
star
9

MLatImperial2017

Materials for the course of machine learning at Imperial College organized by Yandex SDA
Jupyter Notebook
81
star
10

mlhep2016

Machine Learning in High Energy Physics 2016
Jupyter Notebook
75
star
11

sklearn-deeprl

Deep reinforcement learning. In scikit-learn. In less than 50 effective lines.
Jupyter Notebook
52
star
12

MLatGradDays

Course of Machine Learning in Science and Industry at Heidelberg university
Jupyter Notebook
47
star
13

mlhep2019

MLHEP'19 slides and notebooks
Jupyter Notebook
45
star
14

ml-training-website

ML Training website
HTML
43
star
15

python_public

Открытые материалы курса по питону
Jupyter Notebook
42
star
16

flavours-of-physics-start

Starter kit for "Flavours of Physics" challenge at Kaggle
Python
41
star
17

gumbel_lstm

Experiments with binary LSTM using gumbel-sigmoid
Jupyter Notebook
30
star
18

satellite-collision-avoidance

RL for optimal satellite collision avoidance maneuvres
Python
26
star
19

mlhep2018

MLHEP-18 slides and stuff
Jupyter Notebook
26
star
20

mlhep2017

MLHEP 2017 slides & seminars
Jupyter Notebook
26
star
21

CSC_deeplearning

3-day dive into deep learning at csc
Jupyter Notebook
25
star
22

manytask

The auto solution checking system for YSDA; server, storing grades and managing deadlines
Python
24
star
23

IDAO-2019-muon-id

Problem for IDAO 2019 on LHCb Muon Identification
Jupyter Notebook
24
star
24

MLatImperial2016

Materials for the course of machine learning at Imperial College organized by YSDA
Jupyter Notebook
23
star
25

ML-Handbook-materials

Notebooks and other media for ML Handbook
Jupyter Notebook
19
star
26

mlhep2015

MLHEP 2015 materials (http://hse.ru/mlhep2015)
Shell
19
star
27

modelgym

Gym for predictive models
Jupyter Notebook
17
star
28

mlhep2020-assignments

Jupyter Notebook
17
star
29

algorithms

Тут находится страница курса алгоритмов yandexdataschool.ru
CSS
15
star
30

students_projects

Темы студенческих проектов
Makefile
13
star
31

cms-dqm

CMS data quality monitoring
Jupyter Notebook
11
star
32

gumbel_dpg

Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.
Jupyter Notebook
11
star
33

MLatImperial2020

Jupyter Notebook
11
star
34

tinyverse

Universe RL trainer platform. Simple. Supple. Scalable.
Jupyter Notebook
10
star
35

MLatImperial2018

ML at Imperial College
Jupyter Notebook
9
star
36

neurohack-2016-starterkit

neurohack starter kit
Jupyter Notebook
8
star
37

asml

Jupyter Notebook
7
star
38

dlatscale_draft

This is an early version of Deep Learning at Scale course for Yandex School of Data Analysis
Jupyter Notebook
7
star
39

inverse-problem-intensive

A short course on simulation-based infernce for physics at YSDA in April 2021
Jupyter Notebook
7
star
40

QuantileTransformerTF

Tensorflow implementation of sklearn.preprocessing.QuantileTransformer
Python
7
star
41

dqn_binder

a deep reinforcement learning tutorial
Jupyter Notebook
6
star
42

PreciseGAN

A research repo for studying different techniques towards making more precise GANs
Python
6
star
43

reproducible_analysis_course

A course on tools for collaborative and reproducible machine learning
Jupyter Notebook
6
star
44

MLHEP-2020-muon-id

Muon identification chellenge for MLHEP-2020
Jupyter Notebook
5
star
45

MLatImperial2022

Jupyter Notebook
5
star
46

mlhep-course-2016

materials for course on machine learning for HEP at YSDA
Jupyter Notebook
5
star
47

crayimage

A toolkit for image manipulation. Not for humans.
Python
5
star
48

ML-Handbook

JavaScript
5
star
49

dt

Python
5
star
50

mlhep-course-2017

materials for course on machine learning for HEP at YSDA
Jupyter Notebook
5
star
51

neurohack-2016-winners

Winners of neuroscience hackathon
Jupyter Notebook
5
star
52

mlhep2018-starterkit

Starter kit for MLHEP-18 challenge
Jupyter Notebook
5
star
53

cuda_course

Cuda
4
star
54

DataPopularity

Storage optimization for LHCb experiment.
Jupyter Notebook
4
star
55

dl-course-tensorflow

[in progress] Translating all our materials from https://github.com/ddtm/dl-course to tensorflow
4
star
56

darkmatter-2017

Jupyter Notebook
4
star
57

ship_tracks_recognition

Jupyter Notebook
3
star
58

checker

The auto solution checking system for YSDA; client, checking solutions and sending grades
Python
3
star
59

lilbert

Jupyter Notebook
3
star
60

mlda

Machine Learning, Data Analysis course materials
Makefile
3
star
61

MLatMISiS2018

Machine Learning track for Physics at MISiS
Jupyter Notebook
3
star
62

manchester-cp-asymmetry-tutorial

Manchester CP asymmetry tutorial
Jupyter Notebook
2
star
63

datanight2015-starterkit

Data Analysis Night StarterKit (https://academy.yandex.ru/events/data-analysis-night/2015/)
Python
2
star
64

REP_tutorial

Examples of using yandex/rep framework
Jupyter Notebook
2
star
65

cern_summer_school_2017

CERN openlab Summer School 2017 Machine Learning - Parts 3 &4
Jupyter Notebook
2
star
66

sentinels

sentinels data analysis
Jupyter Notebook
2
star
67

cern-higgsml-baseline

baseline solution for HiggsML challenge using data from CERN open data portal
Jupyter Notebook
2
star
68

crowd_course

Jupyter Notebook
1
star
69

cpp0_course

Python
1
star
70

KSfinder

Jupyter Notebook
1
star
71

HSE-DataNight-StarterKit

Everware version of HSE data night starter kit
Jupyter Notebook
1
star
72

aleph2015

Applying (machine) Learning to Experimental Physics (ALEPH) and «Flavours of Physics» challenge
HTML
1
star
73

everware-base-image

Python
1
star
74

mamontov-lhc-display

The Large Hadron Collider status display in Yandex HQ
JavaScript
1
star
75

pyretina

Python
1
star
76

reproducible_analysis_course_py3

Jupyter Notebook
1
star
77

eScience-2016-everware

The presentation for https://www.esciencecenter.nl/event/4th-national-escience-symposium
CSS
1
star
78

datanight2015-advanced-starterkit

Data Analysis Night Advanced StarterKit (https://academy.yandex.ru/events/data-analysis-night/2015/)
Python
1
star
79

moseskit

Train phrase-based machine translation in one bash command with decent defaults. Docker-powered.
Ruby
1
star
80

MLHEP2020-black-box

A competition for the MLHEP 2020 summer school.
Jupyter Notebook
1
star