• Stars
    star
    772
  • Rank 58,858 (Top 2 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created almost 7 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.

David-Silver-Reinforcement-learning

Tweet

apm Build Status PRs Welcome

This repository contains the notes for the Reinforcement Learning course by David Silver along with the implementation of the various algorithms discussed, both in Keras (with TensorFlow backend) and OpenAI's gym framework.

Syllabus:

  • Week 1: Introduction to Reinforcement Learning [slide][video]

  • Week 2: Markov Decision Processes [slide][video]

  • Week 3: Planning by Dynamic Programming [slide][video]

  • Week 4: Model-Free Prediction [slide][video]

  • Week 5: Model-Free Control [slide][video]

  • Week 6: Value Function Approximation [slide][video]

  • Week 7: Policy Gradient Methods [slide][video]

  • Week 8: Integrating Learning and Planning [slide][video]

  • Week 9: Exploration and Exploitation [slide][video]

  • Week 10: Case Study: RL in Classic Games [slide][video]

Dependencies

  • TensorFlow
  • Keras
  • Gym
  • Numpy

Install them using pip.

Contributing

Please feel free to create a Pull Request for adding implementations of the algorithms discussed in different frameworks like PyTorch, Caffe, etc. or improving the existing implementations. If you are a beginner, you can refer this for getting started.

Support

If you found this useful, please consider starring(★) the repo so that it can reach a broader audience.

License

This project is licensed under the MIT License - see the LICENSE file for details.

References

More Repositories

1

Deep-Learning-Book-Chapter-Summaries

Attempting to make the Deep Learning Book easier to understand.
Jupyter Notebook
1,068
star
2

siren

PyTorch implementation of Sinusodial Representation networks (SIREN)
Python
262
star
3

Operating-Systems

'Operating System Concepts' - Solutions to exercises and projects
C
125
star
4

Coursera-Specializations

Solutions to assignments of Coursera Specializations - Deep learning, Machine learning, Algorithms & Data Structures, Image Processing and Python For Everybody
Jupyter Notebook
78
star
5

udacity-deep-reinforcement-learning

My solutions to the projects (and mini-projects) of the Deep Reinforcement Learning Nanodegree by Udacity
Jupyter Notebook
63
star
6

WannaPark

Project aimed at presenting a model to find a vacant parking spot in real time and ensure car safety using Deep Learning (Parking spot Classification and Face recognition).
Python
34
star
7

Quora-Question-Pairs

The code for our submission in Kaggle's competition Quora Question Pairs which ranked in the top 25%.
Python
30
star
8

Brain-Tumor-Segmentation-Keras

Keras implementation of the multi-channel cascaded architecture introduced in the paper "Brain Tumor Segmentation with Deep Neural Networks"
Jupyter Notebook
23
star
9

Deep-learning-tutorials

Deep learning tutorials for classification of MNIST digits using CNNs and solutions to assignments for Udacity's deep learning course
Jupyter Notebook
18
star
10

machine-learning-paper-notes

This repository contains my notes for the research papers that I read for anyone to briefly glance over the details.
13
star
11

P2_Continuous_Control

My solution code for the second project of Udacity's Deep Reinforcement Learning Nanodegree
ASP
5
star
12

Bayesian_Decision_Making-Datagiri_Mumbai

Jupyter notebook accompanying my talk on "Bayesian Decision Making" for DataGiri
Jupyter Notebook
3
star
13

P1_Navigation

My solution code for the first project of Udacity's Deep Reinforcement Learning Nanodegree
ASP
2
star
14

CarND-Advance-Lane-Lines-P2

The code for my submission to the second project of Udacity's Self-driving Car Nanodegree Program
Jupyter Notebook
2
star
15

Social-Network

Social networking website using laravel framework
PHP
2
star
16

CarND-Traffic-Sign-Classifier-P3

The code for my submission for the third project of Udacity's Self-driving Car Nanodegree Program
Jupyter Notebook
2
star
17

streamlit-basics

A very simple app to learn the basics of streamlit
Python
1
star
18

CapsNet-Keras

Keras implementation of the NIPS 2017 paper "Dynamic Routing between Capsule"
Jupyter Notebook
1
star
19

Communication-Networks

Java
1
star
20

Sunshine-Advanced

Displays the weather data for the next 2 weeks using OpenWeatherMap API
Java
1
star
21

Cython-tutorials

HTML
1
star
22

CarND-Lane-Finding-P1

The code for my submission to the first project of Udacity's Self-driving Car Nanodegree Program
Jupyter Notebook
1
star
23

CarND-Behavioral-Cloning-P4

The code for my submission for the fourth project of Udacity's Self-driving Car Nanodegree Program
Python
1
star