dalmia/David-Silver-Reinforcement-learning

Stars: 772
Rank: 58,858 (Top 2%)
Language: Jupyter Notebook
License: MIT License
Created: almost 7 years ago
Updated: over 2 years ago


Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.

David-Silver-Reinforcement-learning

This repository contains notes for David Silver's Reinforcement Learning course, along with implementations of the algorithms discussed, written using Keras (with a TensorFlow backend) and OpenAI's Gym framework.

Syllabus:

Week 1: Introduction to Reinforcement Learning [slide][video]
Week 2: Markov Decision Processes [slide][video]
Week 3: Planning by Dynamic Programming [slide][video]
Week 4: Model-Free Prediction [slide][video]
Week 5: Model-Free Control [slide][video]
Week 6: Value Function Approximation [slide][video]
Week 7: Policy Gradient Methods [slide][video]
Week 8: Integrating Learning and Planning [slide][video]
Week 9: Exploration and Exploitation [slide][video]
Week 10: Case Study: RL in Classic Games [slide][video]

Dependencies

TensorFlow
Keras
Gym
Numpy

Install them using pip.
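
As a quick sanity check after installing, something like the following sketch can report which of the packages listed above are not yet importable. It assumes the import names are `tensorflow`, `keras`, `gym`, and `numpy`; the helper `missing_dependencies` is a hypothetical name used for illustration and is not part of this repository.

```python
import importlib.util

# Import names assumed from the dependency list above.
REQUIRED = ["tensorflow", "keras", "gym", "numpy"]


def missing_dependencies(names=REQUIRED):
    """Return the names in `names` that cannot be found on the import path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        # Suggest a pip command covering only the packages that were not found.
        print("Missing packages; try: pip install " + " ".join(missing))
    else:
        print("All dependencies are importable.")
```

The check only confirms that each package can be located; it does not verify that the installed versions are compatible with the notebooks.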

Contributing

Please feel free to create a Pull Request that adds implementations of the algorithms in other frameworks (PyTorch, Caffe, etc.) or improves the existing implementations. If you are a beginner, you can refer to this to get started.

Support

If you found this useful, please consider starring (★) the repo so that it can reach a broader audience.

License

This project is licensed under the MIT License - see the LICENSE file for details.
