Zhenye-Na/reinforcement-learning-stanford

Stars: 297
Rank: 140,075 (Top 3%)
Language: Python
Created: over 5 years ago
Updated: over 1 year ago


🕹️ CS234: Reinforcement Learning, Winter 2019 | YouTube videos 👉

CS234: Reinforcement Learning, Stanford

Reinforcement Learning (Agent and environment). image source: Unity's blog on Unity Machine Learning Agents Toolkit

This repo contains homework, exams, and slides that I collected from the internet, without solutions. It is intended only for students and developers who are interested in this topic. If this repo infringes on your rights, please do not hesitate to contact me. I promise I will delete it (both the repo and its history) as soon as possible.

Course Description

To realize the dreams and impact of AI requires autonomous systems that learn to make good decisions. Reinforcement learning is one powerful paradigm for doing so, and it is relevant to an enormous range of tasks, including robotics, game playing, consumer modeling and healthcare. This class will provide a solid introduction to the field of reinforcement learning and students will learn about the core challenges and approaches, including generalization and exploration. Through a combination of lectures, and written and coding assignments, students will become well versed in key ideas and techniques for RL. Assignments will include the basics of reinforcement learning as well as deep reinforcement learning — an extremely promising new area that combines deep learning techniques with reinforcement learning. In addition, students will advance their understanding and the field of RL through a final project.

Textbooks

There is no official textbook for the class but a number of the supporting readings will come from:

Reinforcement Learning: An Introduction, Sutton and Barto, 2nd Edition. This is available for free here and references will refer to the final pdf version available here.

Some other additional references that may be useful are listed below:

Reinforcement Learning: State-of-the-Art, Marco Wiering and Martijn van Otterlo, Eds. [link]
Artificial Intelligence: A Modern Approach, Stuart J. Russell and Peter Norvig. [link]
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville. [link]
David Silver's course on Reinforcement Learning. [link]

Course Materials

Lecture notes and slides can be found [here].

DA-RNN

📃 𝖀𝖓𝖔𝖋𝖋𝖎𝖈𝖎𝖆𝖑 PyTorch Implementation of DA-RNN (arXiv:1704.02971)

Jupyter Notebook

machine-learning-uiuc

🖥️ CS446: Machine Learning in Spring 2018, University of Illinois at Urbana-Champaign

CSAPP-Labs

💻 Computer Systems: A Programmer's Perspective, Lab Assignments Solutions

image-similarity-using-deep-ranking

🖼️ 𝖀𝖓𝖔𝖋𝖋𝖎𝖈𝖎𝖆𝖑 PyTorch implementation of "Learning Fine-grained Image Similarity with Deep Ranking" (arXiv:1404.4661)

advanced-deep-learning-and-reinforcement-learning-deepmind

🎮 Advanced Deep Learning and Reinforcement Learning at UCL & DeepMind | YouTube videos 👉

Jupyter Notebook

data-structures-ucb

🌳 CS 61B: Data Structures in Spring 2018, University of California, Berkeley

zhenye-na

e2e-learning-self-driving-cars

🚗 𝖀𝖓𝖔𝖋𝖋𝖎𝖈𝖎𝖆𝖑 PyTorch implementation of "End-to-End Learning for Self-Driving Cars" (arXiv:1604.07316) with Udacity's Simulation env

Jupyter Notebook

crnn-pytorch

✍️ Convolutional Recurrent Neural Network in Pytorch | Text Recognition

Jupyter Notebook

giligili

computer-vision-uiuc

🖼️ CS543 / ECE549: Computer Vision in Spring 2018, University of Illinois at Urbana-Champaign

gcn-spp

Shortest Path prediction using Graph Convolutional Networks

Jupyter Notebook

SQL-Exercises

💾 WIKIBOOKS: SQL Exercises

neural-style-pytorch

📄 𝖀𝖓𝖔𝖋𝖋𝖎𝖈𝖎𝖆𝖑 PyTorch implementation of "A Neural Algorithm of Artistic Style" (arXiv:1508.06576)

data-structures-uiuc

🌳 CS225: Data Structures

cs106b

:neckbeard: CS 106B: Programming Abstractions (C++) | Spring 2017

database-systems-uiuc

💾 CS411: Database Systems in Spring 2018, UIUC

leetcode

👨‍💻 This repository contains solutions and explanations for algorithm problems on LeetCode, implemented in Python or Java. Code skeletons are generated automatically via the `vscode-leetcode` plugin.

pokemon-gan

🐼 Generating new Pokemons with Wasserstein DCGAN | TensorFlow Implementation

lintcode

👨‍💻 This repository contains the solutions and explanations to the algorithm problems on LintCode. All are written in Python/Java/C++ and implemented by myself.

coursera-ml

💡 This repository contains all of the lecture exercises from the Machine Learning course by Andrew Ng, Stanford University @ Coursera. All are implemented by myself in MATLAB/Octave.

computational-advertising-uiuc

💸 CS498HS4: Computational Advertising in Fall 2018, UIUC

aws-certs-cheatsheet

💯 Cheatsheets for AWS Certified Exams - AWS Certified Solutions Architect Associate

blog

📔 Technical blog

algo-for-data-analytics

IE531: Algorithms for Data Analytics in Spring 2018, UIUC

pan.go

💾 A Tiny Golang-based Distributed Cloud Storage Service | MySQL, Redis, RabbitMQ, Docker and Ceph

viola-jones-algo

👨👩 Viola Jones Face Detection

marketplace

🏪 Node.js based Marketplace Web Application

Pymelody

🎶 Classical Music Generation with Machine Learning

tiny-url

🔗 URL shortening service built with Golang

Deep-Learning-Specialization

⚛️ Deep Learning Specialization by deeplearning.ai

Jupyter Notebook

practical-http

HTTP Protocol Principles + Practice: A Must-Learn for Web Development Engineers

zhenye-na.github.io

learn.go

analysis-of-network-data

IE532: Analysis of Network Data in Fall 2017, UIUC

Jupyter Notebook