applenob/rl_learn

Stars
335
Rank 125,904 (Top 3 %)
Language
Jupyter Notebook
Created about 7 years ago
Updated over 5 years ago

applenob/rl_learn

applenob

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

我的强化学习笔记和学习材料📖 still updating ... ...

[WIP]强化学习的学习仓库

这是我个人学习强化学习的时候收集的比较经典的学习资料、笔记和代码，分享给所有人。

为了直接在GitHub上用markdown文件看公式，推荐安装chrome插件：MathJax Plugin for Github

入门指南

入门指南

课程笔记

David Silver 的 Reinforcement Learning 课程学习笔记。
课程对应的所有PPT
Sutton 的 Reinforcement Learning: An Introduction书本学习笔记
书本的各版本pdf
- 2017-6 draft
- 2018 second edition

实验目录

所有的实验源代码都在lib目录下，来自dennybritz。在原先代码的基础上，增加了对实验背景的具体介绍、代码和公式的对照。

Gridworld：对应MDP的Dynamic Programming
Blackjack：对应Model Free的Monte Carlo的Planning和Controlling
Windy Gridworld：对应Model Free的Temporal Difference的On-Policy Controlling：SARSA。
Cliff Walking：对应Model Free的Temporal Difference的Off-Policy Controlling：Q-learning。
Mountain Car：对应Q表格很大无法处理（state空间连续）的Q-Learning with Linear Function Approximation。
Atari：对应Deep-Q Learning。

其他重要学习资料：

Cpp_Primer_Practice

搞定C++👊。C++ Primer 中文版第5版学习仓库，包括笔记和课后练习答案。

RNN-for-Joint-NLU

Tensorflow implementation of "Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling" (https://arxiv.org/abs/1609.01454)

Jupyter Notebook

paper_manager

A command-line manager programed in python, help with managing your local academic papers.

pick_a_name

😄 从此爸妈没烦恼！！！

simple_crf

simple Conditional Random Field implementation in Python

intern

使用python爬取水木清华和北大未名的实习信息，在微信客户端进行展示。

algorithm_note

算法和数据结构学习笔记

Jupyter Notebook

clip_chinese_text_encoder

CLIP中文encoder

Jupyter Notebook

machine_learning_basic

my machine learning notes

Jupyter Notebook

tf_jieba

Tensorflow Operation Wrapper of cppjieba (Chinese Word Segamentation)

nlp_projects

my nlp projects notebook

Jupyter Notebook

text_normalization

code and notes for kaggle competetion Text Normalization Challenge - English Language(https://www.kaggle.com/c/text-normalization-challenge-english-language)

Jupyter Notebook

intern_email

爬取实习信息，并发送邮件。

applenob.github.io

tensorflow_learning_notes

tensorflow实践集

Jupyter Notebook

tf_chat_seq2seq

A Seq2Seq generative chat-bot baseline implemented by Tensorflow.

linux_basic

notebooks for my linux learning blog.

Jupyter Notebook

weibo_spider

使用selenium和Firefox浏览器，模拟登陆新浪微博，并且爬取微博内容和评论。

delta_demo

A demo for delta-nlp usage

learning_notebook

各方面零散的学习笔记。

Jupyter Notebook

paper_note

我的论文阅读笔记

reading_note

我的读书笔记

Jupyter Notebook

myNote

a note app written in js

spider

爬取北京大学软件学院新闻

deep_learning_note

深度学习相关笔记

Jupyter Notebook

pandas_note

my pandas/numpy/matplotlib learning note

Jupyter Notebook

travis_test

topic_class

凤凰网新闻爬取和分类（Toy using sklearn）