• Stars
    star
    247
  • Rank 164,117 (Top 4 %)
  • Language
    HTML
  • License
    MIT License
  • Created almost 5 years ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

This is a private learning repository for reinforcement learning techniques used in robotics.

#! https://zhuanlan.zhihu.com/p/143392167

如需转发,烦请邮件告知 [email protected]

Reinforcement-Learning-in-Robotics Content 专栏目录

This is a private learning repository about Reinforcement learning techniques, Reasoning, and Representation learning used in Robotics, founded for Real intelligence.

Reinforcement Learning Foundation

  1. 神经网络基础:反向传播推导与卷积公式 [Zhihu]
  2. 强化学习基础 Ⅰ:马尔可夫与值函数 [Zhihu]
  3. 强化学习基础 Ⅱ:动态规划,蒙特卡洛,时序差分 [Zhihu]
  4. 强化学习基础 Ⅲ:on-policy, off-policy & Model-based, Model-free & Rollout [Zhihu]
  5. 强化学习基础 Ⅳ:State-of-the-art 强化学习经典算法汇总 [Zhihu]
  6. 强化学习基础 Ⅴ:Q learning 原理与实战 [Zhihu]
  7. 强化学习基础 Ⅵ:DQN 原理与实战 [Zhihu]
  8. 强化学习基础 Ⅶ:Double DQN & Dueling DQN 原理与实战 [Zhihu]
  9. 强化学习基础 Ⅷ:Vanilla Policy Gradient 策略梯度原理与实现 [Zhihu]
  10. 强化学习基础 Ⅸ:一文读懂 TRPO 原理与实现 [Zhihu]
  11. 强化学习基础 Ⅹ:一文读懂两种 PPO 原理与实现 [Zhihu]
  12. 强化学习基础 Ⅺ: Actor-Critic & A2C 原理与实现 [Zhihu]
  13. 强化学习基础 Ⅻ:DDPG 原理与实现 [Zhihu]
  14. 强化学习基础 XIII:Twin Delayed DDPG TD3原理与实现 [Zhihu]

Model-based RL

  1. Model-Based RL Ⅰ:Dyna, MVE & STEVE [Zhihu]
  2. Model-Based RL Ⅱ:MBPO原理解读 [Zhihu]
  3. Model-Based RL Ⅲ:从源码读懂PILCO [Zhihu]

Probabilistic in Robotics

  1. PR 序:机器人学的概率方法学习路径 [Zhihu]
  2. PR Ⅰ:最大似然估计MLE与最大后验概率估计MAP [Zhihu]
  3. PR Ⅱ:贝叶斯估计/推断及其与MAP的区别 [Zhihu]
  4. PR Ⅲ:从高斯分布到高斯过程、高斯过程回归、贝叶斯优化 [Zhihu]
  5. PR Ⅳ:贝叶斯神经网络 Bayesian Neural Network [Zhihu]
  6. PR Ⅴ:熵、KL散度、交叉熵、JS散度及python实现 [Zhihu]
  7. PR Ⅵ:多元连续高斯分布的KL散度及python实现 [Zhihu]
  8. PR Sampling Ⅰ:蒙特卡洛采样、重要性采样及python实现 [Zhihu]
  9. PR Sampling Ⅱ:马尔可夫链蒙特卡洛 MCMC及python实现 [Zhihu]
  10. PR Sampling Ⅲ:M-H and Gibbs 采样 [Zhihu]
  11. PR Structured Ⅰ:Graph Neural Network: An Introduction Ⅰ [Zhihu]
  12. PR Structured Ⅱ:Structured Probabilistic Model 结构化概率模型 [Zhihu]
  13. PR Structured Ⅲ:马尔可夫、隐马尔可夫 HMM 、条件随机场 CRF 全解析及其python实现 [Zhihu]
  14. PR Structured Ⅳ:General / Graph Conditional Random Field (CRF) 及其 python 实现 [Zhihu]
  15. PR Structured Ⅴ:GraphRNN——将图生成问题转化为序列生成 [Zhihu]
  16. PR Reasoning 序:Reasoning Robotics 推理机器人学习路线与资源汇总 [Zhihu]
  17. PR Reasoning Ⅰ:Bandit问题与 UCB / UCT / AlphaGo [Zhihu]
  18. PR Reasoning Ⅱ:Relational Inductive bias 关系归纳偏置及其在深度学习中的应用 [Zhihu]
  19. PR Reasoning Ⅲ:基于图表征的关系推理框架 —— Graph Network [Zhihu]
  20. PR Reasoning Ⅳ:数理逻辑(命题逻辑、谓词逻辑)知识整理 [Zhihu]
  21. PR Memory Ⅰ:Memory systems 2018 – towards a new paradigm 【重磅综述】记忆系统——神经科学的启示 [Zhihu]
  22. PR Perspective Ⅰ:Embodied AI 的新浪潮 —— new generation of AI [Zhihu]
  23. PR Perspective Ⅱ:2021/08/03 近期 Robot Learning 领域大事件及思考 [Zhihu]
  24. PR Efficient Ⅰ:机器人中的数据高效强化学习 [Zhihu]
  25. PR Efficient Ⅱ:Bayesian Transfer RL with prior knowledge [Zhihu]
  26. PR Efficient Ⅲ:像训练狗狗一样高效地训练机器人 [Zhihu]
  27. PR Efficient Ⅳ:五分钟内让四足机器人自主学会行走 [Zhihu]
  28. PR Efficient Ⅴ:自预测表征,让RL agent高效地理解世界 [Zhihu]

Meta-Learning

  1. Meta-Learning: An Introduction Ⅰ [Zhihu]
  2. Meta-Learning: An Introduction Ⅱ [Zhihu]
  3. Meta-Learning: An Introduction Ⅲ [Zhihu]

Imitation Learning

  1. Imitation Learning Ⅰ:模仿学习 (Imitation Learning) 入门指南 [Zhihu]
  2. Imitation Learning Ⅱ:DAgger透彻理论分析 [Zhihu]
  3. Imitation Learning Ⅲ:EnsembleDAgger 一种贝叶斯DAgger [Zhihu]

RL from Demonstrations

  1. RLfD Ⅰ:Deep Q-learning from Demonstrations 解读 [Zhihu]
  2. RLfD Ⅱ:Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance [Zhihu]

Multi-agent Reinforcement Learning

  1. MARL Ⅰ:A Selective Overview of Theories and Algorithms 【重磅综述】 多智能体强化学习算法理论研究 [Zhihu]

Paper Reading

Active Visual Navigation

  1. Reading:利用物体关系的目标驱动视觉导航 [Zhihu]
  2. Reading:Learning to learn how to learn-Meta自适应视觉导航 [Zhihu]
  3. Reading:Bayesian Relational Memory 在视觉导航中的应用 [Zhihu]
  4. Reading:Attention+3D空间关系图在视觉导航中的应用 [Zhihu]
  5. Reading:机器人导航的半参数化拓扑记忆结构 [Zhihu]
  6. Reading:将Transformer应用到机器人视觉导航中 [Zhihu]

RL robotics in the physical world with micro-data / data-efficiency

  1. 【重磅综述】如何在少量尝试下学习机器人强化学习控制 [Zhihu]

Others

  1. End-to-End Robotic Reinforcement Learning without Reward Engineering: [Medium] [Zhihu]
  2. Overcoming Exploration in RL with Demonstrations: [Medium] [Zhihu]
  3. The Predictron: End-To-End Learning and Planning: [Zhihu]
  4. IROS2019 Paper速读(一) [Zhihu]
  5. IROS2019 Paper速读(二) [Zhihu]
  6. IROS2019 Paper速读(三) [Zhihu]
  7. IROS2019 Paper速读(四) [Zhihu]

Simulator

  1. MuJoCo自定义机器人建模指南
  2. Sim2real in Robotics: An Introduction

Tools

  1. Tools 1:如何用 PyQt5 和 Qt Designer 在 Pycharm 中愉快地开发软件 [Zhihu]
  2. Tools 2:Arxiv 论文提交流程——看这篇就够了 [Zhihu]
  3. Tools 3:Python socket 服务器与客户端双向通信(服务器NAT,文件传输) [Zhihu]
  4. Tools 4:Python三行转并行——真香![Zhihu]
  5. Tools 5:Python三行转并行后续——多进程全局变量 [Zhihu]

专栏关联Github代码库

Reinforcement-Learning-in-Robotics
Machine-Learning-is-ALL-You-Need

More Repositories

1

Rofunc

🤖 The Full Process Python Package for Robot Learning from Demonstration and Robot Manipulation
Python
437
star
2

Machine-Learning-is-ALL-You-Need

🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Python
369
star
3

awesome-bimanual-manipulation

Robot bimanual manipulation / dual-arm manipulation
116
star
4

System_Identification

The usage of MATLAB System Identification Toolbox and PID parameters adjustment
MATLAB
42
star
5

awesome-target-driven-navigation

Paper and summaries about state-of-the-art robot Target-driven Navigation task
40
star
6

Rofunc-ros

A Ros Package for Human-centered Interactive Intelligent Humanoid Robots
Python
22
star
7

Gamma_Reward

Python
13
star
8

advanced_robotics_project

Final project for ENGG 5402 Advanced Robotics in CUHK
Python
13
star
9

awesome-offlineRL-in-robotics

8
star
10

awesome-long-horizon-manipulation

6
star
11

TendonTrack_PILCO

Python
5
star
12

To_be_a_Roboticist

TeX
5
star
13

HuggingFace-finetune-tutorial

Python
5
star
14

DDPG-pytorch-tf-cpp

DDPG libtorch version + tensorflow c++ version: for real-time accelerate reinforcement learning
Makefile
4
star
15

RM_Infantry-1

Robomaster 2019 SJTU JDragon
C
3
star
16

awesome-model-based-reinforcement-learning

3
star
17

TendonTrack

Active Vision Tracking of a Tendon-driven continuum robot by using efficient model-based Reinforcement Learning
C
3
star
18

Solution_of_quantum_books

A gathering of unofficial solutions to the exercises in Quantum Computation and Quantum Information and Quantum Information Theory
TeX
2
star
19

Mathematics-for-engineering

Record my learning notes about Mathematics
2
star
20

awesome-data-efficient-reinforcement-learning

1
star