• Stars
    star
    1,632
  • Rank 27,682 (Top 0.6 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 1 year ago
  • Updated 8 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )

PPO x Family 决策智能入门公开课

欢迎来到 PPO x Family 系列决策智能入门公开课。该系列将深入理解深度强化学习算法 PPO ,灵活运用一个 PPO 算法解决几乎所有常见的决策智能应用 ,帮助一切对于深度强化学习技术有好奇心的人,轻便且高效地制作应用原型,了解和学习最强大最易用的 PPO Family 。

P.S. 路过记得点个 star stars - ppof ,2022年12月起持续更新中~

News

课程大纲

内容导航

章节(视频课) 算法理论资料 补充资料 习题 代码样例 应用样例
第一章:开启决策AI探索之旅 课程PPT
课程文字稿
微课视频
策略梯度
A2C
TRPO
符号表
QA总结
习题
习题题解
PG算法示例
A2C算法示例
PPO算法示例
应用混剪
第二章:解构复杂动作空间 课程PPT
课程文字稿
重参数化
PPO&DDPG
HyAR
QA总结
习题
习题题解
离散动作示例
连续动作示例
混合动作示例
应用训练代码
火箭回收等
第三章:表征多模态动作空间 课程PPT
课程文字稿
表征学习
PPG
不变性
QA总结
习题
习题题解
编码方法示例
Wrapper示例
计算图示例
应用训练代码
软体机器人等
第四章:解密稀疏奖励空间 课程PPT
课程文字稿
逆强化学习
行为克隆BC
QA总结
习题
习题题解
好奇心奖励
Pop-Art示例
应用训练代码
自动驾驶等
第五章:探索时序建模 课程PPT 随机性策略
RWKV
Belief MDP
QA总结
习题
习题题解
LSTM示例
GTrXL示例
应用训练代码
记忆型决策
第六章:统筹多智能体 课程PPT HAPPO
ACE
值分解
QA总结
习题
习题题解
IndependentPG
MAPG
MAPPO
[HAPPO]
应用训练代码
多智能体协作
第七章:挖掘黑科技 课程PPT Adv 估计
PPO off 版
Entropy
习题 GAE
Recompute
梯度裁剪
正交初始化
Dual Clip
Value Clip
应用训练代码
学术基准环境

课程特点

一个算法解决万千应用 视频传送门

算法理论和代码实现一一对应 网站传送门

项目结构

.
├── LICENSE
├── assets                       --> 相关图片素材(转载请注明来源)
├── chapter2_action              --> 课程第二章相关内容
└── chapter1_overview            --> 课程第一章相关内容
    ├── chapter1_manuscript.pdf  --> 课程第一章文字稿(对于PPT的补充说明)
    ├── chapter1_lecture.pdf     --> 课程第一章PPT
    ├── chapter1_qa.pdf          --> 课程第一章答疑文稿
    ├── chapter1_homework.pdf    --> 课程第一章习题作业
    ├── chapter1_hw_solution.pdf   --> 课程第一章习题作业题解
    ├── chapter1_supp_trpo.pdf          --> 课程第一章补充材料(算法理论推导等)
    └── chapter1_demo_code.py    --> 课程第一章相关代码实现

课程答疑和反馈

License

PPOxFamily is released under the Apache 2.0 license.

More Repositories

1

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)
2,642
star
2

DI-engine

OpenDILab Decision AI Engine
Python
2,616
star
3

DI-star

An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
Python
1,149
star
4

LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
Python
885
star
5

awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)
661
star
6

awesome-decision-transformer

A curated list of Decision Transformer resources (continually updated)
536
star
7

DI-drive

Decision Intelligence Platform for Autonomous Driving simulation.
Python
510
star
8

awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)
509
star
9

LLMRiddles

Open-Source Reproduction/Demo of the LLM Riddles Game
Python
448
star
10

InterFuser

[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Python
435
star
11

GoBigger

[ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger could also help you with multi-agent decision intelligence study.
Python
431
star
12

LMDrive

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
Jupyter Notebook
370
star
13

DI-sheep

羊了个羊 + 深度强化学习(Deep Reinforcement Learning + 3 Tiles Game)
Python
362
star
14

awesome-multi-modal-reinforcement-learning

A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
295
star
15

awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)
262
star
16

awesome-end-to-end-autonomous-driving

A curated list of awesome End-to-End Autonomous Driving resources (continually updated)
252
star
17

DI-engine-docs

DI-engine docs (Chinese and English)
Python
243
star
18

DI-orchestrator

OpenDILab RL Kubernetes Custom Resource and Operator Lib
Go
217
star
19

treevalue

Here are the most awesome tree structure computing solutions, make your life easier. (这里有目前性能最优的树形结构计算解决方案)
Python
215
star
20

DI-smartcross

Decision Intelligence platform for Traffic Crossing Signal Control
Python
207
star
21

SO2

[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Python
205
star
22

DI-hpc

OpenDILab RL HPC OP Lib
Python
197
star
23

GoBigger-Challenge-2021

Interested in multi-agents? The 1st Go-Bigger Multi-Agent Decision Intelligence Challenge is coming and a big bonus is waiting for you!
Python
192
star
24

DI-treetensor

Let DI-treetensor help you simplify the structure processing!(树形运算一不小心就逻辑混乱?DI-treetensor快速帮你搞定)
Python
176
star
25

ACE

[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".
Python
174
star
26

awesome-AI-based-protein-design

A collection of research papers for AI-based protein design
170
star
27

Gobigger-Explore

Still struggling with the high threshold or looking for the appropriate baseline? Come here and new starters can also play with your own multi-agents!
Python
167
star
28

DI-store

OpenDILab RL Object Store
Go
164
star
29

LightTuner

Python
146
star
30

DI-bioseq

Decision Intelligence platform for Biological Sequence Searching
Python
97
star
31

DOS

[CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Python
94
star
32

DI-1024

1024 + 深度强化学习(Deep Reinforcement Learning + 1024 Game)
JavaScript
91
star
33

DI-toolkit

A simple toolkit package for opendilab
Python
89
star
34

DIgging

Decision Intelligence for digging best parameters in target environment.
Python
81
star
35

awesome-driving-behavior-prediction

A collection of research papers for Driving Behavior Prediction
63
star
36

DI-adventure

Decision Intelligence Adventure for Beginners
Python
54
star
37

SmartRefine

[CVPR 2024] SmartRefine: An Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Python
39
star
38

CodeMorpheus

CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)
Python
28
star
39

PsyDI

PsyDI: A MBTI agent that helps you understand your personality type through a relaxed multi-modal interaction.
TypeScript
25
star
40

OpenPaL

Building open-ended embodied agent in battle royale FPS game
19
star
41

huggingface_ding

Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggingface Hub with auto-created model card.
Python
17
star
42

.github

The first decision intelligence platform covering the most complete algorithms in academia and industry
10
star