license: apache-2.0 | |
<div align="center"> | |
<a href="https://github.com/opendilab/PPOxFamily"><img width="500px" height="auto" src="./ppof_logo.png"></a> | |
</div> | |
# PPO x Family 决策智能入门公开课 | |
欢迎来到 [**PPO x Family**](https://github.com/opendilab/PPOxFamily) 系列决策智能入门公开课。该系列将深入理解深度强化学习算法 PPO ,灵活运用**一个 PPO 算法**解决几乎**所有常见的决策智能应用** ,帮助一切对于深度强化学习技术有好奇心的人,轻便且高效地制作应用原型,了解和学习最强大最易用的 PPO Family 。 | |
# NEWS | |
- 2025.03.14: 此存储库用于为[**PPO x Family**](https://github.com/opendilab/PPOxFamily)提供课程作业数据集及其他相关附加材料 | |
# File Structure | |
. <br> | |
├── README.md <br> | |
├── asserts <br> | |
└── ppof_logo.png <br> | |
├── ppof_ch4_code_p1.py [[1]](https://github.com/opendilab/PPOxFamily/blob/main/chapter4_reward/chapter4_hw_solution.pdf)<br> | |
├── ppof_ch4_data_lunarlander.pkl [[1]](https://github.com/opendilab/PPOxFamily/blob/main/chapter4_reward/popart.py) [[2]](https://github.com/opendilab/PPOxFamily/blob/main/chapter4_reward/popart_zh.py)<br> | |
├── ppof_ch4_data_p1.zip [[1]](https://github.com/opendilab/PPOxFamily/blob/main/chapter4_reward/chapter4_hw_solution.pdf)<br> | |
├── ppof_ch5_code_p1.py [[1]](https://github.com/opendilab/PPOxFamily/blob/main/chapter5_time/chapter5_hw_solution.pdf)<br> | |
├── ppof_ch6_code_p1.py [[1]](https://github.com/opendilab/PPOxFamily/blob/main/chapter6_marl/chapter6_hw_solution.pdf)<br> | |
└── ppof_ch7_code_p1.py[[1]](https://github.com/opendilab/PPOxFamily/blob/main/chapter7_tricks/chapter7_hw_solution.pdf)<br> |