arth-shukla
commited on
Commit
·
ed283ef
1
Parent(s):
d79b5ae
readme
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ viewer: false
|
|
13 |
| **[Dataset](https://arth-shukla.github.io/mshab/#dataset-section)**
|
14 |
| **[Supplementary](https://sites.google.com/view/maniskill-hab)**
|
15 |
|
16 |
-
RL (SAC, PPO) and IL (BC) baselines for ManiSkill-HAB. Each checkpoint includes a torch checkpoint `policy.pt` (model, optimizer/scheduler state, other trainable parameters) and a train config `config.yml` with hyperparemeters and env kwargs.
|
17 |
|
18 |
RL Pick/Place policies are trained using SAC due to improved performance, while Open/Close is trained with PPO for wall-time efficiency (see Appendix A.4.3). All-object RL policies are under `all/` directories, while per-object policies are under directories labeled by the object name. IL policies do not have per-object Pick/Place variants.
|
19 |
|
|
|
13 |
| **[Dataset](https://arth-shukla.github.io/mshab/#dataset-section)**
|
14 |
| **[Supplementary](https://sites.google.com/view/maniskill-hab)**
|
15 |
|
16 |
+
RL (SAC, PPO) and IL (BC, DP) baselines for ManiSkill-HAB. Each checkpoint includes a torch checkpoint `policy.pt` (model, optimizer/scheduler state, other trainable parameters) and a train config `config.yml` with hyperparemeters and env kwargs.
|
17 |
|
18 |
RL Pick/Place policies are trained using SAC due to improved performance, while Open/Close is trained with PPO for wall-time efficiency (see Appendix A.4.3). All-object RL policies are under `all/` directories, while per-object policies are under directories labeled by the object name. IL policies do not have per-object Pick/Place variants.
|
19 |
|