Spaces: ivangabriele/trl-sandbox
Paused
main
trl-sandbox/trl/scripts
1 contributor
History: 1 commit
ivangabriele: feat: initialize project (2f5127c, verified), 14 days ago
File            Status  Size      Last commit               Updated
__init__.py     Safe    1 kB      feat: initialize project  14 days ago
dpo.py          Safe    5.23 kB   feat: initialize project  14 days ago
env.py          Safe    3.58 kB   feat: initialize project  14 days ago
grpo.py         Safe    5.17 kB   feat: initialize project  14 days ago
kto.py          Safe    4.12 kB   feat: initialize project  14 days ago
sft.py          Safe    5.25 kB   feat: initialize project  14 days ago
utils.py        Safe    11.3 kB   feat: initialize project  14 days ago
vllm_serve.py   Safe    24.5 kB   feat: initialize project  14 days ago