Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
vishaljoshi24
/
trl-4-dnd
like
0
Paused
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
trl-4-dnd
/
trl
/
scripts
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
vishaljoshi24
Initial Commit
a080fe0
10 days ago
__init__.py
Safe
1 kB
Initial Commit
10 days ago
dpo.py
Safe
5.31 kB
Initial Commit
10 days ago
env.py
Safe
3.68 kB
Initial Commit
10 days ago
grpo.py
Safe
5.28 kB
Initial Commit
10 days ago
kto.py
Safe
4.19 kB
Initial Commit
10 days ago
sft.py
Safe
5.39 kB
Initial Commit
10 days ago
utils.py
Safe
11.3 kB
Initial Commit
10 days ago
vllm_serve.py
Safe
29 kB
Initial Commit
10 days ago