Spaces: vishaljoshi24/trl-4-dnd
trl-4-dnd/examples/research_projects/stack_llama/scripts @ e2368f1
34.4 kB, 1 contributor, History: 1 commit
Latest commit: vishaljoshi24, "Initial Commit", a080fe0, about 1 month ago
README.md                 Safe   1.87 kB   Initial Commit   about 1 month ago
merge_peft_adapter.py     Safe   2.61 kB   Initial Commit   about 1 month ago
reward_modeling.py        Safe   11.9 kB   Initial Commit   about 1 month ago
rl_training.py            Safe   10.3 kB   Initial Commit   about 1 month ago
supervised_finetuning.py  Safe   7.73 kB   Initial Commit   about 1 month ago