Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
vishaljoshi24
/
trl-4-dnd
like
0
Paused
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
trl-4-dnd
/
examples
/
research_projects
/
stack_llama
/
scripts
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
vishaljoshi24
Initial Commit
a080fe0
11 days ago
README.md
Safe
1.87 kB
Initial Commit
11 days ago
merge_peft_adapter.py
Safe
2.61 kB
Initial Commit
11 days ago
reward_modeling.py
Safe
11.9 kB
Initial Commit
11 days ago
rl_training.py
Safe
10.3 kB
Initial Commit
11 days ago
supervised_finetuning.py
Safe
7.73 kB
Initial Commit
11 days ago