Hugging Face
thejaminator/grpo-feature-vector-step-55
Tags: Text Generation, PEFT, Safetensors, lora
main
1 contributor
History: 2 commits
Latest commit: dd18903 (verified) by thejaminator, about 21 hours ago: verl GRPO trained model at step 55
File                       Size       Flags  Last commit                         Updated
.gitattributes             1.52 kB    Safe   initial commit                      about 21 hours ago
README.md                  139 Bytes  -      verl GRPO trained model at step 55  about 21 hours ago
adapter_config.json        1.03 kB    -      verl GRPO trained model at step 55  about 21 hours ago
adapter_model.safetensors  864 MB     LFS    verl GRPO trained model at step 55  about 21 hours ago