Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
thejaminator
/
grpo-feature-vector-step-1
like
0
PEFT
Safetensors
English
verl
grpo
math
reasoning
rl
lora
License:
apache-2.0
Model card
Files
Files and versions
Community
Use this model
main
grpo-feature-vector-step-1
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
thejaminator
verl GRPO trained model at step 1
1c54e58
verified
9 days ago
.gitattributes
Safe
1.52 kB
initial commit
9 days ago
README.md
735 Bytes
verl GRPO trained model at step 1
9 days ago
adapter_config.json
1.1 kB
verl GRPO trained model at step 1
9 days ago
adapter_model.safetensors
864 MB
LFS
verl GRPO trained model at step 1
9 days ago