il-pugin/hse-prog-task-transformer-reward-model
Tags: Reinforcement Learning, Transformers, Safetensors, reward_model, code
Dataset: il-pugin/hse-prog-task-transformer-feedback
License: mit
README.md exists but content is empty.
Downloads last month: 3
Safetensors
Model size: 7.5B params
Tensor type: F32, BF16
Chat template
Model tree for il-pugin/hse-prog-task-transformer-reward-model
Base model: sfairXC/FsfairX-LLaMA3-RM-v0.1
Finetuned (2): this model
Dataset used to train il-pugin/hse-prog-task-transformer-reward-model
il-pugin/hse-prog-task-transformer-feedback
Updated May 26 • 1.57k • 5