Spaces:
Paused
Paused
# Reward Functions | |
This module contains some useful reward functions, primarily intended for use with the [`GRPOTrainer`]. | |
## Format rewards | |
### think_format_reward | |
[[autodoc]] rewards.think_format_reward | |