CodeGoat24
/

UnifiedReward-7b

Model card Files Files and versions Community

CodeGoat24 commited on Mar 10

Commit

e0a8877

·

verified ·

1 Parent(s): 72774ca

Update README.md

Files changed (1) hide show

README.md +7 -2

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ base_model:
 `Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
 For further details, please refer to the following resources:
-- 📰 Paper:
 - 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/
 - 🤗 Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
 - 🤗 Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
@@ -95,5 +95,10 @@ print(text_outputs[0])
 ## Citation
 ```
 ```

 `Unified-Reward-7b` is the first unified reward model for multimodal understanding and generation assessment, enabling both pairwise ranking and pointwise scoring, which can be employed for vision model preference alignment.
 For further details, please refer to the following resources:
+- 📰 Paper: https://arxiv.org/pdf/2503.05236
 - 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/
 - 🤗 Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
 - 🤗 Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
 ## Citation
 ```
+@article{UnifiedReward,
+  title={Unified Reward Model for Multimodal Understanding and Generation.},
+  author={Wang, Yibin and Zang, Yuhang, and Li, Hao and Jin, Cheng and Wang Jiaqi},
+  journal={arXiv preprint arXiv:2503.05236},
+  year={2025}
+}
 ```