yichaodu commited on
Commit
b6bbe3e
·
verified ·
1 Parent(s): 3ba8654

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -7,7 +7,7 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- [Project page](https://mj-bench.github.io/)
11
 
12
  Multimodal judges reward models play a pivotal role in Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from AI Feedback (RLAIF). They serve as judges, providing crucial feedback to align foundation models (FMs) with desired behaviors. However, the evaluation of these multimodal judges often lacks thoroughness, leading to potential misalignment and unsafe fine-tuning outcomes.
13
 
 
7
  pinned: false
8
  ---
9
 
10
+ [Project page](https://mj-bench.github.io/) [Github](https://github.com/MJ-Bench)
11
 
12
  Multimodal judges reward models play a pivotal role in Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from AI Feedback (RLAIF). They serve as judges, providing crucial feedback to align foundation models (FMs) with desired behaviors. However, the evaluation of these multimodal judges often lacks thoroughness, leading to potential misalignment and unsafe fine-tuning outcomes.
13