Hugging Face
ZiYi Yang
AALF
23 followers · 9 following
https://github.com/yangzy39
yangzy39
AI & ML interests
None yet
Recent Activity
Authored a paper 10 days ago: ThinkSwitcher: When to Think Hard, When to Think Fast
Authored a paper 10 days ago: Mutual-Taught for Co-adapting Policy and Reward Models
Authored a paper 10 days ago: FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion
Organizations
AALF
Datasets: 1
AALF/ultrafeedback_wrpo · Updated Feb 28 · 59.9k · 36