ZiYi Yang's picture

4 14 8

ZiYi Yang

AALF

·

https://github.com/yangzy39

yangzy39

AI & ML interests

None yet

Recent Activity

authored a paper 10 days ago

ThinkSwitcher: When to Think Hard, When to Think Fast

authored a paper 10 days ago

Mutual-Taught for Co-adapting Policy and Reward Models

authored a paper 10 days ago

FuseRL: Dense Preference Optimization for Heterogeneous Model Fusion

View all activity

Organizations

AALF 's datasets 1

AALF/ultrafeedback_wrpo

Viewer • Updated Feb 28 • 59.9k • 36