Shengyi Costa Huang

vwxyzjn

AI & ML interests

None yet

Recent Activity

updated a dataset 8 days ago
vwxyzjn/the-algorithm-python
updated a dataset 8 days ago
vwxyzjn/rlvr_acecoder
published a model 13 days ago
allenai/OLMo-2-0425-1B-DPO
View all activity

Organizations

Ai2's profile picture cleanrl's profile picture lm-human-preference-details's profile picture ICML2023's profile picture Brrr Gang's profile picture AI2 Adapt Dev's profile picture Dev Mode Explorers's profile picture OLMoE's profile picture

vwxyzjn's activity

published an article 10 months ago
view article
Article

How NuminaMath Won the 1st AIMO Progress Prize

By yfleureau and 7 others
120
published an article 10 months ago
view article
Article

Preference Optimization for Vision Language Models

By qgallouedec and 3 others
70
published an article 11 months ago
published an article over 1 year ago
view article
Article

Constitutional AI with Open LLMs

By vwxyzjn and 6 others
13
published an article over 1 year ago
view article
Article

The N Implementation Details of RLHF with PPO

By vwxyzjn and 2 others
52