Zhaolin Gao

GitBag

https://zhaolingao.github.io/

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

updated a dataset about 1 month ago

GitBag/deepscaler-Qwen3-8B-Base-4096-n-16

updated a dataset about 1 month ago

GitBag/deepscaler-Qwen3-4B-Base-4096-n-16

updated a dataset about 1 month ago

GitBag/deepscaler-Qwen3-1.7B-Base-4096-n-16

published an article 10 months ago

Article

RLHF 101: A Technical Dive into RLHF

Dec 11, 2024
