Andrew Zhao

andrewzh

https://andrewzh112.github.io/

AI & ML interests

Reinforcement Learning, Agents

Recent Activity

upvoted a paper 14 days ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

upvoted a paper about 1 month ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

upvoted a paper 2 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Organizations

None yet

andrewzh's models (3)

andrewzh/Absolute_Zero_Reasoner-Coder-14b

15B parameters • Updated May 6 • 311 downloads • 28 likes

andrewzh/Absolute_Zero_Reasoner-Coder-3b

3B parameters • Updated May 6 • 342 downloads • 11 likes

andrewzh/Absolute_Zero_Reasoner-Coder-7b

8B parameters • Updated May 5 • 992 downloads • 18 likes