Hugging Face
Andrew Zhao
andrewzh
49 followers · 3 following
https://andrewzh112.github.io/
_AndrewZhao
Andrewzh112
andrewqzhao
AI & ML interests
Reinforcement Learning, Agents
Recent Activity
Upvoted a paper 14 days ago: A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Upvoted a paper about 1 month ago: SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Upvoted a paper 2 months ago: Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Organizations
None yet
andrewzh's models (3)
andrewzh/Absolute_Zero_Reasoner-Coder-14b • 15B • Updated May 6 • 311 • 28
andrewzh/Absolute_Zero_Reasoner-Coder-3b • 3B • Updated May 6 • 342 • 11
andrewzh/Absolute_Zero_Reasoner-Coder-7b • 8B • Updated May 5 • 992 • 18