Zhiyuan Hu

zhiyuanhucs

Recent Activity

authored a paper about 1 month ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

authored a paper about 1 month ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper about 1 month ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Papers 6

arxiv:2601.09667

arxiv:2601.08763

arxiv:2505.10554

arxiv:2504.00050

Spaces 1

Meta Ability

Enhance reasoning in large models by aligning meta-abilities

Models 164

zhiyuanhucs/creativitty-self-classifier-med-lastest-step175

8B • Updated Nov 13, 2025

zhiyuanhucs/creativitty-self-classifier-med-lastest-step120

8B • Updated Nov 13, 2025

zhiyuanhucs/creativitty-self-classifier-med-lastest-step85

8B • Updated Nov 13, 2025 • 2

zhiyuanhucs/creativitty-self-classifier-med-lastest-step45

8B • Updated Nov 13, 2025

zhiyuanhucs/creativitty-self-classifier-med-lastest-step20

8B • Updated Nov 13, 2025 • 2

zhiyuanhucs/creativitty-self-classifier-med-24Sep-step20

8B • Updated Sep 26, 2025 • 2

zhiyuanhucs/creativitty-self-classifier-physics-simplr-10Sep-step130

8B • Updated Sep 16, 2025 • 1

zhiyuanhucs/creativitty-self-classifier-physics-simplr-10Sep-step120

8B • Updated Sep 16, 2025 • 1

zhiyuanhucs/creativitty-self-classifier-physics-simplr-10Sep-step110

8B • Updated Sep 16, 2025

zhiyuanhucs/creativitty-self-classifier-physics-simplr-10Sep-step100

8B • Updated Sep 16, 2025 • 1

Datasets 5

zhiyuanhucs/amc_23-24-25

Viewer • Updated Aug 18, 2025 • 115

zhiyuanhucs/aime24_25

Viewer • Updated Aug 17, 2025 • 60 • 190

zhiyuanhucs/amc23_dup32

Viewer • Updated May 7, 2025 • 1.28k • 7

zhiyuanhucs/AIME24_dup32

Viewer • Updated May 7, 2025 • 960 • 2

zhiyuanhucs/AIME_1983_2024

Viewer • Updated Mar 3, 2025 • 933 • 16