Yuzhe Gu

vanilla1116

https://guyuzhe.site/

Liqu1d-G

AI & ML interests

LLM; Reasoning; Hallucination; Self-Improvement

Recent Activity

commented on a paper about 2 months ago

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

authored a paper about 2 months ago

CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

authored a paper about 2 months ago

Intern-S1: A Scientific Multimodal Foundation Model

commented on a paper about 2 months ago

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published Dec 11, 2025 • 47

commented on a paper 6 months ago

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published Jul 22, 2025 • 21

commented on a paper 7 months ago

The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner

Paper • 2507.13332 • Published Jul 17, 2025 • 49

commented on a paper 10 months ago

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31, 2025 • 29

New activity in opencompass/anah 11 months ago

Update dataset card, link to paper, add category

#2 opened 11 months ago

New activity in opencompass/anah-7b 11 months ago

Add missing metadata and clarify license

#1 opened 11 months ago

New activity in opencompass/anah-20b 11 months ago

Add missing metadata: `pipeline_tag`, `library_name`, and `license`

#1 opened 11 months ago

New activity in opencompass/anah-v2 11 months ago

Improve model card with library_name and pipeline_tag

#1 opened 11 months ago

commented on a paper 11 months ago

Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs

Paper • 2503.02846 • Published Mar 4, 2025 • 19

commented on a paper 12 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10, 2025 • 58

New activity in opencompass/anah over 1 year ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by parquet-converter

commented on 2 papers over 1 year ago

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

Paper • 2407.04693 • Published Jul 5, 2024 • 3

ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models

Paper • 2407.04693 • Published Jul 5, 2024 • 3