sunyuhan

yuuhan

sunyuhan19981208

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published 28 days ago • 113

upvoted 6 papers about 1 month ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 112

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 283

upvoted a paper 3 months ago

BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese

Paper • 2504.19314 • Published Apr 27, 2025 • 7

upvoted a paper 4 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

upvoted a paper 5 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4, 2025 • 268

upvoted an article 6 months ago

Mixture of Experts Explained

Article • Dec 11, 2023 • 1.03k

upvoted 2 papers 7 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 103

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143

upvoted a paper 8 months ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published Apr 22, 2025 • 64

upvoted an article 11 months ago

Open R1: Update #2

Article • Feb 10, 2025 • 218

upvoted a collection over 1 year ago

GLM-4

Collection

GLM-4 Open Models • 14 items • Updated Jun 30, 2025 • 125