Fan Zhou's picture

Fan Zhou

koalazf99

·

https://koalazf99.github.io/

AI & ML interests

Deep Learning; Natural Language Processing; Foundation Models

Recent Activity

upvoted a paper 11 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

upvoted a paper 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

upvoted a paper 2 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

View all activity

Organizations

upvoted a paper 11 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 12 days ago • 48

upvoted 2 papers 2 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22, 2025 • 19

upvoted a paper 4 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 79

authored a paper 6 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 48

New activity in OctoThinker/MegaMath-Web-Pro-Max 6 months ago

[bot] Conversion to Parquet

#3 opened 6 months ago by

parquet-converter

liked a dataset 6 months ago

OctoThinker/MegaMath-Web-Pro-Max

Viewer • Updated Jul 6, 2025 • 69.2M • 23.9k • 36

updated a collection 6 months ago

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling • 18 items • Updated Jun 26, 2025 • 2

upvoted a paper 6 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 48

updated a collection 6 months ago

🐙 OctoThinker

Mid-training Incentivizes Reinforcement Learning Scaling • 18 items • Updated Jun 26, 2025 • 2

updated a collection 7 months ago

🧙 Guru

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective • 4 items • Updated Jun 20, 2025

authored a paper 7 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 49

upvoted a paper 7 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17, 2025 • 49

liked 2 datasets 7 months ago

princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18, 2025 • 500 • 578k • 243

LLM360/guru-RL-92k

Viewer • Updated Aug 20, 2025 • 91.9k • 1.94k • 42

upvoted 2 papers 7 months ago

Thinking with Generated Images

Paper • 2505.22525 • Published May 28, 2025 • 15

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104