arxiv:2602.21818
Peiyu Wang PRO
OrlandoHugBot
AI & ML interests
LLM/MLLM/Agent
Recent Activity
liked a dataset 2 days ago
SWE-bench/SWE-smith commentedon a paper 20 days ago
$ฯ$-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows upvoted a paper 20 days ago
ฯ-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows