IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards Paper • 2508.04632 • Published 8 days ago • 2
Sel3DCraft: Interactive Visual Prompts for User-Friendly Text-to-3D Generation Paper • 2508.00428 • Published 13 days ago • 3
Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR Paper • 2504.11101 • Published Apr 15
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents Paper • 2507.19478 • Published 20 days ago • 29
TextCenGen: Attention-Guided Text-Centric Background Adaptation for Text-to-Image Generation Paper • 2404.11824 • Published Apr 18, 2024 • 1
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows Paper • 2505.19897 • Published May 26 • 103
CritiQ: Mining Data Quality Criteria from Human Preferences Paper • 2502.19279 • Published Feb 26 • 10
Breaking the Data Barrier -- Building GUI Agents Through Task Generalization Paper • 2504.10127 • Published Apr 14 • 17
SEAGraph: Unveiling the Whole Story of Paper Review Comments Paper • 2412.11939 • Published Dec 16, 2024 • 1
RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning Paper • 2408.03195 • Published Aug 6, 2024
Let's Be Self-generated via Step by Step: A Curriculum Learning Approach to Automated Reasoning with Large Language Models Paper • 2410.21728 • Published Oct 29, 2024
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 88
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published Oct 30, 2024 • 51
Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis Paper • 2407.12857 • Published Jul 9, 2024 • 1
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 47
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models Paper • 2309.09958 • Published Sep 18, 2023 • 19