ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning (arXiv:2512.05111, published Dec 4, 2025)
MM-ACT: Learn from Multimodal Parallel Generation to Act (arXiv:2512.00975, published Nov 30, 2025)
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy (arXiv:2510.13778, published Oct 15, 2025)
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models (arXiv:2510.11341, published Oct 13, 2025)
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use (arXiv:2509.24002, published Sep 28, 2025)
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions (arXiv:2509.06951, published Sep 8, 2025)
The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner (arXiv:2507.13332, published Jul 17, 2025)
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics (arXiv:2506.01844, published Jun 2, 2025)
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning (arXiv:2503.07365, published Mar 10, 2025)
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference (arXiv:2502.18411, published Feb 25, 2025)