10 32 13

Wen Wang

wwen1997

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

upvoted a paper 1 day ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

commented on a paper 1 day ago

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

View all activity

Organizations

upvoted 2 papers 1 day ago

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published 7 days ago • 18

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published 2 days ago • 30

upvoted a paper 9 days ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published 10 days ago • 181

upvoted a paper 14 days ago

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Paper • 2507.22886 • Published 15 days ago • 9

upvoted a paper 18 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 21 days ago • 286

upvoted 2 papers 20 days ago

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published 24 days ago • 34

TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation

Paper • 2507.18537 • Published 21 days ago • 17

upvoted a paper 23 days ago

GUI-G^2: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published 24 days ago • 130

upvoted a paper 25 days ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published 28 days ago • 64

upvoted an article about 1 month ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

and 1 other •

Jul 9

• 643

upvoted a paper about 1 month ago

Calligrapher: Freestyle Text Image Customization

Paper • 2506.24123 • Published Jun 30 • 36

upvoted 2 papers about 2 months ago

Unified Vision-Language-Action Model

Paper • 2506.19850 • Published Jun 24 • 27

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 74

upvoted 3 papers 2 months ago

InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions

Paper • 2506.09984 • Published Jun 11 • 15

Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting

Paper • 2506.05327 • Published Jun 5 • 11

SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation

Paper • 2506.03139 • Published Jun 3 • 16

upvoted 2 papers 3 months ago

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Paper • 2505.21457 • Published May 27 • 14

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Paper • 2505.20256 • Published May 26 • 17

upvoted a collection 3 months ago

Deepseek Papers

Collection

Deepseek papers collection • 24 items • Updated 10 days ago • 268

upvoted a paper 5 months ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 88

Wen Wang

AI & ML interests

Recent Activity

Organizations

wwen1997's activity

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders