BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese • Paper • 2504.19314 • Published 16 days ago • 4 • 2
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning • Paper • 2505.02363 • Published 9 days ago • 6 • 2
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes • Paper • 2505.05288 • Published 5 days ago • 11 • 2
Vision-Language-Action Models: Concepts, Progress, Applications and Challenges • Paper • 2505.04769 • Published 6 days ago • 7 • 2
FG-CLIP: Fine-Grained Visual and Textual Alignment • Paper • 2505.05071 • Published 6 days ago • 16 • 2
X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains • Paper • 2505.03981 • Published 7 days ago • 14 • 3
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models • Paper • 2505.04921 • Published 6 days ago • 127 • 3
On Path to Multimodal Generalist: General-Level and General-Bench • Paper • 2505.04620 • Published 6 days ago • 71 • 8
ICon: In-Context Contribution for Automatic Data Selection • Paper • 2505.05327 • Published 5 days ago • 11 • 2
StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant • Paper • 2505.05467 • Published 5 days ago • 13 • 2
WaterDrum: Watermarking for Data-centric Unlearning Metric • Paper • 2505.05064 • Published 6 days ago • 8 • 2
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models • Paper • 2505.02847 • Published 12 days ago • 24 • 4
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers • Paper • 2505.04842 • Published 6 days ago • 12 • 3