PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning Paper ⢠2507.06415 ⢠Published Jul 8 ⢠6
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper ⢠2507.05687 ⢠Published Jul 8 ⢠26
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality Paper ⢠2507.07202 ⢠Published Jul 9 ⢠22
LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS Paper ⢠2507.07136 ⢠Published Jul 9 ⢠35
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding Paper ⢠2507.07984 ⢠Published Jul 10 ⢠40
Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs Paper ⢠2507.07990 ⢠Published Jul 10 ⢠45
view article Article Building the Hugging Face MCP Server By evalstate and 3 others ⢠Jul 10 ⢠59
RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy Paper ⢠2503.24388 ⢠Published Mar 31 ⢠31
TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization Paper ⢠2503.19901 ⢠Published Mar 25 ⢠41
Efficient Inference for Large Reasoning Models: A Survey Paper ⢠2503.23077 ⢠Published Mar 29 ⢠47
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Paper ⢠2503.24290 ⢠Published Mar 31 ⢠63
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper ⢠2503.23461 ⢠Published Mar 30 ⢠95
MoCha: Towards Movie-Grade Talking Character Synthesis Paper ⢠2503.23307 ⢠Published Mar 30 ⢠138