Submitted by CNcreator0331 76 LongAnimation: Long Animation Generation with Dynamic Global-Local Memory · 4 authors 215 10
Submitted by Yifan-Zhong 36 A Survey on Vision-Language-Action Models: An Action Tokenization Perspective · 14 authors 182 1
Submitted by zhuoyang20 20 Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation · 7 authors 66 1
Submitted by yukangcao 19 FreeMorph: Tuning-Free Generalized Image Morphing with Diffusion Model · 4 authors 66 1
Submitted by SiyouLi 15 μ²Tokenizer: Differentiable Multi-Scale Multi-Modal Tokenizer for Radiology Report Generation · 7 authors 261 1
Submitted by multimodalart 11 ARIG: Autoregressive Interactive Head Generation for Real-time Conversations · 5 authors 1
Submitted by shash42 8 Answer Matching Outperforms Multiple Choice for Language Model Evaluation · 5 authors 2
Submitted by jslee525 5 STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing · 3 authors 7 1