view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 9 days ago • 448
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation Paper • 2505.10238 • Published May 15 • 10
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution Paper • 2505.00497 • Published May 1 • 17
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 222
One Shot, One Talk: Whole-body Talking Avatar from a Single Image Paper • 2412.01106 • Published Dec 2, 2024 • 22
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers Paper • 2506.00830 • Published Jun 1 • 7
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation Paper • 2504.02542 • Published Apr 3 • 49
view article Article The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models By pminervini and 5 others • Jan 29, 2024 • 30
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 133
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation Paper • 2406.19251 • Published Jun 27, 2024 • 9
LLaVA++ (LLaMA-3 and Phi-3-Mini) Collection Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 • 11 items • Updated Jun 11, 2024 • 23