SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 197
ABC: Achieving Better Control of Multimodal Embeddings using VLMs Paper • 2503.00329 • Published Mar 1 • 19
SwarmFormer Collection Our collection of our frontier SwarmFormer architecture models. • 2 items • Updated Jan 24 • 3
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22, 2024 • 16
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published Nov 27, 2024 • 59
Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 24
T3M: Text Guided 3D Human Motion Synthesis from Speech Paper • 2408.12885 • Published Aug 23, 2024 • 13
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 127
Article WWDC 24: Running Mistral 7B with Core ML By FL33TW00D-HF and 3 others • Jul 22, 2024 • 61
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks Paper • 2407.02855 • Published Jul 3, 2024 • 13
Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 35