SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 181
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published Mar 7 • 123
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Mar 4 • 74
Cohere Labs Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 28 days ago • 67
CHASE Collection Generate challenging synthetic data to evaluate LLMs • 5 items • Updated Feb 21 • 4
How to Get Your LLM to Generate Challenging Problems for Evaluation Paper • 2502.14678 • Published Feb 20 • 17
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19 • 34
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions Paper • 2502.13791 • Published Feb 19 • 5
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 121
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published Jan 13 • 98
METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring Paper • 2501.02045 • Published Jan 3 • 21
EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Paper • 2501.01895 • Published Jan 3 • 56
LiveBench: A Challenging, Contamination-Free LLM Benchmark Paper • 2406.19314 • Published Jun 27, 2024 • 23
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 83
PepTune: De Novo Generation of Therapeutic Peptides with Multi-Objective-Guided Discrete Diffusion Paper • 2412.17780 • Published Dec 23, 2024 • 5
Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 9
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Feb 13 • 86