Article Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training By siro1 and 4 others • 7 days ago • 44
Article 🇵🇭 FilBench - Can LLMs Understand and Generate Filipino? By ljvmiranda921 and 8 others • 3 days ago • 7
Article Build an AI Shopping Assistant with Gradio MCP Servers By freddyaboulton • 15 days ago • 46
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 17 days ago • 153
Article TextQuests: How Good are LLMs at Text-Based Video Games? By justinphan3110 and 1 other • 3 days ago • 15
👁️ LFM2-VL Collection LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 2 items • Updated 2 days ago • 18
Article Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • 2 days ago • 10
RP3D-Diag Collection [Nature Communications 2024] Large-scale Long-tailed Disease Diagnosis on Radiology Images • 4 items • Updated 2 days ago • 2
DiagRL Collection Includes the data and model checkpoint for our paper, DiagRL • 2 items • Updated 2 days ago • 2
SmolLM3 pretraining datasets Collection Datasets used in SmolLM3 pretraining • 15 items • Updated 2 days ago • 25
Article Bridging the Gap: Making Robotics Feel Like Machine Learning By hba123 • 2 days ago • 10
Open-R1 Reproduce Collection Reproduce DeepSeek distilled models based on open-r1. • 4 items • Updated Mar 15 • 1
Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • 6 days ago • 11
Article OpenAI just dropped two massive open-weight models, but how do we separate the reality from the hype? By stefanwebb and 2 others • 5 days ago • 10