Running 3.05k 3.05k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
Running 1.03k 1.03k FineWeb: decanting the web for the finest text data at scale ๐ท Generate high-quality web text data for LLM training
Running 143 143 Qwen 2.5 Code Interpreter ๐ Execute code snippets through natural language prompts
HF1BitLLM/Llama3-8B-1.58-100B-tokens Text Generation โข 3B โข Updated Sep 19, 2024 โข 1.66k โข 192