view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware By RakshitAralimatti • 6 days ago • 5
The Well Collection A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24 • 29
view article Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel By smangrul and 1 other • May 2, 2022 • 7
view article Article Fine-tuning Llama 2 70B using PyTorch FSDP By smangrul and 3 others • Sep 13, 2023 • 29
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 7 days ago • 278