Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Nanotron Research

community
Activity Feed Request to join this org

AI & ML interests

Large scale distributed AI model training, model parallelisation, low-level GPU acceleration, make GPUs go brrrrr

Recent Activity

thomwolf  authored a paper about 19 hours ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
lvwerra  authored a paper about 19 hours ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
hynky  authored a paper about 19 hours ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
View all activity

Thomas Wolf's profile picture Nouamane Tazi's profile picture Loubna Ben Allal's profile picture Ferdinand Mom's profile picture neuralink's profile picture Nathan Habib's profile picture Leandro von Werra's profile picture Guilherme Penedo's profile picture Hynek Kydlicek's profile picture Elie Bakouch's profile picture Haojun Zhao's profile picture Mohamed Mekkouri's profile picture

nanotron 's Spaces 2

pinned
Running
2.72k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

May 8
pinned
Running
74

Predict Memory

🧮

Calculate memory usage from model configurations

Mar 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs