Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

datablations

https://github.com/huggingface/datablations
Activity Feed Request to join this org

AI & ML interests

Scaling Data-Constrained Language Models

Recent Activity

thomwolf  authored a paper 1 day ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
craffel  authored a paper 1 day ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
craffel  authored a paper 21 days ago
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
View all activity

Niklas Muennighoff's profile picture Teven Le Scao's profile picture Nouamane Tazi's profile picture Risto Luukkonen's profile picture Aleksandra Piktus's profile picture Sampo Pyysalo's profile picture Colin Raffel's profile picture Thomas Wolf's profile picture Sasha Rush's profile picture

datablations 's models 38

datablations/lm1-1b1-21b-c4-repetitions

Updated Apr 24, 2023

datablations/lm1-2b8-55b-realtasky

Updated Mar 21, 2023

datablations/lm1-1b1-21b-c4seeds

Updated Mar 21, 2023

datablations/lm1-8b7-176b-c4-ckpts

Updated Feb 28, 2023

datablations/lm1-2b8-55b-oscarpy

Updated Feb 27, 2023

datablations/lm1-2b8-55b-c4py

Updated Feb 13, 2023

datablations/lm1-1b1-21b-c4

Updated Dec 21, 2022

datablations/lm1-83m-20b-nowarmup

Updated Nov 24, 2022
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs