Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DataComp
non-profit
https://www.datacomp.ai/dclm/index.html#home
Activity Feed
Follow
93
AI & ML interests
None defined yet.
Recent Activity
thomwolf
authored
a paper
1 day ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
thaottn
authored
a paper
1 day ago
Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
MasterVito
authored
a paper
5 days ago
TL;DR: Too Long, Do Re-weighting for Effcient LLM Reasoning Compression
View all activity
Team members
88
+54
+41
+20
+10
dclm
's models
None public yet