DataComp
non-profit
https://www.datacomp.ai/dclm/index.html#home
Recent Activity
thomwolf authored a paper about 23 hours ago:
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
thaottn authored a paper about 23 hours ago:
Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models
MasterVito authored a paper 5 days ago:
TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression
Team members: 88
models
0
None public yet
datasets
0
None public yet