StarCoder2 Data
community
https://www.bigcode-project.org/
AI & ML interests: None defined yet.
Recent Activity
lvwerra authored a paper 2 days ago: "FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language"
hynky authored a paper 2 days ago: "FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language"
loubnabnl authored a paper 22 days ago: "The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text"
Team members: 18
starcoder2data's datasets: None public yet