BigCode Data
non-profit
BigCodeProject
bigcode-project
Followers: 28
AI & ML interests
None defined yet.
Recent Activity
thomwolf authored a paper about 17 hours ago: FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
lvwerra authored a paper about 17 hours ago: FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
loubnabnl authored a paper 21 days ago: The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
Team members: 16
Models: 0 (none public yet)
Datasets: 1
bigcode-data/license_list • Updated Oct 18, 2023 • 824 • 26