Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Vinko Sabolcec's picture
17 1 2

Vinko Sabolcec

vsabolcec
21world's profile picture NXz64Fdf8Y's profile picture nataliaElv's profile picture
·

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
updated a dataset 6 months ago
epfml/FineWeb2-embedded
updated a dataset 6 months ago
epfml/FineWeb2-HQ
View all activity

Organizations

EPFL Machine Learning and Optimization Laboratory's profile picture FineData's profile picture mlo-data-cleaning's profile picture HuggingFaceFW-Dev's profile picture mlo-data-collab's profile picture mlo-mhq's profile picture

authored a paper about 2 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 65
updated 5 datasets 6 months ago

epfml/FineWeb2-embedded

Viewer • Updated Feb 19 • 3.98B • 930 • 4

epfml/FineWeb2-HQ

Viewer • Updated Feb 19 • 380M • 15.7k • 20

epfml/FineWeb2-HQ

Viewer • Updated Feb 19 • 380M • 15.7k • 20

epfml/FineWeb2-embedded

Viewer • Updated Feb 19 • 3.98B • 930 • 4

epfml/FineWeb2-embedded

Viewer • Updated Feb 19 • 3.98B • 930 • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs