Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
EssentialAI 's Collections
Essential-Web v1.0
Rethinking Reflection in Pre-Training

Essential-Web v1.0

updated Jun 18
Upvote
8

  • Essential-Web v1.0: 24T tokens of organized web data

    Paper • 2506.14111 • Published Jun 17 • 43

  • EssentialAI/essential-web-v1.0

    Preview • Updated Jun 22 • 36k • 196

  • EssentialAI/eai-distill-0.5b

    0.6B • Updated Jun 18 • 1.46k • 22

  • EssentialAI/eai-taxonomy-math-w-fm

    Viewer • Updated Jun 22 • 21.6M • 3.28k • 6

  • EssentialAI/eai-taxonomy-code-w-dclm

    Viewer • Updated Jun 22 • 274M • 6.81k • 7

  • EssentialAI/eai-taxonomy-code-w-dclm-100b-sample

    Viewer • Updated Jun 22 • 46.2M • 565 • 2

  • EssentialAI/eai-taxonomy-med-w-dclm

    Viewer • Updated Jun 22 • 81.2M • 3.51k • 8

  • EssentialAI/eai-taxonomy-med-w-dclm-100b-sample

    Viewer • Updated Jun 22 • 36.6M • 1.82k • 2

  • EssentialAI/eai-taxonomy-stem-w-dclm

    Preview • Updated Jun 22 • 4.7k • 5

  • EssentialAI/eai-taxonomy-stem-w-dclm-100b-sample

    Viewer • Updated Jun 22 • 35.5M • 2.57k • 4
Upvote
8
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs