Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

bigscience-catalogue-data-dev
/
byte-level-bpe-tokenizer-no-norm-250k-whitespace-and-eos-regex-alpha-v3-dedup-lines-articles

Model card Files Files and versions Community
byte-level-bpe-tokenizer-no-norm-250k-whitespace-and-eos-regex-alpha-v3-dedup-lines-articles
Ctrl+K
Ctrl+K
  • 2 contributors
History: 3 commits
SaulLu's picture
SaulLu
Create README.md
91b871b about 3 years ago
  • .gitattributes
    1.23 kB
    Add tokenizer about 3 years ago
  • README.md
    565 Bytes
    Create README.md about 3 years ago
  • special_tokens_map.json
    85 Bytes
    Add tokenizer about 3 years ago
  • tokenizer.json
    14.5 MB
    LFS
    Add tokenizer about 3 years ago
  • tokenizer_config.json
    131 Bytes
    Add tokenizer about 3 years ago