hiroshi-matsuda-rit commited on
Commit
2df511e
·
1 Parent(s): ddb7b9a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -1,3 +1,18 @@
1
  ---
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  ---
4
+ The Hugging Face fast tokenizer for LLM-jp ABCI challenge 2023.
5
+
6
+ The vocab size is 96,868.
7
+
8
+ Requirements:
9
+ - transformers>=4.34.0
10
+ - tokenizers>=0.14.0
11
+ - torch
12
+
13
+ Usage:
14
+ ```Python
15
+ from transformers import AutoTokenizer
16
+
17
+ tokenizer = AutoTokenizer.from_pretrained("llm-jp/hf-fast-tokenizer-v22b2")
18
+ ```