lbourdois committed on
Commit 070fafc · verified · 1 Parent(s): eb0d06c

Improve language tag


Hi! Since the model is multilingual, this PR adds languages other than English to the language tag to improve how the model is referenced. Note that the README announces 29 languages but explicitly lists only 13, so I was only able to add those 13.
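
For reviewers who want to check the result, here is a minimal sketch of how the `language` list could be read back from the README front matter. The helper name `front_matter_languages` and the trimmed sample README are illustrative assumptions, not part of this PR or of any Hub tooling. It only handles the block-list form used in this change (one `- code` entry per line), not the inline `language: [a, b]` form.

```python
def front_matter_languages(readme_text: str) -> list[str]:
    """Return the entries listed under `language:` in a README's YAML front matter.

    Hypothetical helper for illustration only; not a full YAML parser.
    """
    lines = readme_text.splitlines()
    # Front matter must open with a `---` line at the very top of the file.
    if not lines or lines[0].strip() != "---":
        return []
    languages = []
    in_language_block = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            # The closing delimiter ends the front matter.
            break
        if stripped == "language:":
            in_language_block = True
        elif in_language_block and stripped.startswith("- "):
            languages.append(stripped[2:].strip())
        elif stripped:
            # Any other non-empty line (for example a new key) ends the list.
            in_language_block = False
    return languages


# Trimmed sample mirroring the front matter added in this PR.
README = """---
base_model: Qwen/Qwen2.5-32B-Instruct
tags:
- sft
license: apache-2.0
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
---

# Model Summary
"""

codes = front_matter_languages(README)
print(len(codes), codes[:3])
```

Running it on the sample should report the 13 three-letter codes in the order they appear in the front matter.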

Files changed (1)
  1. README.md +56 -42
README.md CHANGED
@@ -1,43 +1,57 @@
- ---
- base_model: Qwen/Qwen2.5-32B-Instruct
- library_name: transformers
- model_name: step-conditional-control
- tags:
- - generated_from_trainer
- - trl
- - sft
- license: apache-2.0
- ---
-
- # Model Summary
-
- - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
- - **Paper:** https://arxiv.org/abs/2501.19393
-
- # Use
-
- This is the token-conditional control model for our paper. You can evaluate using the information [here](https://github.com/simplescaling/s1?tab=readme-ov-file#evaluation).
-
- # Training information
-
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/hashimoto-group/o1/runs/xaantfal)
-
- - TRL: 0.13.0
- - Transformers: 4.48.0
- - Pytorch: 2.3.1
- - Datasets: 3.0.1
- - Tokenizers: 0.21.0
-
- # Citation
-
- ```bibtex
- @misc{muennighoff2025s1simpletesttimescaling,
- title={s1: Simple test-time scaling},
- author={Niklas Muennighoff and Zitong Yang and Weijia Shi and Xiang Lisa Li and Li Fei-Fei and Hannaneh Hajishirzi and Luke Zettlemoyer and Percy Liang and Emmanuel Candès and Tatsunori Hashimoto},
- year={2025},
- eprint={2501.19393},
- archivePrefix={arXiv},
- primaryClass={cs.CL},
- url={https://arxiv.org/abs/2501.19393},
- }
+ ---
+ base_model: Qwen/Qwen2.5-32B-Instruct
+ library_name: transformers
+ model_name: step-conditional-control
+ tags:
+ - generated_from_trainer
+ - trl
+ - sft
+ license: apache-2.0
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ ---
+
+ # Model Summary
+
+ - **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
+ - **Paper:** https://arxiv.org/abs/2501.19393
+
+ # Use
+
+ This is the token-conditional control model for our paper. You can evaluate using the information [here](https://github.com/simplescaling/s1?tab=readme-ov-file#evaluation).
+
+ # Training information
+
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/hashimoto-group/o1/runs/xaantfal)
+
+ - TRL: 0.13.0
+ - Transformers: 4.48.0
+ - Pytorch: 2.3.1
+ - Datasets: 3.0.1
+ - Tokenizers: 0.21.0
+
+ # Citation
+
+ ```bibtex
+ @misc{muennighoff2025s1simpletesttimescaling,
+ title={s1: Simple test-time scaling},
+ author={Niklas Muennighoff and Zitong Yang and Weijia Shi and Xiang Lisa Li and Li Fei-Fei and Hannaneh Hajishirzi and Luke Zettlemoyer and Percy Liang and Emmanuel Candès and Tatsunori Hashimoto},
+ year={2025},
+ eprint={2501.19393},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL},
+ url={https://arxiv.org/abs/2501.19393},
+ }
  ```