wzebrowski commited on
Commit
965cc44
·
verified ·
1 Parent(s): e3b05c9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -1
README.md CHANGED
@@ -10,4 +10,21 @@ pipeline_tag: text-generation
10
 
11
  Introduce **OpenSML**, a series of **Open** **SM**a**L**l Language Models. These models arcitecture are built stricly will Apple's [MLX](https://ml-explore.github.io/mlx/build/html/index.html#) framework.
12
 
13
- The pre-training dataset is a slice of OpenWebText dataset with approximately 2.3 billion tokens.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
 
11
  Introduce **OpenSML**, a series of **Open** **SM**a**L**l Language Models. These models arcitecture are built stricly will Apple's [MLX](https://ml-explore.github.io/mlx/build/html/index.html#) framework.
12
 
13
+ The pre-training dataset is a slice of OpenWebText dataset with approximately 2.3 billion tokens.
14
+
15
+
16
+
17
+ ## Bias, Risks, and Limitations
18
+
19
+ OpenSMLis shared to advance open research by granting access to cutting-edge language models. However, because it’s trained on publicly sourced data and released without safety warranties, it may produce content that is inaccurate, harmful, biased, or otherwise objectionable. Users and developers should therefore conduct rigorous safety evaluations and put in place filtering or other safeguards that suit their specific use cases.
20
+
21
+ ## Citation
22
+
23
+ If you find our work useful, please cite:
24
+
25
+ @misc{zebrowski2025opensml,
26
+ title={OpenSML: A Family of Small Language Models},
27
+ author={William Zebrowski},
28
+ year={2025},
29
+ howpublished={\url{https://github.com/wzebrowski/opensml}}
30
+ }