Update README.md
Browse files
README.md
CHANGED
@@ -28,7 +28,7 @@ SmallThinker is a family of **on-device native** Mixture-of-Experts (MoE) langua
|
|
28 |
| **Activated Parameters** | 0.6B |
|
29 |
| **Number of Layers** | 32 |
|
30 |
| **Attention Hidden Dimension** | 1536 |
|
31 |
-
| **MoE Hidden Dimension** (per Expert) |
|
32 |
| **Number of Attention Heads** | 12 |
|
33 |
| **Number of Experts** | 32 |
|
34 |
| **Selected Experts per Token** | 4 |
|
|
|
28 |
| **Activated Parameters** | 0.6B |
|
29 |
| **Number of Layers** | 32 |
|
30 |
| **Attention Hidden Dimension** | 1536 |
|
31 |
+
| **MoE Hidden Dimension** (per Expert) | 768 |
|
32 |
| **Number of Attention Heads** | 12 |
|
33 |
| **Number of Experts** | 32 |
|
34 |
| **Selected Experts per Token** | 4 |
|