correct mistake in readme
README.md
CHANGED
@@ -55,7 +55,7 @@ The LLaMA architecture, developed by Meta AI, is a family of efficient transform
 ## Random-Llama-Small Specifics
 
 This model uses random weights and:
-- Has ~
+- Has ~2B parameters across 22 layers.
 - Uses a 2304 hidden size and 9216 FFN size.
 - Supports 128K+ vocab tokens and bfloat16 precision.
 - Supports extended context lengths of 131,072 tokens.
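For reference, the corrected bullet list doubles as a model card. Below is a minimal sketch of how those numbers would map onto a stock Hugging Face `LlamaConfig`; only the values named in the README are taken from it, while `vocab_size` and the attention-head layout are assumptions (the README says only "128K+ vocab" and lists no head counts), so this is an illustration rather than the model's actual config.

```python
from transformers import LlamaConfig, LlamaForCausalLM

config = LlamaConfig(
    hidden_size=2304,                # from the README
    intermediate_size=9216,          # FFN size, from the README
    num_hidden_layers=22,            # from the README
    vocab_size=128256,               # assumed exact value; README says only "128K+"
    max_position_embeddings=131072,  # extended context length, from the README
    torch_dtype="bfloat16",          # declared precision, from the README
)

# Random initialization, matching the "random weights" note in the README.
model = LlamaForCausalLM(config)
print(f"{model.num_parameters() / 1e9:.2f}B parameters")  # roughly ~2B
```

The printed count only approximates the README's ~2B figure, since it depends on the assumed vocab size, the head layout, and whether the input and output embeddings are tied.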