openaccess-ai-collective/StableLManticore-7B

Text Generation

text-generation-inference

winglian committed on May 20, 2023

Commit 45b69cb · 1 Parent(s): 50fc442

Update README.md

Files changed (1)

README.md +4 -0

@@ -3,4 +3,8 @@
 # StableLManticore 7B
+Yeah, don't use this. It was mostly an experiment to see whether this is even plausible. Unfortunately, StableLM has poor support for SFT with the Hugging Face trainer, so there is no flash attention or similar optimizations. The end result is that this is nearly impossible to train efficiently.
+Yes, it's plausible to train this with LoRA, but the result is not very usable.
+WandB: https://wandb.ai/wing-lian/stable-manticore-7b/runs/b1qqzf2s