Model Description

This is a Medusa model for Mistral 7B Instruct v0.2. This is trained using the latest Medusa 2 commit.

Training:

  • Dataset used is the self distillation dataset from Mistral 7B Instruct v0.2, temperature 0.3 with output token of 2048.
  • It has been trained using axolotl fork as describe in Medusa 2 README.md

Inference:

  • To load the model please follow the instruction found in Github
Downloads last month
6
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for EmbeddedLLM/Medusa2-Mistral-7B-Instruct-v0.2

Merges
1 model