Unable to run inference beyond sliding window length
#128 opened over 1 year ago
by
kreas
How to finetune this model mistralai/Mistral-7B-v0.1 and also merge the weights
5
#126 opened over 1 year ago
by
yeniceriSGK
Pretrain?
3
#125 opened over 1 year ago
by
limha
Mistral 7B produces different results when called via the Postman API
7
#124 opened over 1 year ago
by
DivyaKanniah
Load and extract the model for language modeling
1
#123 opened over 1 year ago
by
theodpzz

Unexpected keyword 'rope_scaling' while loading model
3
#122 opened over 1 year ago
by
gandhipratik65j
Kernel crashed while loading checkpoint shards
3
#121 opened over 1 year ago
by
clemennntt
Is there any way to extend the tokenizer's vocabulary and use it to fine-tune the model on a new language?
4
#120 opened over 1 year ago
by
Tejaswi006
I hope it can respond in the language used by the user
#118 opened over 1 year ago
by
poarpeak
Fix context length in config
#117 opened over 1 year ago
by
imone

Finetuning with PEFT - Some weights of MistralForSequenceClassification were not initialized from the model
6
#116 opened over 1 year ago
by
RobbieTheRobot
Data collator removing eos token
#115 opened over 1 year ago
by
MaBrThesis2023
Thanks to Mistral for making our dream a reality
❤️
1
1
#114 opened over 1 year ago
by
Muhammadreza

Is SWA used during pretraining?
2
#113 opened over 1 year ago
by
EarthWorm001
Fine-tuned Mistral generates slowly
#112 opened over 1 year ago
by
yixliu1
PEFT-based fine-tuned model hallucinates values from the fine-tuning training data during inference
7
#111 opened over 1 year ago
by
Pradeep1995

Should we follow the same Mistral prompt structure during fine-tuning?
#110 opened over 1 year ago
by
Pradeep1995

npz file for apple MLX
2
#109 opened over 1 year ago
by
joy2000
Error in config.json
3
#108 opened over 1 year ago
by
aimlBysoham

Incomplete Output even with max_new_tokens
12
#107 opened over 1 year ago
by
Pradeep1995

can't generate embedding vector
#106 opened over 1 year ago
by
philgrey

Maximum number of input tokens ?
1
#104 opened over 1 year ago
by
Kirolos

Mistral Custom Chatbot Code Sample
4
#100 opened over 1 year ago
by
unixguru2k
how to increase response max token size
#99 opened over 1 year ago
by
philgrey

Huggingface.com
#98 opened over 1 year ago
by
Khalid776826
How to remember conversation history (prior prompts and responses)
2
#97 opened over 1 year ago
by
TheBacteria
Why is this 7B model only showing 5GB of gpu ram allocation?
1
3
#96 opened over 1 year ago
by
shayak
Add Flax checkpoints
#95 opened over 1 year ago
by
ksmcg

Update README.md
#93 opened over 1 year ago
by
AzerOuerghi
can i use mistral as embedding model?
1
8
#92 opened over 1 year ago
by
raynWest
Adding `safetensors` variant of this model
2
2
#91 opened over 1 year ago
by
lcahill
Adding Evaluation Results
#90 opened over 1 year ago
by
leaderboard-pr-bot

Embeddings API
2
3
#88 opened almost 2 years ago
by
priamai
Update config.json
#86 opened almost 2 years ago
by
PlanetDOGE

Create README.md
#80 opened almost 2 years ago
by
joey1895
KeyError "Mistral"
7
#79 opened almost 2 years ago
by
lakshmiu
Korean data rate in pretraining datasets.
5
3
#78 opened almost 2 years ago
by
Korabbit
Model outputs only <unk> tokens after training on my data
4
#77 opened almost 2 years ago
by
Fico
MemGPT, Function Calling and Mistral-7b-v0.1
#76 opened almost 2 years ago
by
Joseph717171
I created a site for anyone who wants a full guide to this model
1
#72 opened almost 2 years ago
by
LLMhacker
Can you give an example of a good prompt template?
6
3
#70 opened almost 2 years ago
by
iplayfast
Hosting Mistral 7B API
2
#69 opened almost 2 years ago
by
wahab12
ImportError: Using `load_in_8bit=True` requires Accelerate
4
#68 opened almost 2 years ago
by
ubermenchh

Update README.md
#67 opened almost 2 years ago
by
Enoughking
Suggested Architecture for Small Mistral Model
#66 opened almost 2 years ago
by
mnitin73
Does Mistral support accelerate library?
5
4
#65 opened almost 2 years ago
by
Sp1der
The attention mask and the pad token id were not set.
2
#64 opened almost 2 years ago
by
victor314159
[AUTOMATED] Model Memory Requirements
#63 opened almost 2 years ago
by
model-sizer-bot
If I trained a model on mistral already, do I need to start from scratch due to difficulties of fine-tuning?
2
#62 opened almost 2 years ago
by
brando
