Make model config compatible with Hugging Face MiniMax implementation
This PR updates the model configuration to be compatible with the upstream Hugging Face transformers implementation of the MiniMax architecture, introduced in transformers#35831. With these changes, the model can now be loaded without relying on `trust_remote_code=True`.

This enables easier and safer usage via the standard `AutoModelForCausalLM` interface:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model and tokenizer from the "-hf" repo; revision refs/pr/39 points
# at this PR's config changes, so trust_remote_code is no longer needed.
model = AutoModelForCausalLM.from_pretrained(
    "MiniMaxAI/MiniMax-Text-01-hf",
    revision="refs/pr/39",
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(
    "MiniMaxAI/MiniMax-Text-01-hf",
    revision="refs/pr/39",
)

prompt = "Hello!"
messages = [
    {"role": "system", "content": [{"type": "text", "text": "You are a helpful assistant created by MiniMax based on MiniMax-Text-01 model."}]},
    {"role": "user", "content": [{"type": "text", "text": prompt}]},
]

# Build the prompt with the chat template and generate a reply.
model_inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to("cuda")
generated_ids = model.generate(model_inputs, max_new_tokens=100, do_sample=True)
print(tokenizer.batch_decode(generated_ids)[0])
```
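As a quick sanity check, the config on that revision should also resolve through `AutoConfig` without any custom code (a minimal sketch; it only inspects which config class transformers picks up):

```python
from transformers import AutoConfig

# With the compatible config, this resolves to the native transformers MiniMax
# config class and does not require trust_remote_code=True.
config = AutoConfig.from_pretrained("MiniMaxAI/MiniMax-Text-01-hf", revision="refs/pr/39")
print(type(config))
```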
Hi @geetu040, thanks a lot for opening the PR to add Transformers support! We've hit a couple of config-parameter clashes in the main repo, so we're spinning up a dedicated HF version for this integration. Could you please retarget your PR to MiniMax-Text-01-hf instead? Appreciate your help!
FYI, this is a breaking change for existing vLLM users who loaded this model in the old format: https://github.com/vllm-project/vllm/issues/20198
We may want to keep the old format around and just point HF users to the new "-hf" version.
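If the old format does stay around, existing vLLM users could keep targeting the original repo and pin a pre-change revision, roughly along these lines (a sketch only; the revision value is a placeholder, and whether `trust_remote_code` is still needed depends on how the old-format repo is loaded):

```python
from vllm import LLM, SamplingParams

# Keep using the original MiniMax-Text-01 repo in its old config format.
# The revision pin is a placeholder -- substitute a commit hash from before
# any config change. trust_remote_code=True is an assumption here; drop it
# if the old-format repo loads without custom code in vLLM.
llm = LLM(
    model="MiniMaxAI/MiniMax-Text-01",
    revision="<pre-change-commit>",
    trust_remote_code=True,
)

outputs = llm.generate(["Hello!"], SamplingParams(max_tokens=100))
print(outputs[0].outputs[0].text)
```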