runtime error
Exit code: 1. Reason: Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
Traceback (most recent call last):
  File "/home/user/app/app.py", line 185, in <module>
    handler = Chat(model_path, load_8bit=False, load_4bit=True)
  File "/home/user/app/app.py", line 82, in __init__
    self.model, self.processor, self.tokenizer = model_init(model_path, load_8bit=load_8bit, load_4bit=load_4bit)
  File "/home/user/app/./VideoLLaMA2/videollama2/__init__.py", line 17, in model_init
    tokenizer, model, processor, context_len = load_pretrained_model(model_path, None, model_name, **kwargs)
  File "/home/user/app/./VideoLLaMA2/videollama2/model/__init__.py", line 172, in load_pretrained_model
    model = Videollama2Qwen2ForCausalLM.from_pretrained(model_path, low_cpu_mem_usage=True, config=config, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3165, in from_pretrained
    hf_quantizer.validate_environment(
  File "/usr/local/lib/python3.10/site-packages/transformers/quantizers/quantizer_bnb_4bit.py", line 62, in validate_environment
    raise ImportError(
ImportError: Using `bitsandbytes` 8-bit quantization requires Accelerate: `pip install accelerate` and the latest version of bitsandbytes: `pip install -i https://pypi.org/simple/ bitsandbytes`
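The ImportError above names two packages, `accelerate` and `bitsandbytes`. A minimal sketch for checking whether they are present in the running environment, using only the standard library, might look like the following. The helper names `missing_packages` and `installed_version` are hypothetical and not part of the app or of transformers. The check only confirms that the packages can be found; it does not reproduce whatever other conditions the quantizer's `validate_environment` tests.

```python
import importlib.metadata
import importlib.util


def missing_packages(names):
    """Return the names from `names` that cannot be located for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def installed_version(name):
    """Return the installed distribution version, or None if it is not installed."""
    try:
        return importlib.metadata.version(name)
    except importlib.metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # The two packages named in the ImportError message.
    required = ["accelerate", "bitsandbytes"]
    for pkg in required:
        print(pkg, installed_version(pkg))
    missing = missing_packages(required)
    if missing:
        print("Missing:", ", ".join(missing))
```

If either package is reported missing, the install commands quoted in the error message are the ones to try first.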