How to use aoxo/kevin-token-compressor with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("unsloth/Meta-Llama-3.1-8B-bnb-4bit")
model = PeftModel.from_pretrained(base_model, "aoxo/kevin-token-compressor")
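
Loading the adapter is only the first step. A minimal sketch of running one generation with it might look like the following. Several things here are assumptions, not part of the model card: that the adapter reuses the base model's tokenizer, that `accelerate` and `bitsandbytes` are installed with a CUDA GPU available (the base checkpoint is 4-bit quantized), and the helper names `load_model_and_tokenizer` and `generate`, which are hypothetical. The prompt format the adapter expects is not documented here, so any prompt you pass is a placeholder.

```python
BASE_MODEL_ID = "unsloth/Meta-Llama-3.1-8B-bnb-4bit"
ADAPTER_ID = "aoxo/kevin-token-compressor"


def load_model_and_tokenizer():
    """Load the 4-bit base model, attach the PEFT adapter, and load a tokenizer."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # device_map="auto" requires the `accelerate` package.
    base_model = AutoModelForCausalLM.from_pretrained(BASE_MODEL_ID, device_map="auto")
    model = PeftModel.from_pretrained(base_model, ADAPTER_ID)
    model.eval()

    # Assumption: the adapter uses the base model's tokenizer unchanged.
    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL_ID)
    return model, tokenizer


def generate(model, tokenizer, prompt, max_new_tokens=64):
    """Run one generation and return only the newly generated text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # The output contains the prompt tokens followed by the new tokens; drop the prompt.
    prompt_len = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0][prompt_len:], skip_special_tokens=True)
```

To try it, call `model, tokenizer = load_model_and_tokenizer()` once, then `generate(model, tokenizer, your_prompt)` for each input. Keeping loading separate from generation avoids reloading the 8B base model on every call.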