Spaces:

AteetVatan
/

masx-openchat-llm

Sleeping

ateetvatan commited on Jul 3

Commit

6b456ee

1 Parent(s): 8dd6602

fixed bug

Files changed (1) hide show

model_loader.py CHANGED Viewed

@@ -16,10 +16,10 @@ CTX_LEN = int(os.getenv("CTX_LEN", "8192"))       # Use full 8K context
 # === Load Model ===
 model = AutoModelForCausalLM.from_pretrained(
-    model_path=MODEL_REPO,
     model_file=MODEL_FILE,
     model_type=MODEL_TYPE,
     context_length=CTX_LEN,
-    gpu_layers=0,               # Set >0 if you want to offload layers to GPU
     local_files_only=False,
 )

 # === Load Model ===
 model = AutoModelForCausalLM.from_pretrained(
+    MODEL_REPO,
     model_file=MODEL_FILE,
     model_type=MODEL_TYPE,
     context_length=CTX_LEN,
+    gpu_layers=0,
     local_files_only=False,
 )