Spaces:

jdelavande
/

chat-ui-energy

Running on CPU Upgrade

nsarrazin commited on Nov 28, 2024

Commit

1b505b4

unverified ·

1 Parent(s): ba5294b

feat(hchat): add QwQ to prod config (#1598)

* feat(hchat): add QwQ to prod config

* fix: change context to 16k

Files changed (1) hide show

chart/env/prod.yaml CHANGED Viewed

@@ -137,6 +137,23 @@ envVars:
           }
         ]
       },
       {
         "name": "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
         "tokenizer": "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",

           }
         ]
       },
+      {
+        "name": "Qwen/QwQ-32B-Preview",
+        "preprompt": "You are a helpful and harmless assistant. You are Qwen developed by Alibaba. You should think step-by-step.",
+        "modelUrl": "https://huggingface.co/Qwen/QwQ-32B-Preview",
+        "websiteUrl": "https://qwenlm.github.io/blog/qwq-32b-preview/",
+        "logoUrl": "https://huggingface.co/datasets/huggingchat/models-logo/resolve/main/qwen-logo.png",
+        "description": "QwQ is an experiment model from the Qwen Team with advanced reasoning capabilities.",
+        "parameters": {
+          "stop": ["<|im_end|>"],
+          "truncate": 12288,
+          "max_new_tokens": 4096,
+          "temperature": 0.7,
+          "top_k": 20,
+          "top_p": 0.8,
+          "repetition_penalty": 1.05
+        }
+      },
       {
         "name": "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",
         "tokenizer": "nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",