Priyanshukr-1 committed
Commit 6c97f34 (verified) · Parent(s): f747bda

Update app.py

Files changed (1): app.py (+1, -1)
app.py CHANGED
@@ -166,7 +166,7 @@ async def generate(request: Request):
     try:
         response = llm.create_chat_completion(
             messages=messages_for_llm,
-            max_tokens=1024, # Keep response length short for maximum speed
+            max_tokens=300, # Keep response length short for maximum speed
             temperature=0.7, # Adjust temperature for creativity vs. coherence (0.0-1.0)
             stop=["</s>"] # Stop sequence for TinyLlama Chat
         )
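
For reference, a minimal sketch of the endpoint this hunk belongs to, assuming the app uses FastAPI together with llama-cpp-python as the diff context suggests. The model path, request payload shape, and response handling below are illustrative placeholders and not part of the commit; only the create_chat_completion arguments (max_tokens=300, temperature=0.7, stop=["</s>"]) come from the diff itself.

from fastapi import FastAPI, Request
from llama_cpp import Llama

app = FastAPI()

# Assumed model file; the actual GGUF path is not shown in this diff.
llm = Llama(model_path="tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf", n_ctx=2048)

@app.post("/generate")
async def generate(request: Request):
    data = await request.json()  # assumed payload shape: {"prompt": "..."}
    messages_for_llm = [{"role": "user", "content": data.get("prompt", "")}]
    try:
        response = llm.create_chat_completion(
            messages=messages_for_llm,
            max_tokens=300,   # value introduced by this commit (previously 1024)
            temperature=0.7,  # creativity vs. coherence trade-off (0.0-1.0)
            stop=["</s>"]     # stop sequence for TinyLlama Chat
        )
        return {"response": response["choices"][0]["message"]["content"]}
    except Exception as exc:
        return {"error": str(exc)}

Lowering max_tokens from 1024 to 300 caps the length of each completion, which shortens generation time per request at the cost of truncating longer answers.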