Update app.py
app.py CHANGED
@@ -190,7 +190,7 @@ async def generate(request: Request):
     try:
         response = llm.create_chat_completion(
             messages=messages_for_llm,
-            max_tokens=
+            max_tokens=800, # Keep response length short for maximum speed
             temperature=0.7, # Adjust temperature for creativity vs. coherence (0.0-1.0)
             stop=["</s>"] # Stop sequence for TinyLlama Chat
         )
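
For context, here is a minimal sketch of how this call is typically made with llama-cpp-python. The model path, n_ctx, and example messages below are assumptions for illustration; they are not taken from this repository's app.py.

# Minimal sketch (assumed model path, n_ctx, and messages; only the parameters
# shown in the diff above come from app.py).
from llama_cpp import Llama

# Load a GGUF model; TinyLlama Chat is assumed here to match the "</s>" stop sequence.
llm = Llama(model_path="tinyllama-1.1b-chat-v1.0.Q4_K_M.gguf", n_ctx=2048)

messages_for_llm = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize what a GGUF file is in one sentence."},
]

response = llm.create_chat_completion(
    messages=messages_for_llm,
    max_tokens=800,    # Cap the completion length; lower values return faster
    temperature=0.7,   # Adjust temperature for creativity vs. coherence (0.0-1.0)
    stop=["</s>"],     # Stop sequence for TinyLlama Chat
)

print(response["choices"][0]["message"]["content"])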