Update app.py
Browse files
app.py
CHANGED
@@ -166,7 +166,7 @@ async def generate(request: Request):
|
|
166 |
try:
|
167 |
response = llm.create_chat_completion(
|
168 |
messages=messages_for_llm,
|
169 |
-
max_tokens=
|
170 |
temperature=0.7, # Adjust temperature for creativity vs. coherence (0.0-1.0)
|
171 |
stop=["</s>"] # Stop sequence for TinyLlama Chat
|
172 |
)
|
|
|
166 |
try:
|
167 |
response = llm.create_chat_completion(
|
168 |
messages=messages_for_llm,
|
169 |
+
max_tokens=1024, # Cap response length in tokens; lower this value for faster replies
|
170 |
temperature=0.7, # Adjust temperature for creativity vs. coherence (0.0-1.0)
|
171 |
stop=["</s>"] # Stop sequence for TinyLlama Chat
|
172 |
)
|