Nymbo commited on
Commit
d1c6c5d
·
verified ·
1 Parent(s): d6038df

no more 30 second time limit on kokoro, can go for several minutes.

Browse files
Files changed (1) hide show
  1. app.py +5 -4
app.py CHANGED
@@ -583,18 +583,18 @@ def Generate_Speech( # <-- MCP tool #4 (Generate Speech)
583
  Japanese, Portuguese, and Chinese speakers.
584
 
585
  Enhanced for longer audio generation:
 
586
  - Can generate audio of any length based on input text
587
  - Concatenates multiple segments for seamless longer audio
588
 
589
  Default behavior:
590
- - Speed defaults to 1.25 (slightly brisk cadence).
591
- - Voice defaults to "af_heart".
592
 
593
  Args:
594
  text: The text to synthesize. Works best with English but supports multiple languages.
595
  speed: Speech speed multiplier in 0.5–2.0; 1.0 = normal speed. Default: 1.25 (slightly brisk).
596
- voice: Voice identifier from 54 available options. Use List_Kokoro_Voices() to see all choices.
597
- Examples: 'af_heart' (US female), 'am_adam' (US male), 'bf_bella' (British female),
598
 
599
  Returns:
600
  A tuple of (sample_rate_hz, audio_waveform) where:
@@ -605,6 +605,7 @@ def Generate_Speech( # <-- MCP tool #4 (Generate Speech)
605
  - Requires the 'kokoro' package (>=0.9.4). If unavailable, an error is raised.
606
  - Runs on CUDA if available; otherwise CPU.
607
  - Supports 54 voices across 9 language/accent categories.
 
608
  - Use List_Kokoro_Voices() MCP tool to discover all available voice options.
609
  """
610
  if not text or not text.strip():
 
583
  Japanese, Portuguese, and Chinese speakers.
584
 
585
  Enhanced for longer audio generation:
586
+ - Processes ALL text segments (not just the first one)
587
  - Can generate audio of any length based on input text
588
  - Concatenates multiple segments for seamless longer audio
589
 
590
  Default behavior:
591
+ - Speed defaults to 1.25 (slightly brisk cadence) for clearer, snappier delivery.
592
+ - Voice defaults to "af_heart" (American Female, Heart voice)
593
 
594
  Args:
595
  text: The text to synthesize. Works best with English but supports multiple languages.
596
  speed: Speech speed multiplier in 0.5–2.0; 1.0 = normal speed. Default: 1.25 (slightly brisk).
597
+ voice: Voice identifier from 54 available options. Use List_Kokoro_Voices() to see all choices. Default: 'af_heart'.
 
598
 
599
  Returns:
600
  A tuple of (sample_rate_hz, audio_waveform) where:
 
605
  - Requires the 'kokoro' package (>=0.9.4). If unavailable, an error is raised.
606
  - Runs on CUDA if available; otherwise CPU.
607
  - Supports 54 voices across 9 language/accent categories.
608
+ - Can generate audio of any length - no 30 second limit!
609
  - Use List_Kokoro_Voices() MCP tool to discover all available voice options.
610
  """
611
  if not text or not text.strip():