luke9705 commited on
Commit
e6a82a1
·
verified ·
1 Parent(s): 4b4e8cf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -18,12 +18,12 @@ tag: agent-demo-track
18
  **Scriptura** is a multi-agent AI framework based on HF-SmolAgents that streamlines the creation of screenplays, storyboards, and soundtracks by automating the stages of analysis, summarization, and multimodal enrichment—freeing authors to focus on pure creativity.
19
 
20
  At its heart:
21
- -Qwen3-32B serves as the primary orchestrating agent, coordinating workflows and managing high-level reasoning across the system.
22
- -Gemma-3-27B-IT acts as a specialized assistant for multimodal tasks, supporting both text and audio inputs to refine narrative elements and prepare them for downstream generation.
23
 
24
  For media generation, Scriptura integrates:
25
- -MusicGen models (per the AudioCraft MusicGen specification), deployed via Hugging Face Spaces, enabling the agent to produce original soundtracks and sound effects from text prompts or combined text + audio samples.
26
- -FLUX (black-forest-labs/FLUX.1-dev) for on-the-fly image creation—ideal for storyboards, concept art, and visual references that seamlessly tie into the narrative flow.
27
 
28
  Optionally, Scriptura can query external sources (e.g., via a DuckDuckGo API integration) to pull in reference scripts, sound samples, or research materials, ensuring that every draft is not only creatively rich but also contextually informed.
29
 
 
18
  **Scriptura** is a multi-agent AI framework based on HF-SmolAgents that streamlines the creation of screenplays, storyboards, and soundtracks by automating the stages of analysis, summarization, and multimodal enrichment—freeing authors to focus on pure creativity.
19
 
20
  At its heart:
21
+ -Qwen3-32B serves as the primary orchestrating agent, coordinating workflows and managing high-level reasoning across the system.
22
+ -Gemma-3-27B-IT acts as a specialized assistant for multimodal tasks, supporting both text and audio inputs to refine narrative elements and prepare them for downstream generation.
23
 
24
  For media generation, Scriptura integrates:
25
+ -MusicGen models (per the AudioCraft MusicGen specification), deployed via Hugging Face Spaces, enabling the agent to produce original soundtracks and sound effects from text prompts or combined text + audio samples.
26
+ -FLUX (black-forest-labs/FLUX.1-dev) for on-the-fly image creation—ideal for storyboards, concept art, and visual references that seamlessly tie into the narrative flow.
27
 
28
  Optionally, Scriptura can query external sources (e.g., via a DuckDuckGo API integration) to pull in reference scripts, sound samples, or research materials, ensuring that every draft is not only creatively rich but also contextually informed.
29