Update README.md
Browse files
README.md
CHANGED
@@ -15,22 +15,17 @@ tag: agent-demo-track
|
|
15 |
|
16 |
## Introduction
|
17 |
|
18 |
-
**Scriptura** is a multi-agent AI
|
19 |
|
20 |
-
|
21 |
-
-
|
22 |
-
-
|
23 |
-
- **MusicGen (facebook/musicgen-melody)** to create short audio tracks or sound effects, useful for prototyping or presenting.
|
24 |
-
- Optional web search (integrated with DuckDuckGo API) to fetch external resources (original scripts, sound effects, reference materials).
|
25 |
|
26 |
-
|
27 |
-
|
28 |
-
-
|
29 |
-
- **Audio**: MP3, WAV (for transcribing dialogue or analyzing uploaded soundtracks)
|
30 |
|
31 |
-
|
32 |
-
|
33 |
-
---
|
34 |
|
35 |
## Agent Capabilities
|
36 |
|
|
|
15 |
|
16 |
## Introduction
|
17 |
|
18 |
+
**Scriptura** is a multi-agent AI framework based on HF-SmolAgents that streamlines the creation of screenplays, storyboards, and soundtracks by automating the stages of analysis, summarization, and multimodal enrichment—freeing authors to focus on pure creativity.
|
19 |
|
20 |
+
At its heart:
|
21 |
+
• Qwen3-32B serves as the primary orchestrating agent, coordinating workflows and managing high-level reasoning across the system.
|
22 |
+
• Gemma-3-27B-IT acts as a specialized assistant for multimodal tasks, supporting both text and audio inputs to refine narrative elements and prepare them for downstream generation.
|
|
|
|
|
23 |
|
24 |
+
For media generation, Scriptura integrates:
|
25 |
+
• MusicGen models (per the AudioCraft MusicGen specification), deployed via Hugging Face Spaces, enabling the agent to produce original soundtracks and sound effects from text prompts or combined text + audio samples.
|
26 |
+
• FLUX (black-forest-labs/FLUX.1-dev) for on-the-fly image creation—ideal for storyboards, concept art, and visual references that seamlessly tie into the narrative flow.
|
|
|
27 |
|
28 |
+
Optionally, Scriptura can query external sources (e.g., via a DuckDuckGo API integration) to pull in reference scripts, sound samples, or research materials, ensuring that every draft is not only creatively rich but also contextually informed.
|
|
|
|
|
29 |
|
30 |
## Agent Capabilities
|
31 |
|