Spaces:
Sleeping
Sleeping
Update app.py
Browse files
app.py
CHANGED
@@ -79,31 +79,27 @@ class YouTubeDownloader:
|
|
79 |
- Visual effects, text overlays, or graphics
|
80 |
- Mood, tone, and atmosphere
|
81 |
- Camera movements or angles (if apparent)
|
|
|
|
|
|
|
|
|
|
|
|
|
82 |
|
83 |
-
|
84 |
- For videos under 1 minute: 2-3 second segments
|
85 |
- For videos 1-5 minutes: 3-5 second segments
|
86 |
- For videos 5-15 minutes: 5-10 second segments
|
87 |
- For videos over 15 minutes: 10-15 second segments
|
88 |
- Maximum 20 scenes total for longer videos
|
89 |
|
90 |
-
|
91 |
**[MM:SS-MM:SS]**: Detailed description including who is visible, what they're wearing, what they're doing, what they're saying (if applicable), setting details, objects shown, and any visual elements.
|
92 |
|
93 |
-
|
94 |
-
- Character descriptions (appearance, clothing, expressions)
|
95 |
-
- Actions and movements
|
96 |
-
- Objects, products, or props being displayed
|
97 |
-
- Setting and background details
|
98 |
-
- Any text, graphics, or overlays
|
99 |
-
- Transitions between scenes
|
100 |
-
|
101 |
5. Write descriptions as if you're watching the video in real-time, noting everything visible and audible.
|
102 |
|
103 |
-
|
104 |
-
- Include short direct speech/dialogue wherever possible.
|
105 |
-
- If no exact lines are known, intelligently infer short probable phrases (e.g., "Let's get started!", "Here's how you do it.", etc.)
|
106 |
-
|
107 |
Based on the title and description, intelligently infer what would likely happen in each time segment. Consider the video type and create contextually appropriate, detailed descriptions.
|
108 |
"""
|
109 |
|
|
|
79 |
- Visual effects, text overlays, or graphics
|
80 |
- Mood, tone, and atmosphere
|
81 |
- Camera movements or angles (if apparent)
|
82 |
+
|
83 |
+
2. Dialogue Emphasis:
|
84 |
+
- Include short dialogue lines in **every scene** wherever plausible.
|
85 |
+
- Write lines like: Character: "Actual or inferred line..."
|
86 |
+
- If dialogue is not available, intelligently infer probable phrases (e.g., "Welcome!", "Try this now!", "It feels amazing!").
|
87 |
+
- Do NOT skip dialogue unless it’s clearly impossible.
|
88 |
|
89 |
+
3. Timestamp Guidelines:
|
90 |
- For videos under 1 minute: 2-3 second segments
|
91 |
- For videos 1-5 minutes: 3-5 second segments
|
92 |
- For videos 5-15 minutes: 5-10 second segments
|
93 |
- For videos over 15 minutes: 10-15 second segments
|
94 |
- Maximum 20 scenes total for longer videos
|
95 |
|
96 |
+
4. Format each scene EXACTLY like this:
|
97 |
**[MM:SS-MM:SS]**: Detailed description including who is visible, what they're wearing, what they're doing, what they're saying (if applicable), setting details, objects shown, and any visual elements.
|
98 |
|
99 |
+
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
100 |
5. Write descriptions as if you're watching the video in real-time, noting everything visible and audible.
|
101 |
|
102 |
+
|
|
|
|
|
|
|
103 |
Based on the title and description, intelligently infer what would likely happen in each time segment. Consider the video type and create contextually appropriate, detailed descriptions.
|
104 |
"""
|
105 |
|