Spaces:

jjjimim
/

bettervideos

Configuration error

jjjimim commited on Jul 29

Commit

291ba5c

verified ·

1 Parent(s): d63c580

Upload 3 files

Files changed (3) hide show

README.md CHANGED Viewed

@@ -1,13 +1,25 @@
----
-title: Bettervideos
-emoji: 💻
-colorFrom: indigo
-colorTo: purple
-sdk: gradio
-sdk_version: 5.38.2
-app_file: app.py
-pinned: false
-short_description: 'video vidoe '
----
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

+# Faceless Video Generator (Free)
+This is a minimal talking avatar video generator app using open-source models.
+## Setup
+1. Install dependencies:
+   ```bash
+   pip install -r requirements.txt
+   ```
+2. Run the app:
+   ```bash
+   python app.py
+   ```
+3. Open the URL displayed in your terminal in your browser.
+4. Upload your avatar image and enter text to generate talking avatar videos.
+## Notes
+- This uses the First Order Motion Model (https://github.com/AliaksandrSiarohin/first-order-model) for animation.
+- Coqui TTS is used for text-to-speech.
+- Model weights will be downloaded automatically on first run.

app.py ADDED Viewed

+from huggingface_hub import snapshot_download
+import gradio as gr
+import subprocess
+import os
+import uuid
+def setup_models():
+    if not os.path.exists("checkpoints"):
+        print("Downloading model...")
+        snapshot_download(repo_id="deepinsight/first-order-model", local_dir="checkpoints")
+setup_models()
+def generate(text, image):
+    session = str(uuid.uuid4())[:8]
+    os.makedirs(f"results/{session}", exist_ok=True)
+    image_path = f"results/{session}/avatar.jpg"
+    image.save(image_path)
+    audio_path = f"results/{session}/audio.wav"
+    tts_cmd = f'tts --text "{text}" --out_path {audio_path}'
+    subprocess.run(tts_cmd, shell=True, check=True)
+    video_cmd = f'python checkpoints/inference.py --driven_audio {audio_path} --source_image {image_path} --result_dir results/{session}'
+    subprocess.run(video_cmd, shell=True, check=True)
+    return f"results/{session}/video.mp4"
+gr.Interface(
+    fn=generate,
+    inputs=[gr.Textbox(label="Script"), gr.Image(label="Avatar Image", type="pil")],
+    outputs=gr.Video(label="Generated Video"),
+    title="Faceless Video Generator (Free)",
+    description="Type your message + upload an image. Get a talking avatar video!"
+).launch()

requirements.txt ADDED Viewed

+gradio
+huggingface_hub
+tts
+torch
+opencv-python
+numpy