Spaces: fastrtc/echo-audio

freddyaboulton HF Staff committed on Feb 25

Commit 45482ae · verified · 1 Parent(s): c33f70d

Upload folder using huggingface_hub

Files changed (2)

scratch.py ADDED

+from fastrtc import Stream, ReplyOnPause
+import numpy as np
+def echo(audio: tuple[int, np.ndarray]):
+    # The function will be passed the audio until the user pauses
+    # Implement any iterator that yields audio
+    # See "LLM Voice Chat" for a more complete example
+    yield audio
+stream = Stream(
+    handler=ReplyOnPause(echo),
+    modality="audio",
+    mode="send-receive",
+    ui_args={
+        "icon": "https://upload.wikimedia.org/wikipedia/commons/thumb/0/01/Portrait-of-a-woman.jpg/960px-Portrait-of-a-woman.jpg?20200608215745",
+        "pulse_color": "rgb(35, 157, 225)",
+        "icon_button_color": "rgb(35, 157, 225)",
+        "title": "Gemini Audio Video Chat",
+    },
+)
+stream.ui.launch()
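
Note on the handler contract: the comments in `echo` say the handler may be any iterator that yields audio. As a minimal sketch of what that could look like beyond a single `yield`, the generator below returns the user's audio in fixed-length pieces. The names `echo_in_chunks`, `chunk_seconds`, and `build_stream` are hypothetical and not part of this commit, and the code assumes the samples run along the last axis of the array. Only the `Stream` and `ReplyOnPause` arguments already used in scratch.py appear here.

```python
from collections.abc import Iterator

import numpy as np


def echo_in_chunks(
    audio: tuple[int, np.ndarray], chunk_seconds: float = 0.5
) -> Iterator[tuple[int, np.ndarray]]:
    """Yield the captured audio back in pieces of about chunk_seconds each.

    Assumption: samples run along the last axis of the array.
    """
    sample_rate, samples = audio
    # Number of samples per yielded piece (at least one).
    chunk_size = max(1, int(sample_rate * chunk_seconds))
    for start in range(0, samples.shape[-1], chunk_size):
        yield sample_rate, samples[..., start : start + chunk_size]


def build_stream(handler):
    """Wrap a generator handler with the same arguments scratch.py uses."""
    # Imported lazily so the handler above can be tried without fastrtc installed.
    from fastrtc import ReplyOnPause, Stream

    return Stream(
        handler=ReplyOnPause(handler),
        modality="audio",
        mode="send-receive",
    )
```

Under these assumptions, `build_stream(echo_in_chunks).ui.launch()` would start the same demo as scratch.py, with the reply sent as several shorter pieces instead of one.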

script.md CHANGED

@@ -1,5 +1,15 @@
 Hi, I'm Freddy and I want to give a tour of FastRTC - the real-time communication library for Python.
-FastRTC makes it easy to stream audio or video using WebRTC or Websockets - the gold standard for real-time communication.
 Let's start with the basics - echoing audio.

 Hi, I'm Freddy and I want to give a tour of FastRTC - the real-time communication library for Python.
+Why is this important? In the last few months, we've seen many advances in real-time speech and vision, coming from closed-source models, open-source models, and API providers.
+Despite these innovations, it's still difficult to build real-time AI applications that stream audio and video, especially in Python. This is because:
+- ML engineers may not have experience with the technologies needed to build real-time applications, such as WebRTC or WebSockets.
+- Implementing algorithms for voice detection and turn taking is tricky!
+- Best practices are scattered across various sources, and even code assistant tools like Cursor and Copilot struggle to write Python code that supports real-time audio/video applications. I learned that the hard way!
+All this means that if you want to take advantage of the latest advances in AI, you have to spend a lot of time figuring out how to do real-time streaming.
+`FastRTC` solves this problem by automatically turning any Python function into a real-time audio and video stream over WebRTC or WebSockets with little additional code or overhead. Let's see how it works.
 Let's start with the basics - echoing audio.