File size: 2,299 Bytes
167c640
 
 
 
 
 
ee545fd
167c640
 
 
 
 
 
 
ee545fd
bdcc963
3afe6fc
3843461
 
 
 
bdcc963
167c640
3afe6fc
167c640
bdcc963
b5dd9b3
bdcc963
 
b5dd9b3
5679594
b5dd9b3
5679594
167c640
 
 
75207e3
167c640
3afe6fc
 
 
 
 
 
167c640
 
 
 
 
 
 
75207e3
 
167c640
75207e3
 
167c640
 
75207e3
167c640
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
<!DOCTYPE html>
<html lang="en">

<head>
	<meta charset="utf-8" />
	<meta name="viewport" content="width=device-width, initial-scale=1.0" />
	<title>V-JEPA 2 Video Classification Demo</title>
	<link rel="stylesheet" href="style.css" />
	<link href="https://fonts.googleapis.com/css2?family=Inter:wght@400;500;600&display=swap" rel="stylesheet">
</head>

<body>
	<div class="container">
		<header>
			<h1>V-JEPA 2: Streaming Video Classification</h1>
			<nav class="links">
				<a href="https://huggingface.co/collections/facebook/v-jepa-2-6841bad8413014e185b497a6"
					target="_blank">πŸ€–
					Hugging Face Collection</a>
				<a href="https://github.com/facebookresearch/vjepa2" target="_blank">πŸ’» GitHub</a>
				<a href="https://huggingface.co/papers/2506.09985" target="_blank">πŸ“‘ Paper</a>
			</nav>
		</header>

		<section class="description">
			<h4>πŸŽ₯ Video Classification</h4>
			<p>This demo showcases a specialized version of V-JEPA 2, fine-tuned for real-time video action recognition!
				Model optimized specifically for classifying 174 different actions from the Something-Something-V2
				dataset.
				Watch as it instantly understands what's happening in the video! ⚑
				See instructions below to get started with your webcam. πŸš€
			</p>

		</section>

		<section class="demo">
			<h2>Recorded Demo</h2>
			<div class="video-container">
				<video controls>
					<source
						src="https://huggingface.co/spaces/qubvel-hf/vjepa2-streaming-video-classification/resolve/main/misc/vjepa2-fastrtc-demo.mp4"
						type="video/mp4">
					Your browser does not support the video tag.
				</video>
			</div>
		</section>

		<section class="installation">
			<h2>Getting Started</h2>
			<div class="instructions">
				<h3>1. Clone the Repository</h3>
				<pre><code>git clone https://huggingface.co/spaces/qubvel-hf/vjepa2-streaming-video-classification
cd vjepa2-streaming-video-classification</code></pre>

				<h3>2. Install requirements</h3>
				<pre><code>pip install -r requirements.txt</code></pre>

				<h3>3. Run the Demo</h3>
				<pre><code>gradio app.py</code></pre>
			</div>
		</section>

		<footer>
			<p>Built with ❀️ using <a href="https://huggingface.co/transformers" target="_blank">Hugging Face
					Transformers</a></p>
		</footer>
	</div>
</body>

</html>