ghostai1 commited on
Commit
3c2d6d4
ยท
verified ยท
1 Parent(s): befb35b

Update index.html

Browse files
Files changed (1) hide show
  1. index.html +138 -70
index.html CHANGED
@@ -116,8 +116,7 @@
116
  </div>
117
  </div>
118
  </section>
119
-
120
- <!-- Optimizations Section -->
121
  <section id="optimizations" class="py-5 bg-dark text-white">
122
  <div class="container">
123
  <h2 class="text-center mb-5 animate__animated animate__fadeIn">๐Ÿง™โ€โ™‚๏ธ Ghostai1โ€™s Math Sorcery</h2>
@@ -182,100 +181,169 @@
182
  </div>
183
  </div>
184
 
185
- <!-- Memory Tab -->
186
- <div class="tab-pane fade" id="memory" role="tabpanel" aria-labelledby="memory-tab">
187
- <div class="row row-cols-1 row-cols-md-2 g-4 mb-4">
188
- <div class="col">
189
- <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#contextModal">
190
- <div class="card-body text-center">
191
- <h3 class="card-title">๐Ÿงฌ Context Packing</h3>
192
- <p class="card-text">Compresses contexts, saving ~50% VRAM.</p>
193
- <p><strong>Boost: 50%</strong><br>Stat: ~2โ€“3GB saved<br>Math: \( M_{\text{VRAM}} \propto O(1) \)</p>
194
- </div>
195
  </div>
196
  </div>
197
- <div class="col">
198
- <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#tcmallocModal">
199
- <div class="card-body text-center">
200
- <h3 class="card-title">๐Ÿ’พ tcmalloc</h3>
201
- <p class="card-text">Cuts memory overhead by ~5โ€“20%.</p>
202
- <p><strong>Boost: 5โ€“20%</strong><br>Stat: ~15% CPU relief<br>Math: \( O_{\text{mem}} \approx 0.8 \cdot O_{\text{glibc}} \)</p>
203
- </div>
204
  </div>
205
  </div>
206
- <div class="col">
207
- <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#cacheModal">
208
- <div class="card-body text-center">
209
- <h3 class="card-title">๐Ÿ“ฆ Memory Cache</h3>
210
- <p class="card-text">Preloads data, reducing memory swaps by ~25%.</p>
211
- <p><strong>Boost: 25%</strong><br>Stat: ~1โ€“2GB less swaps<br>Math: \( M_{\text{swap}} \approx 0.75 \cdot M_{\text{base_swap}} \)</p>
212
- </div>
213
  </div>
214
  </div>
215
  </div>
216
  </div>
 
217
 
218
- <!-- Compute Tab -->
219
- <div class="tab-pane fade" id="compute" role="tabpanel" aria-labelledby="compute-tab">
220
- <div class="row row-cols-1 row-cols-md-1 g-4 mb-4">
221
- <div class="col">
222
- <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#batchingModal">
223
- <div class="card-body text-center">
224
- <h3 class="card-title">โšก Dynamic Batching</h3>
225
- <p class="card-text">Adapts batches for ~30โ€“50% throughput gain.</p>
226
- <p><strong>Boost: 30โ€“50%</strong><br>Stat: ~1.5x FPS<br>Math: \( \text{Throughput} \propto B \cdot \text{FPS}_{\text{base}} \)</p>
227
- </div>
228
  </div>
229
  </div>
230
  </div>
231
  </div>
 
232
 
233
- <!-- Efficiency Tab -->
234
- <div class="tab-pane fade" id="efficiency" role="tabpanel" aria-labelledby="efficiency-tab">
235
- <div class="row row-cols-1 row-cols-md-2 g-4 mb-4">
236
- <div class="col">
237
- <div class="card bg-ghost-card text-white h-100" data-bs-toggle="modal" data-bs-target="#powerModal">
238
- <div class="card-body text-center">
239
- <h3 class="card-title">๐Ÿ”‹ Power Optimization</h3>
240
- <p class="card-text">Reduces power draw by ~20โ€“30% during idle states.</p>
241
- <p><strong>Boost: 20โ€“30%</strong><br>Stat: ~10W saved<br>Math: \( P_{\text{idle}} \approx 0.7 \cdot P_{\text{base_idle}} \)</p>
242
- </div>
243
  </div>
244
  </div>
245
- <div class="col">
246
- <div class="card bg-ghost-card text-white h-100" data-bs-toggle="modal" data-bs-target="#threadModal">
247
- <div class="card-body text-center">
248
- <h3 class="card-title">๐Ÿงต Thread Tuning</h3>
249
- <p class="card-text">Optimizes thread allocation for ~10โ€“15% CPU efficiency.</p>
250
- <p><strong>Boost: 10โ€“15%</strong><br>Stat: ~5โ€“10% less overhead<br>Math: \( C_{\text{thread}} \approx 0.85 \cdot C_{\text{base_thread}} \)</p>
251
- </div>
252
  </div>
253
  </div>
254
  </div>
255
  </div>
256
  </div>
 
257
 
258
- <!-- Optimization Breakdown -->
259
- <div class="row mt-4">
260
- <div class="col-md-12">
261
- <h3 class="mb-3 text-white">Optimization Breakdown</h3>
262
- <ul class="text-white">
263
- <li><strong>๐Ÿ”ฎ Compressed Context Packing</strong>: Collapses frame contexts into a fixed-size matrix, slashing VRAM by ~50% (\( M_{\text{VRAM}} \propto O(1) \)), enabling 60s videos on 6GB VRAM GPUs like GTX 1650.</li>
264
- <li><strong>๐Ÿงฌ Dynamic Batching</strong>: Adapts batches (2โ€“4 frames), boosting throughput by ~30โ€“50% (\( \text{Throughput} \propto B \)), perfect for RTX 3050 with enhanced frame processing.</li>
265
- <li><strong>โšก๏ธ Teacache Efficiency</strong>: Caches diffusion states, cutting ~40% off compute time (\( T_{\text{total}} \approx 0.6T_{\text{base}} \)), delivering ~10โ€“15s/frame on RTX 3060.</li>
266
- <li><strong>๐Ÿง™โ€โ™‚๏ธ Sage-Attention</strong>: Streamlines attention layers, saving ~5โ€“10% time (\( T_{\text{attn}} \approx 0.9T_{\text{base_attn}} \)), boosting low-VRAM performance.</li>
267
- <li><strong>๐Ÿ’พ tcmalloc</strong>: Reduces memory overhead by ~5โ€“20% (\( O_{\text{mem}} \approx 0.8O_{\text{glibc}} \)), easing CPU load by ~15% for smoother operation.</li>
268
- <li><strong>โšก CUDA Tweaks</strong>: Cuts latency by ~10โ€“15% (\( L_{\text{CUDA}} \approx 0.85L_{\text{base}} \)) with optimized memory allocation, maximizing GPU efficiency.</li>
269
- <li><strong>๐Ÿš€ Dynamic Scheduling</strong>: Adapts processing queues, reducing task completion time by ~15โ€“20% (\( T_{\text{sched}} \approx 0.8T_{\text{base_sched}} \)), enhancing workflow speed.</li>
270
- <li><strong>๐Ÿ“ฆ Memory Cache</strong>: Preloads data, cutting memory swaps by ~25% (\( M_{\text{swap}} \approx 0.75M_{\text{base_swap}} \)), improving data access times.</li>
271
- <li><strong>๐Ÿ”‹ Power Optimization</strong>: Lowers power draw by ~20โ€“30% (\( P_{\text{idle}} \approx 0.7P_{\text{base_idle}} \)), ideal for energy-efficient setups.</li>
272
- <li><strong>๐Ÿงต Thread Tuning</strong>: Optimizes thread allocation, boosting CPU efficiency by ~10โ€“15% (\( C_{\text{thread}} \approx 0.85C_{\text{base_thread}} \)), reducing overhead.</li>
273
- </ul>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
274
  </div>
275
  </div>
276
 
277
  <!-- VRAM Requirements Table -->
278
- <div class="row mt-4">
279
  <div class="col-md-12">
280
  <h3 class="text-center text-white mb-3">VRAM Requirements ๐Ÿ–ฅ๏ธ</h3>
281
  <div class="table-responsive">
 
116
  </div>
117
  </div>
118
  </section>
119
+ <!-- Optimizations Section -->
 
120
  <section id="optimizations" class="py-5 bg-dark text-white">
121
  <div class="container">
122
  <h2 class="text-center mb-5 animate__animated animate__fadeIn">๐Ÿง™โ€โ™‚๏ธ Ghostai1โ€™s Math Sorcery</h2>
 
181
  </div>
182
  </div>
183
 
184
+ <!-- Memory Tab -->
185
+ <div class="tab-pane fade" id="memory" role="tabpanel" aria-labelledby="memory-tab">
186
+ <div class="row row-cols-1 row-cols-md-2 g-4 mb-4">
187
+ <div class="col">
188
+ <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#contextModal">
189
+ <div class="card-body text-center">
190
+ <h3 class="card-title">๐Ÿงฌ Context Packing</h3>
191
+ <p class="lead-text">Compresses contexts, saving ~50% VRAM.</p>
192
+ <p class="lead-text"><strong>Boost:</strong> 50% | <strong>Stat:</strong> ~2โ€“3GB saved | <strong>Math:</strong> \( M_{\text{VRAM}} \propto O(1) \)</p>
 
193
  </div>
194
  </div>
195
+ </div>
196
+ <div class="col">
197
+ <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#tcmallocModal">
198
+ <div class="card-body text-center">
199
+ <h3 class="card-title">๐Ÿ’พ tcmalloc</h3>
200
+ <p class="lead-text">Cuts memory overhead by ~5โ€“20%.</p>
201
+ <p class="lead-text"><strong>Boost:</strong> 5โ€“20% | <strong>Stat:</strong> ~15% CPU relief | <strong>Math:</strong> \( O_{\text{mem}} \approx 0.8 \cdot O_{\text{glibc}} \)</p>
202
  </div>
203
  </div>
204
+ </div>
205
+ <div class="col">
206
+ <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#cacheModal">
207
+ <div class="card-body text-center">
208
+ <h3 class="card-title">๐Ÿ“ฆ Memory Cache</h3>
209
+ <p class="lead-text">Preloads data, reducing memory swaps by ~25%.</p>
210
+ <p class="lead-text"><strong>Boost:</strong> 25% | <strong>Stat:</strong> ~1โ€“2GB less swaps | <strong>Math:</strong> \( M_{\text{swap}} \approx 0.75 \cdot M_{\text{base_swap}} \)</p>
211
  </div>
212
  </div>
213
  </div>
214
  </div>
215
+ </div>
216
 
217
+ <!-- Compute Tab -->
218
+ <div class="tab-pane fade" id="compute" role="tabpanel" aria-labelledby="compute-tab">
219
+ <div class="row row-cols-1 row-cols-md-1 g-4 mb-4">
220
+ <div class="col">
221
+ <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#batchingModal">
222
+ <div class="card-body text-center">
223
+ <h3 class="card-title">โšก Dynamic Batching</h3>
224
+ <p class="lead-text">Adapts batches for ~30โ€“50% throughput gain.</p>
225
+ <p class="lead-text"><strong>Boost:</strong> 30โ€“50% | <strong>Stat:</strong> ~1.5x FPS | <strong>Math:</strong> \( \text{Throughput} \propto B \cdot \text{FPS}_{\text{base}} \)</p>
 
226
  </div>
227
  </div>
228
  </div>
229
  </div>
230
+ </div>
231
 
232
+ <!-- Efficiency Tab -->
233
+ <div class="tab-pane fade" id="efficiency" role="tabpanel" aria-labelledby="efficiency-tab">
234
+ <div class="row row-cols-1 row-cols-md-2 g-4 mb-4">
235
+ <div class="col">
236
+ <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#powerModal">
237
+ <div class="card-body text-center">
238
+ <h3 class="card-title">๐Ÿ”‹ Power Optimization</h3>
239
+ <p class="lead-text">Reduces power draw by ~20โ€“30% during idle states.</p>
240
+ <p class="lead-text"><strong>Boost:</strong> 20โ€“30% | <strong>Stat:</strong> ~10W saved | <strong>Math:</strong> \( P_{\text{idle}} \approx 0.7 \cdot P_{\text{base_idle}} \)</p>
 
241
  </div>
242
  </div>
243
+ </div>
244
+ <div class="col">
245
+ <div class="card bg-ghost-card h-100" data-bs-toggle="modal" data-bs-target="#threadModal">
246
+ <div class="card-body text-center">
247
+ <h3 class="card-title">๐Ÿงต Thread Tuning</h3>
248
+ <p class="lead-text">Optimizes thread allocation for ~10โ€“15% CPU efficiency.</p>
249
+ <p class="lead-text"><strong>Boost:</strong> 10โ€“15% | <strong>Stat:</strong> ~5โ€“10% less overhead | <strong>Math:</strong> \( C_{\text{thread}} \approx 0.85 \cdot C_{\text{base_thread}} \)</p>
250
  </div>
251
  </div>
252
  </div>
253
  </div>
254
  </div>
255
+ </div>
256
 
257
+ <!-- Optimization Breakdown -->
258
+ <div class="row mt-5">
259
+ <div class="col-md-12">
260
+ <h3 class="mb-4 text-white">Optimization Breakdown</h3>
261
+ <div class="row row-cols-1 row-cols-md-2 g-4">
262
+ <div class="col">
263
+ <div class="card bg-ghost-card">
264
+ <div class="card-body text-center">
265
+ <h4 class="card-title">๐Ÿ”ฎ Compressed Context Packing</h4>
266
+ <p class="lead-text">Collapses frame contexts into a fixed-size matrix, slashing VRAM by ~50% (\( M_{\text{VRAM}} \propto O(1) \)), enabling 60s videos on 6GB VRAM GPUs like GTX 1650.</p>
267
+ </div>
268
+ </div>
269
+ </div>
270
+ <div class="col">
271
+ <div class="card bg-ghost-card">
272
+ <div class="card-body text-center">
273
+ <h4 class="card-title">๐Ÿงฌ Dynamic Batching</h4>
274
+ <p class="lead-text">Adapts batches (2โ€“4 frames), boosting throughput by ~30โ€“50% (\( \text{Throughput} \propto B \)), perfect for RTX 3050 with enhanced frame processing.</p>
275
+ </div>
276
+ </div>
277
+ </div>
278
+ <div class="col">
279
+ <div class="card bg-ghost-card">
280
+ <div class="card-body text-center">
281
+ <h4 class="card-title">โšก๏ธ Teacache Efficiency</h4>
282
+ <p class="lead-text">Caches diffusion states, cutting ~40% off compute time (\( T_{\text{total}} \approx 0.6T_{\text{base}} \)), delivering ~10โ€“15s/frame on RTX 3060.</p>
283
+ </div>
284
+ </div>
285
+ </div>
286
+ <div class="col">
287
+ <div class="card bg-ghost-card">
288
+ <div class="card-body text-center">
289
+ <h4 class="card-title">๐Ÿง™โ€โ™‚๏ธ Sage-Attention</h4>
290
+ <p class="lead-text">Streamlines attention layers, saving ~5โ€“10% time (\( T_{\text{attn}} \approx 0.9T_{\text{base_attn}} \)), boosting low-VRAM performance.</p>
291
+ </div>
292
+ </div>
293
+ </div>
294
+ <div class="col">
295
+ <div class="card bg-ghost-card">
296
+ <div class="card-body text-center">
297
+ <h4 class="card-title">๐Ÿ’พ tcmalloc</h4>
298
+ <p class="lead-text">Reduces memory overhead by ~5โ€“20% (\( O_{\text{mem}} \approx 0.8O_{\text{glibc}} \)), easing CPU load by ~15% for smoother operation.</p>
299
+ </div>
300
+ </div>
301
+ </div>
302
+ <div class="col">
303
+ <div class="card bg-ghost-card">
304
+ <div class="card-body text-center">
305
+ <h4 class="card-title">โšก CUDA Tweaks</h4>
306
+ <p class="lead-text">Cuts latency by ~10โ€“15% (\( L_{\text{CUDA}} \approx 0.85L_{\text{base}} \)) with optimized memory allocation, maximizing GPU efficiency.</p>
307
+ </div>
308
+ </div>
309
+ </div>
310
+ <div class="col">
311
+ <div class="card bg-ghost-card">
312
+ <div class="card-body text-center">
313
+ <h4 class="card-title">๐Ÿš€ Dynamic Scheduling</h4>
314
+ <p class="lead-text">Adapts processing queues, reducing task completion time by ~15โ€“20% (\( T_{\text{sched}} \approx 0.8T_{\text{base_sched}} \)), enhancing workflow speed.</p>
315
+ </div>
316
+ </div>
317
+ </div>
318
+ <div class="col">
319
+ <div class="card bg-ghost-card">
320
+ <div class="card-body text-center">
321
+ <h4 class="card-title">๐Ÿ“ฆ Memory Cache</h4>
322
+ <p class="lead-text">Preloads data, cutting memory swaps by ~25% (\( M_{\text{swap}} \approx 0.75M_{\text{base_swap}} \)), improving data access times.</p>
323
+ </div>
324
+ </div>
325
+ </div>
326
+ <div class="col">
327
+ <div class="card bg-ghost-card">
328
+ <div class="card-body text-center">
329
+ <h4 class="card-title">๐Ÿ”‹ Power Optimization</h4>
330
+ <p class="lead-text">Lowers power draw by ~20โ€“30% (\( P_{\text{idle}} \approx 0.7P_{\text{base_idle}} \)), ideal for energy-efficient setups.</p>
331
+ </div>
332
+ </div>
333
+ </div>
334
+ <div class="col">
335
+ <div class="card bg-ghost-card">
336
+ <div class="card-body text-center">
337
+ <h4 class="card-title">๐Ÿงต Thread Tuning</h4>
338
+ <p class="lead-text">Optimizes thread allocation, boosting CPU efficiency by ~10โ€“15% (\( C_{\text{thread}} \approx 0.85C_{\text{base_thread}} \)), reducing overhead.</p>
339
+ </div>
340
+ </div>
341
+ </div>
342
  </div>
343
  </div>
344
 
345
  <!-- VRAM Requirements Table -->
346
+ <div class="row mt-5">
347
  <div class="col-md-12">
348
  <h3 class="text-center text-white mb-3">VRAM Requirements ๐Ÿ–ฅ๏ธ</h3>
349
  <div class="table-responsive">