Shamik's Collections
Interesting Spaces

updated 10 days ago
  • Running
    184

    Attention Visualization

    🔥

    Vision Transformer Attention Visualization


  • Running on Zero
    140

    Open NotebookLM

    🎙

    Generate a podcast to discuss the topic of your choice!


  • Running on Zero
    MCP
    329

    OCR

    🍍

    olmocr / nanonets ocr / qwen2vl ocr / aya vision / rolmocr


  • Running on Zero
    MCP
    122

    OCR2

    💻

    monkey ocr / nanonets ocr / smoldocling / typhoon ocr


  • Running
    40

    comparevlms

    🏃

    Compare vision language models


  • Running on Zero
    1.09k

    Hunyuan3D-2.1

    👻

    Image-to-3D Generation


  • Running on Zero
    824

    Sesame CSM

    🌱

    Conversational speech generation


  • Running on Zero
    MCP
    1.35k

    Chatterbox TTS

    🍿

    Expressive Zero-shot TTS


  • Running on T4
    392

    Resemble Enhance

    🚀

    Enhance and clean audio files


  • Running on Zero
    265

    ClearerVoice-Studio (Speech Enhancement, Separation and Extraction)

    📈

    A better AI-powered platform to purify your speech signal


  • Running
    188

    MedGemma - Radiology Explainer Demo

    🩺

    Radiology Image & Report Explainer Demo. Built with MedGemma


  • Running on CPU Upgrade
    100

    Appoint Ready - MedGemma Demo

    📋

    Simulated Pre-visit Intake Demo built using MedGemma


  • Running on Zero
    43

    OCR Time Machine

    📚

    Convert images to text using OCR models
