metadata
title: InternViT No Flash-Attn Test
emoji: π
colorFrom: yellow
colorTo: red
sdk: docker
pinned: false
InternViT-6B without Flash Attention
This Space demonstrates loading and using the InternViT-6B model without the flash-attn dependency.
Approach
Instead of trying to compile flash-attn (which times out during build), this version:
- Uses a mock (dummy) implementation of flash-attn in place of the real package
- Installs only the dependencies the model code requires (einops)
- Uses a runtime PyTorch image instead of a development image
- Monkey-patches the flash-attn imports so the model code loads without import errors
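The mock-and-patch idea above can be sketched roughly as follows. This is a minimal illustration, not the Space's actual code: the helper name `install_flash_attn_stub` is hypothetical, and the stubbed names (`flash_attn_func` and the `flash_attn.flash_attn_interface` submodule) are assumptions about what the model code tries to import.

```python
import importlib.machinery
import sys
import types


def install_flash_attn_stub():
    """Register a placeholder `flash_attn` package in sys.modules.

    Hypothetical helper: the attribute and submodule names below are
    assumptions and may need adjusting to match the model code's imports.
    """
    if "flash_attn" in sys.modules:
        # Already imported (real package or a previous stub): leave it alone.
        return sys.modules["flash_attn"]

    def _unavailable(*args, **kwargs):
        # Fail loudly if anything actually calls into the stub.
        raise RuntimeError("flash_attn is stubbed out in this environment")

    stub = types.ModuleType("flash_attn")
    stub.__path__ = []  # mark the stub as a package so submodule imports work
    # Give the stub a spec so importlib.util.find_spec("flash_attn") does not fail.
    stub.__spec__ = importlib.machinery.ModuleSpec(
        "flash_attn", loader=None, is_package=True
    )
    stub.flash_attn_func = _unavailable

    # Assumed submodule; it exposes the same placeholder function.
    interface = types.ModuleType("flash_attn.flash_attn_interface")
    interface.flash_attn_func = _unavailable
    stub.flash_attn_interface = interface

    sys.modules["flash_attn"] = stub
    sys.modules["flash_attn.flash_attn_interface"] = interface
    return stub


# Must run before the model code is imported.
install_flash_attn_stub()
```

Because the stub raises as soon as it is called, this only helps if the model falls back to a standard attention path at runtime; whether it does depends on the model code and its configuration.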
Benefits
- Much faster build time
- No compilation errors or timeouts
- Still loads the full 6B parameter model
- Demonstrates basic functionality
Note
Without the flash-attn optimization, inference may be slower or use more memory, and some functionality might be reduced, but the model should still work for basic inference.
Instructions
- Click the "Test Model Loading" button
- Wait for the model to load and run the test
- Check the results for success or errors