Hugging Face
Models: 558
Sort: Trending
Active filters: image-to-text, transformers
Salesforce/blip-image-captioning-large • Image-to-Text • 0.5B • Updated Feb 3 • 2.49M • 1.37k
Salesforce/blip-image-captioning-base • Image-to-Text • Updated Feb 3 • 2.93M • 737
kha-white/manga-ocr-base • Image-to-Text • Updated Jun 22, 2022 • 277k • 150
microsoft/trocr-large-handwritten • Image-to-Text • Updated May 27, 2024 • 28.2k • 119
naver-clova-ix/donut-base • Image-to-Text • Updated Aug 13, 2022 • 107k • 215
breezedeus/pix2text-mfr • Image-to-Text • Updated May 5, 2024 • 206k • 38
openthaigpt/thai-trocr • Image-to-Text • 0.1B • Updated Nov 4, 2024 • 4.51k • 15
bipin/image-caption-generator • Image-to-Text • 0.3B • Updated Jul 27, 2024 • 222 • 16
jinhybr/OCR-Donut-CORD • Image-to-Text • Updated Nov 5, 2022 • 441 • 208
ddobokki/ko-trocr • Image-to-Text • 0.2B • Updated Oct 22, 2024 • 2.23k • 31
kusumakar/hashtagGenerater • Image-to-Text • 0.3B • Updated Jul 15, 2023 • 24 • 3
facebook/nougat-small • Image-to-Text • 0.2B • Updated Nov 20, 2023 • 9.05k • 28
Gregor/mblip-bloomz-7b • Image-to-Text • 8B • Updated Apr 28, 2024 • 45 • 2
microsoft/kosmos-2-patch14-224 • Image-to-Text • 2B • Updated Nov 28, 2023 • 170k • 166
AdamCodd/donut-receipts-extract • Image-to-Text • 0.2B • Updated Jan 11 • 35
thwri/CogFlorence-2.2-Large • Image-to-Text • 0.8B • Updated Sep 28, 2024 • 12.4k • 38
humbleakh/qwen2.5-vl-3b-8bit-chain-of-zoom • Image-to-Text • Updated 20 days ago • 72 • 1
adalbertojunior/image_captioning_portuguese • Image-to-Text • Updated Jul 17, 2024 • 37 • 1
gagan3012/ViTGPT2_vizwiz • Image-to-Text • Updated Feb 7, 2022 • 43 • 1
microsoft/trocr-base-handwritten • Image-to-Text • 0.3B • Updated Feb 11 • 186k • 414
microsoft/trocr-base-printed • Image-to-Text • 0.3B • Updated May 27, 2024 • 385k • 176
microsoft/trocr-base-stage1 • Image-to-Text • 0.4B • Updated May 27, 2024 • 46.9k • 13
microsoft/trocr-large-printed • Image-to-Text • 0.6B • Updated May 27, 2024 • 190k • 167
microsoft/trocr-large-stage1 • Image-to-Text • 0.6B • Updated May 27, 2024 • 6.05k • 25
microsoft/trocr-small-handwritten • Image-to-Text • Updated May 27, 2024 • 362k • 49
microsoft/trocr-small-printed • Image-to-Text • 0.1B • Updated May 27, 2024 • 40.6k • 40
microsoft/trocr-small-stage1 • Image-to-Text • Updated Jan 24, 2023 • 16.2k • 12
nlpconnect/vit-gpt2-image-captioning • Image-to-Text • Updated Feb 27, 2023 • 590k • 896
sachin/vit2distilgpt2 • Image-to-Text • 0.2B • Updated Aug 17, 2023 • 47 • 8
ydshieh/vit-gpt2-coco-en • Image-to-Text • Updated Sep 16, 2022 • 3.5k • 38