PyTorch Image Models

https://github.com/rwightman/pytorch-image-models

AI & ML interests

Computer Vision

Recent Activity

rwightman new activity 9 days ago

timm/plant-pathology-2021:Update README.md

rwightman updated a model 9 days ago

timm/vit_dpwee_patch16_reg1_gap_256.sbb_in1k

rwightman updated a model 9 days ago

timm/vit_dpwee_patch16_reg1_gap_256.sbb_nadamuon_in1k

timm's collections (20)

timm DINOv3

Meta AI's DINOv3 weights in timm. ViTs with `qkvb` have a zero QV bias present; otherwise bias is disabled. QKV biases are all 0 in the original weights.

timm/vit_7b_patch16_dinov3.sat493m

Image Feature Extraction • Updated Sep 24 • 2.69k • 1
timm/vit_7b_patch16_dinov3.lvd1689m

Image Feature Extraction • Updated Sep 24 • 3.78k
timm/vit_huge_plus_patch16_dinov3.lvd1689m

Image Feature Extraction • Updated Sep 24 • 23.2k • 4
timm/vit_huge_plus_patch16_dinov3_qkvb.lvd1689m

Image Feature Extraction • Updated Sep 24 • 1.23k

MobileCLIP-2

OpenCLIP / timm ports of Apple's MobileCLIP-2 multi-modal and image encoders

timm/MobileCLIP2-S4-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11 • 190
timm/MobileCLIP2-S3-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11 • 32.9k • 1
timm/MobileCLIP2-S2-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11 • 675
timm/MobileCLIP2-S0-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11 • 1.7k

Searching for Better ViT Baselines

Exploring ViT hparams and model shapes for the GPU poor (between tiny and base).

timm/vit_so150m2_patch16_reg1_gap_384.sbb_e200_in12k_ft_in1k

Image Classification • Updated Feb 17 • 58 • 2
timm/vit_so150m2_patch16_reg1_gap_256.sbb_e200_in12k_ft_in1k

Image Classification • Updated Feb 17 • 57 • 1
timm/vit_so150m2_patch16_reg1_gap_256.sbb_e200_in12k

Image Classification • Updated Feb 17 • 44 • 1
timm/vit_mediumd_patch16_reg4_gap_384.sbb2_e200_in12k_ft_in1k

Image Classification • Updated Jan 21 • 1.7k • 4
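
The timm-native model IDs listed on this page follow the `architecture.pretrained_tag` naming that timm uses for its multi-weight models, prefixed by the Hub organization. As a minimal sketch (the helper `parse_hub_model_id` and the `HubModelId` type are hypothetical names introduced here, not part of timm), an ID can be split into its parts with the standard library alone. OpenCLIP-style IDs such as `timm/PE-Core-B-16` have no tag, so the tag comes back as `None`.

```python
from typing import NamedTuple, Optional


class HubModelId(NamedTuple):
    """Parts of a Hub model ID (hypothetical container for illustration)."""
    org: Optional[str]             # Hub organization, e.g. "timm"
    architecture: str              # text before the first "."
    pretrained_tag: Optional[str]  # text after the first ".", if any


def parse_hub_model_id(model_id: str) -> HubModelId:
    """Split 'org/architecture.tag' into org, architecture and tag."""
    if "/" in model_id:
        org, _, name = model_id.partition("/")
    else:
        org, name = None, model_id
    architecture, _, tag = name.partition(".")
    return HubModelId(org, architecture, tag or None)


parsed = parse_hub_model_id(
    "timm/vit_so150m2_patch16_reg1_gap_256.sbb_e200_in12k_ft_in1k"
)
print(parsed.architecture)    # vit_so150m2_patch16_reg1_gap_256
print(parsed.pretrained_tag)  # sbb_e200_in12k_ft_in1k
```

The split is purely textual: it does not check that the architecture or tag exists in any installed timm version.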

MobileNetV4 pretrained weights

Weights for MobileNet-V4 pretrained in timm

timm/mobilenetv4_conv_aa_large.e230_r448_in12k_ft_in1k

Image Classification • Updated Jan 21 • 4.39k • 2
timm/mobilenetv4_conv_aa_large.e230_r384_in12k_ft_in1k

Image Classification • Updated Jan 21 • 995 • 1
timm/mobilenetv4_hybrid_large.ix_e600_r384_in1k

Image Classification • Updated Jan 21 • 335 • 5
timm/mobilenetv4_hybrid_large.e600_r384_in1k

Image Classification • Updated Jan 21 • 244 • 1

timm Top-20 Fastest Models

Not the most accurate, but the highest-throughput image classification models in timm.

timm/tinynet_e.in1k

Image Classification • Updated Jan 21 • 3.27k
timm/mobilenetv3_small_050.lamb_in1k

Image Classification • Updated Jan 21 • 16.5k
timm/lcnet_050.ra2_in1k

Image Classification • Updated Jan 21 • 5.04k
timm/mobilenetv3_small_075.lamb_in1k

Image Classification • Updated Jan 21 • 35.5k • 1

timm Takes on the Classics

timm includes the most popular convolutional and vision transformer models, many with new weights from updated training recipes.

timm/resnet50.a1_in1k

Image Classification • Updated Jul 11 • 7.22M • 39
timm/resnet50.a1h_in1k

Image Classification • Updated Jan 21 • 1.48k
timm/resnet50d.a1_in1k

Image Classification • Updated Jan 21 • 4.37k
timm/resnet101.a1h_in1k

Image Classification • Updated Jan 21 • 24.4k
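
Each entry above carries a summary line such as `Image Classification • Updated Jul 11 • 7.22M • 39`, with counts abbreviated using `k` and `M` suffixes. The sketch below shows one way to turn such a line into numbers for sorting or filtering. The helper names `parse_count` and `parse_card_line` are hypothetical, and the code deliberately returns the trailing numbers as an unlabeled list: reading them as downloads and likes is an assumption about the Hub UI, not something this page states.

```python
# Multipliers for the abbreviated counts that appear on this page.
_SUFFIXES = {"k": 1_000, "M": 1_000_000}


def parse_count(text: str) -> int:
    """Convert '7.22M', '3.19k' or '39' to an integer."""
    text = text.strip()
    if text and text[-1] in _SUFFIXES:
        return round(float(text[:-1]) * _SUFFIXES[text[-1]])
    return round(float(text))


def parse_card_line(line: str) -> dict:
    """Split a summary line on the bullet separator.

    Fields before 'Updated ...' vary (task label, optional parameter
    count), so the counts are taken as everything after that field.
    """
    fields = [f.strip() for f in line.split("•")]
    idx = next(i for i, f in enumerate(fields) if f.startswith("Updated"))
    return {
        "label": fields[0],
        "updated": fields[idx][len("Updated"):].strip(),
        "counts": [parse_count(f) for f in fields[idx + 1:]],
    }


card = parse_card_line("Image Classification • Updated Jul 11 • 7.22M • 39")
print(card["updated"])  # Jul 11
print(card["counts"])   # [7220000, 39]
```

Because the abbreviated figures are rounded for display, the parsed values are approximations of the underlying counts.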

Fastest timm models > 80% Top-1 ImageNet-1k

Fastest image classification models with over 80% top-1 accuracy on ImageNet-1k.

timm/levit_256.fb_dist_in1k

Image Classification • 19M • Updated Jul 29 • 23.6k • 1
timm/vit_base_patch32_clip_224.laion2b_ft_in1k

Image Classification • Updated Jan 21 • 155 • 1
timm/vit_base_patch32_clip_224.laion2b_ft_in12k_in1k

Image Classification • Updated Jan 21 • 2.26k • 3
timm/vit_base_patch32_clip_224.openai_ft_in1k

Image Classification • Updated Jan 21 • 252

Fastest timm models > 86% ImageNet-1k Top-1

Fastest image classification models with over 86% top-1 accuracy on ImageNet-1k.

timm/vit_base_patch16_clip_224.laion2b_ft_in12k_in1k

Image Classification • Updated Jan 21 • 3.19k • 2
timm/beitv2_base_patch16_224.in1k_ft_in22k_in1k

Image Classification • Updated Jan 21 • 3.17k
timm/convnext_base.clip_laion2b_augreg_ft_in12k_in1k

Image Classification • Updated Jan 21 • 183k
timm/convnext_base.clip_laion2b_augreg_ft_in1k

Image Classification • Updated Jan 21 • 478

timm Backbones

Pre-trained feature extraction backbones available in timm.

timm/vit_small_patch14_dinov2.lvd142m

Image Feature Extraction • Updated Jan 21 • 178k • 5
timm/vit_large_patch14_dinov2.lvd142m

Image Feature Extraction • Updated Jan 20 • 117k • 16
timm/vit_base_patch16_224.dino

Image Feature Extraction • Updated Jan 21 • 466k • 6
timm/vit_base_patch16_clip_224.openai

Image Feature Extraction • Updated Jan 21 • 152k • 10

Fine-Tune Image Classification Benchmark Datasets

Datasets for fine-tuning benchmarks and hyperparameter tuning. All vetted and tested with timm scripts.

timm/oxford-iiit-pet

Viewer • Updated Jan 7, 2024 • 7.35k • 6.11k • 6
timm/resisc45

Viewer • Updated Jan 7, 2024 • 31.5k • 8.16k • 5
timm/eurosat-rgb

Viewer • Updated Jan 7, 2024 • 27k • 3.48k • 1

Perception Encoder

OpenCLIP (PE Core image + text) and timm PE Core, Spatial, Lang (ViT only) weights. NOTE: These weights do not work with original modeling code.

timm/PE-Core-bigG-14-448

Zero-Shot Image Classification • Updated Jul 24 • 11.8k • 5
timm/PE-Core-L-14-336

Zero-Shot Image Classification • Updated Jul 24 • 6.27k • 3
timm/PE-Core-B-16

Zero-Shot Image Classification • Updated Jul 24 • 2.69k
timm/PE-Core-S-16-384

Zero-Shot Image Classification • Updated Jul 24 • 233

SigLIP 2

OpenCLIP and timm SigLIP 2 models

timm/ViT-gopt-16-SigLIP2-384

Zero-Shot Image Classification • Updated Feb 21 • 19.1k • 4
timm/ViT-gopt-16-SigLIP2-256

Zero-Shot Image Classification • Updated Feb 21 • 47
timm/ViT-SO400M-16-SigLIP2-512

Zero-Shot Image Classification • Updated Feb 21 • 760 • 6
timm/ViT-SO400M-16-SigLIP2-384

Zero-Shot Image Classification • Updated Feb 21 • 193k • 5

MetaCLIP

MetaCLIP & MetaCLIP2 OpenCLIP and timm models. All models are dual timm + OpenCLIP (or just timm for specific ViT encoders).

timm/vit_gigantic_patch14_clip_378.metaclip2_worldwide

Zero-Shot Image Classification • Updated Aug 1 • 143 • 2
timm/vit_gigantic_patch14_clip_224.metaclip2_worldwide

Zero-Shot Image Classification • Updated Aug 1 • 121 • 1
timm/vit_huge_patch14_clip_378.metaclip2_worldwide

Zero-Shot Image Classification • Updated Aug 1 • 111 • 1
timm/vit_huge_patch14_clip_224.metaclip2_worldwide

Zero-Shot Image Classification • Updated Aug 1 • 481 • 1

timm Top-20 ImageNet-1k Models

The 20 best models on ImageNet-1k validation set, all pretrained on datasets larger than ImageNet and fine-tuned on ImageNet-1k.

timm/eva02_large_patch14_448.mim_m38m_ft_in22k_in1k

Image Classification • Updated Jan 21 • 68k • 21
timm/eva02_large_patch14_448.mim_in22k_ft_in22k_in1k

Image Classification • Updated Jan 21 • 2.45k • 1
timm/eva_giant_patch14_560.m30m_ft_in22k_in1k

Image Classification • Updated Jan 21 • 313 • 3
timm/eva02_large_patch14_448.mim_m38m_ft_in1k

Image Classification • Updated Jan 21 • 274 • 14

timm ImageNet-12k Models

timm has a number of unique and exclusive models trained on an 11821-class (12k) subset of the full ImageNet-22k.

timm/convnext_xxlarge.clip_laion2b_soup_ft_in12k

Image Classification • Updated Jan 21 • 752 • 2
timm/vit_huge_patch14_clip_224.laion2b_ft_in12k

Image Classification • Updated Jan 21 • 146 • 1
timm/vit_large_patch14_clip_224.openai_ft_in12k

Image Classification • Updated Jan 21 • 98
timm/vit_large_patch14_clip_224.laion2b_ft_in12k

Image Classification • Updated Jan 21 • 100

Fastest timm models > 75.3% IN-1k Top-1 (Original ResNet-50)

Fastest image classification models with over 75.3% top-1 accuracy on ImageNet-1k.

timm/levit_128s.fb_dist_in1k

Image Classification • 7.82M • Updated Jul 29 • 1.7k • 2
timm/vit_small_patch32_224.augreg_in21k_ft_in1k

Image Classification • Updated Jan 21 • 484 • 2
timm/levit_128.fb_dist_in1k

Image Classification • 9.26M • Updated Jan 21 • 2.47k • 1
timm/efficientvit_m5.r224_in1k

Image Classification • Updated Jan 21 • 1.18k

Fastest timm models > 83% ImageNet-1k Top-1

Fastest image classification models with over 83% top-1 accuracy on ImageNet-1k.

timm/vit_base_patch32_clip_224.laion2b_ft_in12k_in1k

Image Classification • Updated Jan 21 • 2.26k • 3
timm/deit3_small_patch16_224.fb_in22k_ft_in1k

Image Classification • Updated Jan 21 • 5.17k
timm/tiny_vit_11m_224.dist_in22k_ft_in1k

Image Classification • Updated Jan 21 • 267
timm/tresnet_m.miil_in21k_ft_in1k

Image Classification • Updated Jan 21 • 802

Fastest timm models > 88% ImageNet-1k Top-1

Fastest image classification models with over 88% top-1 accuracy on ImageNet-1k.

timm/eva_large_patch14_196.in22k_ft_in22k_in1k

Image Classification • Updated Jan 21 • 10.9k • 3
timm/beitv2_large_patch16_224.in1k_ft_in22k_in1k

Image Classification • Updated Jan 21 • 2k • 2
timm/vit_large_patch14_clip_224.openai_ft_in12k_in1k

Image Classification • Updated Jan 21 • 2.76k • 38
timm/convnext_large_mlp.clip_laion2b_soup_ft_in12k_in1k_384

Image Classification • Updated Jan 21 • 1.08k • 3

All the ImageNets

Noteworthy instances of ImageNet on the Hub. Vetted and tested with timm train and validation scripts.

ILSVRC/imagenet-1k

Viewer • Updated Sep 17 • 1.43M • 63.9k • 639
timm/imagenet-1k-wds

Viewer • Updated Jan 7, 2024 • 80.7k • 7.06k • 30
timm/imagenet-w21-wds

Viewer • Updated Jan 7, 2024 • 60.1k • 1.19k • 5
timm/imagenet-w21-webp-wds

Viewer • Updated Jan 7, 2024 • 113k • 783 • 3

timm tiny test models

A collection of very small (~300-500k parameter) models at 160x160 resolution, for testing purposes. Trained on ImageNet-1k.

timm/test_byobnet.r160_in1k

Image Classification • Updated Jan 21 • 946 • 1
timm/test_convnext.r160_in1k

Image Classification • Updated Jan 21 • 377
timm/test_convnext2.r160_in1k

Image Classification • Updated Jan 21 • 373
timm/test_convnext3.r160_in1k

Image Classification • Updated Jan 21 • 370

Team members 4
