Leaderboard

Below is the leaderboard for our competition, including our baseline FastCLIP.

Name Model # Params (M) Resolution Dataset Dataset Size (B) Batch Size (K) Samples Seen (B) ImageNet-1K Top 1 DataComp Average Weights
EVA-CLIP ViT-E/14 5045 224 LAION-2B 2 144 9 81.97 69.30 Huggingface
EVA-CLIP ViT-E/14 4705 224 LAION-2B 2 115 4 81.96 66.66 Huggingface
OpenCLIP ViT-bigG/14 2540 224 LAION-2B 2 160 36 80.08 66.65 Huggingface
CLIPA ViT-bigG/14 2518 336 Datacomp-1B 1.4 64 13.4 83.07 68.44 Huggingface
CLIPA ViT-bigG/14 2517 224 Datacomp-1B 1.4 64 13.3 82.67 68.26 Huggingface
OpenCLIP ViT-g/14 1367 224 LAION-2B 2 86.7 34.5 78.48 64.26 Huggingface
OpenCLIP ViT-g/14 1367 224 LAION-2B 2 42 13 76.66 62.87 Huggingface
EVA-CLIP ViT-g/14 1367 224 Merged-2B 2 114 11 79.32 66.05 Huggingface
OpenCLIP ConvNeXt-XXL 1201 256 LAION-2B 2 82 34 79.46 65.11 Huggingface
OpenCLIP ConvNeXt-XXL 1201 256 LAION-2B 2 82 34 79.32 64.96 Huggingface
OpenCLIP ConvNeXt-XXL 1201 256 LAION-2B 2 82 34 79.07 64.93 Huggingface
OpenCLIP ViT-H/14 1193 224 LAION-5B 5 90 13 76.97 65.15 Huggingface
EVA-CLIP ViT-g/14 1136 224 LAION-400M 0.4 41 11 78.52 63.39 Huggingface
DFN ViT-H/14 987 378 DFN-5B 5 78 44 84.35 70.79 Huggingface
DFN ViT-H/14 986 224 DFN-5B 5 78 39 83.43 69.62 Huggingface
MetaCLIP ViT-H/14 986 224 MetaCLIP-2.5B 2.5 32 12.8 80.53 66.72 Meta
OpenCLIP ViT-H/14 986 224 LAION-2B 2 77.25 34 77.92 64.02 Huggingface
CLIPA ViT-H/14 969 336 Datacomp-1B 1.4 64 13.4 81.81 66.77 Huggingface
CLIPA ViT-H/14 969 336 LAION-2B 2 64 13.4 79.10 64.39 Huggingface
CLIPA ViT-H/14 968 224 Datacomp-1B 1.4 64 13.3 81.50 66.53 Huggingface
ViTamin ViTamin-XL 925 336 Datacomp-1B 1.4 90 41 82.67 68.12 Huggingface
ViTamin ViTamin-XL 925 256 Datacomp-1B 1.4 90 40 82.25 67.66 Huggingface
ViTamin ViTamin-XL 925 384 Datacomp-1B 1.4 90 41 81.39 66.38 Huggingface
SigLIP ViT-SO400M/14 878 384 WebLI 10 32 45 83.09 69.21 Huggingface
SigLIP ViT-SO400M/14 877 256 WebLI 10 32 40 82.04 68.07 Huggingface