Below is the leaderboard for our competition, including our baseline FastCLIP.
Name | Model | # Params (M) | Resolution | Dataset | Dataset Size (B) | Batch Size (K) | Samples Seen (B) | ImageNet-1K Top 1 | DataComp Average | Weights |
---|---|---|---|---|---|---|---|---|---|---|
EVA-CLIP | ViT-E/14 | 5045 | 224 | LAION-2B | 2 | 144 | 9 | 81.97 | 69.30 | Huggingface |
EVA-CLIP | ViT-E/14 | 4705 | 224 | LAION-2B | 2 | 115 | 4 | 81.96 | 66.66 | Huggingface |
OpenCLIP | ViT-bigG/14 | 2540 | 224 | LAION-2B | 2 | 160 | 36 | 80.08 | 66.65 | Huggingface |
CLIPA | ViT-bigG/14 | 2518 | 336 | Datacomp-1B | 1.4 | 64 | 13.4 | 83.07 | 68.44 | Huggingface |
CLIPA | ViT-bigG/14 | 2517 | 224 | Datacomp-1B | 1.4 | 64 | 13.3 | 82.67 | 68.26 | Huggingface |
OpenCLIP | ViT-g/14 | 1367 | 224 | LAION-2B | 2 | 86.7 | 34.5 | 78.48 | 64.26 | Huggingface |
OpenCLIP | ViT-g/14 | 1367 | 224 | LAION-2B | 2 | 42 | 13 | 76.66 | 62.87 | Huggingface |
EVA-CLIP | ViT-g/14 | 1367 | 224 | Merged-2B | 2 | 114 | 11 | 79.32 | 66.05 | Huggingface |
OpenCLIP | ConvNeXt-XXL | 1201 | 256 | LAION-2B | 2 | 82 | 34 | 79.46 | 65.11 | Huggingface |
OpenCLIP | ConvNeXt-XXL | 1201 | 256 | LAION-2B | 2 | 82 | 34 | 79.32 | 64.96 | Huggingface |
OpenCLIP | ConvNeXt-XXL | 1201 | 256 | LAION-2B | 2 | 82 | 34 | 79.07 | 64.93 | Huggingface |
OpenCLIP | ViT-H/14 | 1193 | 224 | LAION-5B | 5 | 90 | 13 | 76.97 | 65.15 | Huggingface |
EVA-CLIP | ViT-g/14 | 1136 | 224 | LAION-400M | 0.4 | 41 | 11 | 78.52 | 63.39 | Huggingface |
DFN | ViT-H/14 | 987 | 378 | DFN-5B | 5 | 78 | 44 | 84.35 | 70.79 | Huggingface |
DFN | ViT-H/14 | 986 | 224 | DFN-5B | 5 | 78 | 39 | 83.43 | 69.62 | Huggingface |
MetaCLIP | ViT-H/14 | 986 | 224 | MetaCLIP-2.5B | 2.5 | 32 | 12.8 | 80.53 | 66.72 | Meta |
OpenCLIP | ViT-H/14 | 986 | 224 | LAION-2B | 2 | 77.25 | 34 | 77.92 | 64.02 | Huggingface |
CLIPA | ViT-H/14 | 969 | 336 | Datacomp-1B | 1.4 | 64 | 13.4 | 81.81 | 66.77 | Huggingface |
CLIPA | ViT-H/14 | 969 | 336 | LAION-2B | 2 | 64 | 13.4 | 79.10 | 64.39 | Huggingface |
CLIPA | ViT-H/14 | 968 | 224 | Datacomp-1B | 1.4 | 64 | 13.3 | 81.50 | 66.53 | Huggingface |
ViTamin | ViTamin-XL | 925 | 336 | Datacomp-1B | 1.4 | 90 | 41 | 82.67 | 68.12 | Huggingface |
ViTamin | ViTamin-XL | 925 | 256 | Datacomp-1B | 1.4 | 90 | 40 | 82.25 | 67.66 | Huggingface |
ViTamin | ViTamin-XL | 925 | 384 | Datacomp-1B | 1.4 | 90 | 41 | 81.39 | 66.38 | Huggingface |
SigLIP | ViT-SO400M/14 | 878 | 384 | WebLI | 10 | 32 | 45 | 83.09 | 69.21 | Huggingface |
SigLIP | ViT-SO400M/14 | 877 | 256 | WebLI | 10 | 32 | 40 | 82.04 | 68.07 | Huggingface |