4 results found

4.9k
33.4k
apache-2.0
320
The largest collection of PyTorch image encoders / backbones, including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Created 2019-02-02
2,705 commits to main branch, last one 10 days ago
Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
Created 2024-07-02
22 commits to main branch, last one 8 months ago