7 results found

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Created 2024-01-24
271 commits to main branch, last one 2 months ago
11
166
apache-2.0
4
Projects based on SigLIP (Zhai et al., 2023) and Hugging Face transformers integration 🤗
Created 2024-01-04
16 commits to main branch, last one about a year ago
2
90
apache-2.0
4
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Created 2024-10-04
6 commits to main branch, last one 3 months ago
Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".
Created 2024-07-26
3 commits to main branch, last one 22 days ago
Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion".
Created 2024-11-08
27 commits to main branch, last one 6 days ago
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
Created 2024-06-18
149 commits to main branch, last one 5 months ago
A minimal but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch
Created 2023-11-14
18 commits to main branch, last one 11 months ago