8 results found Sort:

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Created 2024-01-24
274 commits to main branch, last one 9 days ago
12
192
apache-2.0
5
Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗
Created 2024-01-04
20 commits to main branch, last one a day ago
2
92
apache-2.0
3
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Created 2024-10-04
6 commits to main branch, last one 4 months ago
Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".
Created 2024-07-26
3 commits to main branch, last one about a month ago
本项目以应用为主出发,结合了从基础的机器学习、深度学习到目标检测以及目前最新的大模型,采用目前成熟的 第三方库、开源预训练模型以及相关论文的最新技术,目的是记录学习的过程同时也进行分享以供更多人可以直接进行使用。
Created 2020-09-06
117 commits to master branch, last one 5 days ago
Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion".
Created 2024-11-08
27 commits to main branch, last one 28 days ago
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
Created 2024-06-18
149 commits to main branch, last one 5 months ago
A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch
Created 2023-11-14
18 commits to main branch, last one about a year ago