11 results found Sort:

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Created 2024-01-24
274 commits to main branch, last one about a month ago
12
223
apache-2.0
5
Projects based on SigLIP (Zhai et. al, 2023) and Hugging Face transformers integration 🤗
Created 2024-01-04
20 commits to main branch, last one about a month ago
8
113
apache-2.0
6
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning
Created 2024-07-31
32 commits to main branch, last one 2 hours ago
4
95
apache-2.0
3
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
Created 2024-10-04
6 commits to main branch, last one 5 months ago
Official repository of "TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models".
Created 2024-07-26
3 commits to main branch, last one 2 months ago
Inference and fine-tuning examples for vision models from 🤗 Transformers
Created 2025-01-20
38 commits to main branch, last one a day ago
本项目以应用为主出发,结合了从基础的机器学习、深度学习到目标检测以及目前最新的大模型,采用目前成熟的 第三方库、开源预训练模型以及相关论文的最新技术,目的是记录学习的过程同时也进行分享以供更多人可以直接进行使用。
Created 2020-09-06
127 commits to master branch, last one 23 days ago
Official PyTorch implementation of the WACV 2025 Oral paper "Composed Image Retrieval for Training-FREE DOMain Conversion".
Created 2024-11-08
28 commits to main branch, last one 5 days ago
Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
Created 2024-06-18
149 commits to main branch, last one 6 months ago
[ICLR 2025] - Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
Created 2025-01-28
3 commits to main branch, last one about a month ago
A minimal, but effective implementation of CLIP (Contrastive Language-Image Pretraining) in PyTorch
Created 2023-11-14
18 commits to main branch, last one about a year ago