Search Results - RepositoryStats

VisualGPT Vision-CAIR

53

327

mit

13

VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

visualgpt image-caption data-efficient-image-caption

Created 2021-02-15

43 commits to main branch, last one 2 years ago

clip-gpt-captioning jmisilo

32

115

mit

2

CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.

cv nlp python pytorch deep-learning image-caption computer-vision image-captioning machine-learning image-caption-generator

Created 2022-09-25

103 commits to main branch, last one about a month ago

SCD-Net jianjieluo

5

60

other

1

[CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion model with additional semantic prior.

image-caption diffusion-model

Created 2022-12-05

8 commits to main branch, last one 9 months ago

image_caption_generator bhushan2311

6

37

unknown

1

An Image captioning web application combines the power of React.js for front-end, Flask and Node.js for back-end, utilizing the MERN stack. Users can upload images and instantly receive automatic capt...

Created 2023-02-02

16 commits to master branch, last one about a year ago

wd-llm-caption-cli fireicewolf

8

34

apache-2.0

3

A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.

wd14 qwen2-vl florence-2 joy-caption image-caption llama3-vision

Created 2024-09-01

23 commits to main branch, last one 5 months ago