Statistics for topic image-captioning
RepositoryStats tracks 595,856 Github repositories, of these 82 are tagged with the image-captioning topic. The most common primary language for repositories using this topic is Python (50). Other languages include: Jupyter Notebook (19)
Stargazers over time for topic image-captioning
Most starred repositories for topic image-captioning (view more)
Trending repositories for topic image-captioning (view more)
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
LAVIS - A One-stop Library for Language-Vision Intelligence
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
LAVIS - A One-stop Library for Language-Vision Intelligence
LAVIS - A One-stop Library for Language-Vision Intelligence
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
a collection of computer vision projects&tools. 计算机视觉方向项目和工具集合。
a collection of computer vision projects&tools. 计算机视觉方向项目和工具集合。
Awesome radiology report generation and image captioning papers.
A list of awesome remote sensing image captioning resources
LAVIS - A One-stop Library for Language-Vision Intelligence
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
a collection of computer vision projects&tools. 计算机视觉方向项目和工具集合。
Awesome radiology report generation and image captioning papers.
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
LAVIS - A One-stop Library for Language-Vision Intelligence
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
Awesome radiology report generation and image captioning papers.
An Image captioning web application combines the power of React.js for front-end, Flask and Node.js for back-end, utilizing the MERN stack. Users can upload images and instantly receive automatic capt...