11 results found

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
Created 2022-09-09
49 commits to main branch, last one 2 years ago
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
Created 2022-11-04
38 commits to master branch, last one 7 months ago
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Created 2023-02-21
293 commits to main branch, last one about a month ago
T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!
Created 2024-03-28
139 commits to main branch, last one 3 months ago
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind
Created 2024-01-09
68 commits to main branch, last one 2 months ago
1-shot image segmentation using Stable Diffusion
Created 2023-09-27
22 commits to main branch, last one 8 months ago
Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text.
Created 2021-05-19
14 commits to main branch, last one 3 years ago
The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".
Created 2023-05-24
7 commits to main branch, last one 7 months ago
A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)
Created 2022-11-20
81 commits to main branch, last one about a year ago
[ITSC-2023] HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection
Created 2022-06-30
5 commits to master branch, last one about a year ago