11 results found Sort:

4
225
unknown
4
[Paper][AAAI 2025] (MyGO)Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation
Created 2024-04-15
11 commits to main branch, last one 2 days ago
9
146
unknown
5
The official repository of Achelous and Achelous++
Created 2023-03-17
150 commits to main branch, last one 5 months ago
19
143
apache-2.0
4
Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!
Created 2023-05-22
48 commits to main branch, last one about a year ago
[IEEE TCYB 2023] The first large-scale tracking dataset by fusing RGB and Event cameras.
Created 2020-12-09
169 commits to main branch, last one 2 months ago
This repository contains the source code for our paper: "Husformer: A Multi-Modal Transformer for Multi-Modal Human State Recognition". For more details, please refer to our paper at https://arxiv.org...
Created 2022-08-26
152 commits to master branch, last one about a year ago
Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Zeta
Created 2024-01-21
14 commits to main branch, last one 11 months ago
Code for J. Wang, J. Li, Y. Shi, J. Lai and X. Tan, "AM3Net: Adaptive Mutual-learning-based Multimodal Data Fusion Network," in IEEE TCSVT, 2022. We conducted the experiments on the hyperspectral and ...
Created 2021-12-07
35 commits to main branch, last one about a year ago
Training for multi-modal image fusion with PyTorch.
Created 2022-08-10
16 commits to main branch, last one about a year ago
1
28
unknown
4
[Paper][SIGIR 2024] NativE: Multi-modal Knowledge Graph Completion in the Wild
Created 2024-03-28
8 commits to main branch, last one 4 months ago
The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"
Created 2023-10-02
28 commits to main branch, last one about a year ago