Search Results - RepositoryStats

248

3.8k

other

34

🪩 Create Disco Diffusion artworks in one line

dalle imgen prompts diffusion midjourney multimodal creative-ai cross-modal creative-art discodiffusion generative-art disco-diffusion latent-diffusion stable-diffusion clip-guided-diffusion

Created 2022-06-30

385 commits to main branch, last one about a year ago

docarray docarray

233

3.0k

apache-2.0

45

Represent, send, store and search multimodal data

qdrant fastapi pytorch docarray protobuf pydantic weaviate dataclass multimodal cross-modal multi-modal nested-data deep-learning elasticsearch neural-search data-structures semantic-search machine-learning nearest-neighbor-search

Created 2021-12-14

1,467 commits to main branch, last one 24 days ago

knowledge-graphs shaoxiongji

295

1.7k

unknown

61

A collection of research on knowledge graphs

ner paper survey reasoning commonsense cross-modal knowledge-graph dialogue-systems question-answering relation-extraction information-retrieval recommendation-systems representation-learning meta-relational-learning temporal-knowledge-graph knowledge-graph-completion natural-language-processing

Created 2019-01-02

73 commits to master branch, last one 2 years ago

awesome-audio-visual krantiparida

68

709

unknown

17

A curated list of different papers and datasets in various areas of audio-visual processing

awesome cross-modal mutli-modal audio-visual awesome-list localization source-separation

Created 2019-03-30

63 commits to master branch, last one about a year ago

SCAN kuanghuei

115

559

apache-2.0

9

PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)

pytorch cross-modal deep-learning neural-network computer-vision visual-semantic image-captioning

Created 2018-05-11

19 commits to master branch, last one 2 years ago

examples towhee-io

118

484

apache-2.0

6

Analyze unstructured data with Towhee: reverse image search, reverse video search, audio classification, question answering systems, molecular search, etc.

nlp embeddings cross-modal video-tagging machine-learning audio-classification image-classification

Created 2022-04-11

401 commits to main branch, last one about a year ago

CMG haihuangcode

6

215

unknown

3

The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)

multimodal cross-modal pretrained-models cross-modal-generalization

Created 2023-10-24

29 commits to master branch, last one 3 months ago

SOLC yisun98

27

214

unknown

1

Remote Sensing SAR-Optical Land-Use Classification in PyTorch: high-resolution remote sensing semantic segmentation / ground-object segmentation / land-cover classification

pytorch oa-kappa deeplabv3 cross-modal multi-modal sar-optical multi-source segmentation remote-sensing land-use-classification

Created 2022-06-01

86 commits to main branch, last one 11 months ago

RIM JizhiziLi

13

207

unknown

22

[CVPR 2023] Referring Image Matting

matting multimodal cross-modal image-matting image-segmentation

Created 2022-06-12

6 commits to master branch, last one about a year ago

MoTIS DRSY

10

124

unknown

4

[NAACL 2022] Mobile Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP)

ai knn lsh clip naacl k-means ios-swift retrieval cross-modal image-search vector-search semantic-search random-projection k-means-clustering knowledge-distillation

Created 2021-08-07

132 commits to main branch, last one about a year ago

BioT5 QizhiPei

5

111

mit

3

BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)

nlp cross-modal bioinformatics machine-learning nlp-applications computational-biology

Created 2023-10-11

60 commits to main branch, last one 7 months ago

Weakly-Supervised-3D-Object-Detection Zengyi-Qin

18

106

mit

6

Weakly Supervised 3D Object Detection from Point Clouds (VS3D), ACM MM 2020

vs3d ws3d kitti lidar stereo monocular tensorflow acm-mm-2020 cross-modal point-cloud object-proposals transfer-learning 3d-object-detection unsupervised-learning weakly-supervised-detection unsupervised-object-detection

Created 2020-07-28

10 commits to master branch, last one 4 years ago

distill-bev qcraftai

6

99

unknown

5

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

bev lidar nuscenes cross-modal multi-modal point-cloud distillation multi-camera self-driving autonomous-driving 3d-object-detection knowledge-distillation

Created 2023-09-25

19 commits to main branch, last one about a year ago

VLTVG yangli18

9

95

unknown

2

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022

cross-modal vision-language visual-grounding visual-linguistic

Created 2022-04-29

5 commits to master branch, last one 2 years ago

DSRAN kywen1119

12

72

apache-2.0

3

Code for journal paper "Learning Dual Semantic Relations with Graph Attention for Image-Text Matching", TCSVT, 2020.

tcsvt pytorch cross-modal computer-vision image-text-matching

Created 2020-10-22

60 commits to main branch, last one 3 years ago

Multimodality-Representation-Learning marslanm

7

72

unknown

8

This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have been cited and discussed in the survey just accepted https://dl....

cross-modal multimodal-pretext transformer-models multimodal-datasets multimodal-applications multimodal-deep-learning vision-language-pretraining multimodal-pre-trained-model

Created 2022-03-13

66 commits to main branch, last one about a year ago

UniPT Paranioar

1

67

apache-2.0

1

[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"

cross-modal transfer-learning memory-efficient-tuning memory-efficient-learning parameter-efficient-tuning parameter-efficient-learning parameter-efficient-fine-tuning

Created 2023-08-28

17 commits to main branch, last one 6 months ago

UPIDet Eaphan

7

61

apache-2.0

5

Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]

cross-modal multi-modal 3d-object-detection

Created 2023-01-20

13 commits to main branch, last one 10 months ago

Xmodal-Ctx GT-RIPL

10

60

unknown

2

Official PyTorch implementation of our CVPR 2022 paper: Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning

clip cross-modal image-captioning vision-and-language

Created 2022-05-09

3 commits to main branch, last one 2 years ago