Search Results - RepositoryStats

1.0k

10.4k

bsd-3-clause

96

LAVIS - A One-stop Library for Language-Vision Intelligence

salesforce deep-learning image-captioning vision-framework multimodal-datasets vision-and-language deep-learning-library multimodal-deep-learning visual-question-anwsering vision-language-pretraining vision-language-transformer

Created 2022-08-24

492 commits to main branch, last one 4 months ago

FinRobot AI4Finance-Foundation

492

3.0k

apache-2.0

48

FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀

fingpt aiagent chatgpt finance robo-advisor prompt-engineering large-language-models multimodal-deep-learning

Created 2024-02-27

269 commits to master branch, last one 4 months ago

Awesome-Text-to-Image Yutong-Zhou-cv

199

2.3k

mit

75

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

survey multimodal awseome-list text-to-face text-to-image image-synthesis image-generation image-manipulation multimodal-deep-learning generative-adversarial-network

Created 2020-10-13

614 commits to 2024-Version-2.0 branch, last one about a month ago

Time-LLM KimMeen

329

1.9k

apache-2.0

21

[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"

time-series deep-learning prompt-tuning cross-modality language-model machine-learning cross-modal-learning time-series-analysis time-series-forecast large-language-models multimodal-time-series time-series-forecasting multimodal-deep-learning

Created 2024-01-20

39 commits to main branch, last one 4 months ago

BitNet kyegomez

160

1.8k

mit

43

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

gpt4 multimodal deeplearning machine-learning deep-neural-networks artificial-intelligence multimodal-deep-learning

Created 2023-10-18

172 commits to main branch, last one 9 months ago

AdvancedLiterateMachinery AlibabaResearch

190

1.7k

apache-2.0

40

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

Created 2022-09-28

69 commits to main branch, last one 3 months ago

CVPR2024-Papers-with-Code-Demo DWCTOD

150

1.3k

apache-2.0

27

收集 CVPR 最新的成果，包括论文、代码和demo视频等，欢迎大家推荐！Collect the latest CVPR (Conference on Computer Vision and Pattern Recognition) results, including papers, code, and demo videos, etc., and welcome recommendations...

llm cvpr cvpr2021 cvpr2022 cvpr2023 cvpr2024 segmentation computer-vision object-detection segment-anything multimodal-deep-learning

Created 2021-03-13

19 commits to main branch, last one 11 months ago

pytorch-widedeep jrzaurin

196

1.3k

apache-2.0

23

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

text images python pytorch model-hub pytorch-cv pytorch-nlp tabular-data deep-learning pytorch-tabular-data pytorch-transformers multimodal-deep-learning

Created 2017-10-21

945 commits to master branch, last one about a month ago

awesome-vision-language-pretraining-papers yuewang-cuhk

104

1.2k

unknown

51

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

bert vl-ptms pretraining vision-and-language multimodal-deep-learning

Created 2020-03-25

38 commits to master branch, last one 3 years ago

awesome-grounding TheShadow29

99

1.1k

mit

29

awesome grounding: A curated list of research papers in visual grounding

arxiv paper papers grounding awesome-list paper-roadmap embodied-agent computer-vision image-grounding video-grounding phrase-grounding visual-grounding captioning-images captioning-videos language-grounding video-understanding multimodal-deep-learning natural-language-processing

Created 2018-09-03

97 commits to master branch, last one about a year ago

multimodal-deep-learning declare-lab

156

817

mit

7

This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.

multimodal-learning multimodal-interactions multimodal-deep-learning multimodal-sentiment-analysis

Created 2021-08-28

95 commits to main branch, last one 2 years ago

awesome-multimodal-in-medical-imaging richard-peng-xia

66

703

mit

17

A collection of resources on applications of multi-modal learning in medical imaging.

medical-imaging multimodal-learning large-language-models large-multimodal-models multimodal-deep-learning medical-report-generation visual-question-answering multimodal-large-language-models

Created 2022-07-13

158 commits to main branch, last one about a month ago

blended-latent-diffusion omriav

37

594

mit

47

Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]

pytorch diffusion multimodal deep-learning text-to-image computer-vision diffusion-models generative-model image-generation text-driven-editing text-to-image-synthesis multimodal-deep-learning text-guided-manipulation

Created 2022-06-06

10 commits to master branch, last one 9 months ago

MMMU MMMU-Benchmark

33

405

apache-2.0

3

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

llm llms stem evaluation multimodal deep-learning multimodality computer-vision machine-learning foundation-models question-answering multimodal-learning deep-neural-networks large-language-models large-multimodal-models multimodal-deep-learning visual-question-answering natural-language-processing

Created 2023-11-23

147 commits to main branch, last one 20 days ago

Awesome-Parameter-Efficient-Transfer-Learning jianghaojun

25

401

mit

19

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

deep-learning computer-vision machine-learning transfer-learning multimodal-deep-learning parameter-efficient-tuning parameter-efficient-learning

Created 2022-12-22

66 commits to main branch, last one 6 months ago

Med-PaLM kyegomez

52

372

mit

7

Towards Generalist Biomedical AI

gpt4 biomedical multimodal opensource deep-learning multimodality multimodal-deep-learning

Created 2023-07-31

118 commits to main branch, last one about a year ago

scarches theislab

56

358

bsd-3-clause

11

Reference mapping for single-cell genomics

scrna-seq multiomics single-cell deep-learning batch-correction data-integration human-cell-atlas rna-seq-analysis single-cell-genomics multimodal-deep-learning

Created 2019-08-12

1,176 commits to master branch, last one about a month ago

Recommendation-Systems-without-Explicit-ID-Features-A-Literature-Review westlake-repl

27

345

unknown

11

Paper List of Pre-trained Foundation Recommender Models

Created 2023-06-25

197 commits to main branch, last one 7 months ago

content-moderation-deep-learning fcakyon

19

339

mit

5

Deep learning based content moderation from text, audio, video & image input modalities.

movie-trailer content-ratings nsfw-recognition nudity-detection content-moderation violence-detection profanity-detection genre-classification movie-content-filter multimodal-deep-learning

Created 2022-09-22

47 commits to main branch, last one 3 months ago

MUStARD soujanyaporia

62

337

mit

8

Multimodal Sarcasm Detection Dataset

sarcasm sarcasm-detection multimodal-interactions multimodal-deep-learning

Created 2019-02-20

82 commits to master branch, last one 7 months ago

VQASynth remyxai

12

322

unknown

7

Compose multimodal datasets 🎹

data-pipeline data-processing dataset-generation multimodal-datasets multimodal-deep-learning synthetic-dataset-generation

Created 2024-02-17

139 commits to main branch, last one 11 days ago

Awesome-Multimodality Yutong-Zhou-cv

22

322

unknown

12

A Survey on multimodal learning research.

awesome-list multimodality multimodal-deep-learning

Created 2021-09-20

79 commits to main branch, last one about a year ago

CLoT sail-sg

15

308

unknown

8

CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".

association leap-of-thought humor-generation large-language-models multimodal-deep-learning

Created 2023-12-01

33 commits to main branch, last one 11 months ago

awesome-Vision-and-Language-Pre-training phellonchen

16

293

apache-2.0

11

Recent Advances in Vision and Language Pre-training (VLP)

vlp pretraining vision-and-language multimodal-deep-learning vision-and-language-pre-training

Created 2021-09-14

56 commits to main branch, last one about a year ago

multimodal-ml-music ilaria-manco

11

292

mit

14

List of academic resources on Multimodal ML for Music

music-ai resources awesome-list music-research multimodal-data multimodal-learning academic-publications multimodal-deep-learning music-information-retrieval

Created 2022-12-29

11 commits to main branch, last one 2 years ago

ECCV2022-Papers-with-Code-Demo DWCTOD

23

286

unknown

7

收集 ECCV 最新的成果，包括论文、代码和demo视频等，欢迎大家推荐！

ai cv eccv nerf dataset eccv2022 diffusion computer-vision face-recognition image-segmentation vision-transformer objection-detection multimodal-deep-learning

Created 2022-07-04

48 commits to main branch, last one 2 years ago