83 results found Sort:

732
9.4k
unknown
141
The official GitHub page for the survey paper "A Survey of Large Language Models".
Created 2023-03-14
138 commits to main branch, last one about a month ago
525
2.9k
apache-2.0
75
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Created 2019-04-10
1,058 commits to master branch, last one about a month ago
203
2.1k
apache-2.0
19
ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Phi3-Vision, ...)
Created 2023-08-01
667 commits to main branch, last one a day ago
109
1.7k
apache-2.0
14
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
Created 2023-08-01
173 commits to main branch, last one a day ago
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
Created 2020-05-27
259 commits to master branch, last one 4 months ago
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
Created 2023-03-08
680 commits to main branch, last one 6 days ago
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
Created 2019-05-24
44 commits to main branch, last one 3 months ago
54
1.1k
unknown
20
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Created 2021-09-05
56 commits to main branch, last one about a year ago
247
1.0k
mit
25
Oscar and VinVL
Created 2020-05-14
28 commits to master branch, last one 10 months ago
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Created 2022-09-26
110 commits to main branch, last one about a month ago
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Created 2018-10-23
101 commits to master branch, last one 5 years ago
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
Created 2023-06-10
42 commits to main branch, last one about a month ago
109
772
unknown
18
Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
Created 2020-01-28
70 commits to master branch, last one 2 years ago
110
736
mit
14
Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
Created 2019-11-22
25 commits to master branch, last one 3 years ago
[ICML2024] Unified Training of Universal Time Series Forecasting Transformers
Created 2024-02-07
76 commits to main branch, last one 2 days ago
103
534
mit
10
[NeurIPS 2020] "Graph Contrastive Learning with Augmentations" by Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, Yang Shen
Created 2020-09-25
116 commits to master branch, last one about a year ago
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Created 2023-10-16
71 commits to main branch, last one 3 months ago
89
476
mit
12
Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"
Created 2020-06-26
97 commits to master branch, last one about a year ago
Large Language Model-enhanced Recommender System Papers
Created 2023-05-11
123 commits to main branch, last one 22 days ago
34
451
other
14
Multi-modality pre-training
Created 2022-03-15
104 commits to main branch, last one about a month ago
19
354
apache-2.0
7
Generative AI for Math: MathPile
Created 2023-11-27
30 commits to main branch, last one 3 days ago
Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.
Created 2021-03-03
16 commits to main branch, last one about a year ago
34
323
apache-2.0
5
Code for our SIGKDD'22 paper Pre-training-Enhanced Spatial-Temporal Graph Neural Network For Multivariate Time Series Forecasting.
Created 2022-02-08
71 commits to github branch, last one 6 months ago
53
322
mit
15
GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training @ KDD 2020
Created 2020-06-16
242 commits to master branch, last one 3 years ago
17
304
apache-2.0
10
Probing the representations of Vision Transformers.
Created 2022-03-12
36 commits to main branch, last one about a year ago
The repository of ET-BERT, a network traffic classification model on encrypted traffic. The work has been accepted as The Web Conference (WWW) 2022 accepted paper.
Created 2022-02-09
124 commits to main branch, last one 5 months ago
16
274
unknown
6
[CVPR2023] All in One: Exploring Unified Video-Language Pre-training
Created 2022-03-14
29 commits to main branch, last one about a year ago
14
266
apache-2.0
7
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Created 2022-07-30
72 commits to master branch, last one about a month ago
💐Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
Created 2021-03-05
153 commits to main branch, last one about a year ago