Search Results - RepositoryStats

YAYI2 wenge-research

19

3.6k

apache-2.0

7

YAYI 2 是中科闻歌研发的新一代开源大语言模型，采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

gpt chat yayi chinese artificial-intelligence pretrained-language-model natural-language-generation

This repository has been archived (exclude archived)

Created 2023-12-15

33 commits to main branch, last one about a year ago

torchscale microsoft

215

3.1k

mit

44

Foundation Architecture for (M)LLMs

multimodal transformer translation computer-vision machine-learning speech-processing pretrained-language-model natural-language-processing

Created 2022-11-17

123 commits to main branch, last one 12 months ago

awesome-sentence-embedding Separius

262

2.3k

gpl-3.0

77

A curated list of pretrained sentence and word embedding models

nlp bert awesome awesome-list cross-lingual wordembedding language-model subword-models word-embeddings embedding-models natural-language pretrained-models sentence-embeddings pretrained-embedding unsupervised-learning sentence-representations pretrained-language-model contextualized-representation

This repository has been archived (exclude archived)

Created 2018-12-10

200 commits to master branch, last one 3 years ago

P-tuning-v2 THUDM

202

2.0k

apache-2.0

29

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

p-tuning prompt-tuning pretrained-language-model natural-language-processing parameter-efficient-learning

Created 2021-10-14

37 commits to main branch, last one about a year ago

OpenDelta thunlp

82

1.0k

apache-2.0

19

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

nlp nlp-library deep-learning pretrained-language-model parameter-efficient-learning

Created 2022-02-14

160 commits to main branch, last one 6 months ago

Summarization-Papers xcfcode

145

1.0k

unknown

23

Summarization Papers

nlp chatgpt summarization text-generation pretrained-language-model natural-language-processing

This repository has been archived (exclude archived)

Created 2020-10-14

457 commits to main branch, last one about a year ago

lawyer-llama AndrewZhe

126

925

apache-2.0

11

中文法律LLaMA (LLaMA for Chinese legel domain)

llm nlp plm llama alpaca legal-ai pretrained-models large-language-models pretrained-language-model

Created 2023-04-12

40 commits to main branch, last one 7 months ago

NLP-Projects gaoisbest

151

545

unknown

22

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, informati...

word2vec sentence2vec knowledge-graph text-generation dialogue-systems network-embedding sequence-labeling text-classification information-retrieval information-extraction pretrained-language-model machine-reading-comprehension

Created 2017-07-10

641 commits to master branch, last one 4 years ago

dont-stop-pretraining allenai

73

529

unknown

8

Code associated with the Don't Stop Pretraining ACL 2020 paper

pretrained-language-model natural-language-processing

Created 2020-04-09

61 commits to master branch, last one 3 years ago

CPM-Live OpenBMB

40

507

unknown

20

Live Training for Open-source Big Models

nlp deep-learning multi-task-learning pretrained-language-model natural-language-generation natural-language-processing parameter-efficient-learning natural-language-understanding

Created 2022-05-21

525 commits to master branch, last one about a year ago

awesome-instruction-learning RenzeLou

25

488

mit

7

Papers and Datasets on Instruction Tuning and Following. ✨✨✨

prompt survey datasets paper-list instruction awesome-list instruction-tuning in-context-learning instruction-learning large-language-models pretrained-language-model

Created 2023-02-21

190 commits to main branch, last one about a year ago

Diffusion-BERT Hzfinfdu

24

307

apache-2.0

11

ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

bert text-generation diffusion-models conditional-generation unconditional-generation pretrained-language-model

Created 2022-11-29

13 commits to main branch, last one about a year ago

MWPToolkit LYH-YF

37

163

mit

3

MWPToolkit is an open-source framework for math word problem(MWP) solvers.

pytorch deep-learning graph-to-tree sequence-to-tree math-word-problem sequence-to-sequence pretrained-language-model

Created 2021-01-26

564 commits to master branch, last one 2 years ago

AttrPrompt yueyu1030

13

151

apache-2.0

3

[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.

attributed-text data-centric-ai zero-shot-learning text-classification large-language-models training-data-generation pretrained-language-model natural-language-processing

Created 2023-05-31

26 commits to main branch, last one about a year ago

ATPapers ZhengZixiang

13

133

unknown

6

Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合

bert awesome transformer attention-mechanism pretrained-language-model

Created 2019-11-02

96 commits to master branch, last one 4 years ago

awesome-refreshing-llms hyintell

10

132

mit

5

EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.

llm nlp llms paper review survey refreshing update-llm awesome-list knowledge-editing continual-learning large-language-models pretrained-language-model natural-language-processing retrieval-augmented-generation

Created 2023-10-08

14 commits to main branch, last one about a year ago

COCO-LM microsoft

13

118

mit

3

[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

pretraining transformers deep-learning language-model contrastive-learning representation-learning pretrained-language-model natural-language-processing natural-language-understanding

Created 2021-10-21

33 commits to main branch, last one 2 years ago

awesome-lifelong-learning-methods-for-llm zzz47zzz

6

117

unknown

3

[ACM Computing Surveys 2025] This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)

lifelong-learning continual-learning incremental-learning large-language-models pretrained-language-model

Created 2023-09-25

31 commits to main branch, last one 2 months ago

TEMPO DC-research

14

106

mit

2

The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for fo...

gpt forecasting time-series transformer transformers foundation-models pretrained-models forecasting-models transformers-models time-series-analysis forecasting-time-series pretrained-language-model

Created 2024-04-01

36 commits to main branch, last one about a month ago

BERT4ETH git-disl

20

102

unknown

5

BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)

bert www2023 ethereum blockchain transformer deanonymization fraud-detection phishing-detection pretrained-language-model

Created 2023-02-05

71 commits to master branch, last one 9 months ago

Prompt-Transferability thunlp

11

99

mit

6

On Transferability of Prompt Tuning for Natural Language Processing

nlp prompt pytorch prompt-tuning pretrained-models transfer-learning pretrained-language-model parameter-efficient-tuning pretrained-language-models parameter-efficient-learning

Created 2021-05-29

689 commits to main branch, last one 11 months ago

Awesome-LLM-Self-Consistency SuperBruceJia

7

96

mit

4

Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models

llms gpt-3 gpt-4 chatgpt reasoning semantics llms-reasoning chain-of-thought self-consistency factual-consistency logical-consistency semantics-preserving semantics-consistency hypothetical-consistency compositional-consistency pretrained-language-model self-consistency-learning self-consistency-benchmark self-consistent-generation

Created 2023-10-08

68 commits to main branch, last one 8 months ago

Bamboo SJTU-IPADS

1

92

apache-2.0

10

Bamboo-7B Large Language Model

llm powerinfer sparse-llm pretrained-models large-language-models pretrained-language-model

Created 2024-03-25

35 commits to main branch, last one about a year ago

TopClus yumeng5

11

87

apache-2.0

2

[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

clustering language-model topic-modeling topic-discovery pretrained-language-model

Created 2022-01-29

9 commits to main branch, last one 3 years ago

CODER GanjinZero

5

78

unknown

1

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

nlp umls medical embeddings multi-language pretrained-language-model

Created 2020-08-12

32 commits to master branch, last one 2 years ago

Scientific-Inspiration-Machines-Optimized-for-Novelty EagleW

11

77

apache-2.0

2

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

llm gpt4 acl2024 pytorch text-generation hypothesis-generation pretrained-language-model retrieval-augmented-generation

Created 2023-05-17

21 commits to main branch, last one 12 months ago

TransPolymer ChangwenXu98

21

68

mit

2

Implementation of "TransPolymer: a Transformer-based language model for polymer property predictions" in PyTorch

polymer pytorch transformer deep-learning self-supervised-learning pretrained-language-model

Created 2022-08-31

40 commits to master branch, last one about a year ago

SuperGen yumeng5

14

64

apache-2.0

2

[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding

text-generation zero-shot-learning text-classification pretrained-language-model natural-language-processing natural-language-understanding

Created 2022-02-10

15 commits to main branch, last one 2 years ago

PoincareProbe FranxYao

5

58

unknown

6

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

bert probing bertology bert-model hyperbolic probing-tasks bert-embeddings hyperbolic-geometry hyperbolic-embeddings pretrained-language-model

Created 2021-03-17

12 commits to main branch, last one 4 years ago

Awesome-LLMs-ICLR-24 azminewasi

3

57

mit

1

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.

llm llms llmops llm-agent llm-privacy llm-serving llm-security llm-training llm-framework llm-inference llm-prompting llm-evaluation pretrained-models pretrained-weights large-language-model large-language-models pretrained-language-model large-language-models-for-graph-learning large-language-models-and-translation-systems

Created 2024-03-18

5 commits to main branch, last one about a year ago