38 results found Sort:
- Filter by Primary Language:
- Python (28)
- Jupyter Notebook (2)
- OpenEdge ABL (1)
- TeX (1)
- +
YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
Created
2023-12-15
33 commits to main branch, last one 8 months ago
Foundation Architecture for (M)LLMs
Created
2022-11-17
123 commits to main branch, last one 8 months ago
A curated list of pretrained sentence and word embedding models
This repository has been archived
(exclude archived)
Created
2018-12-10
200 commits to master branch, last one 3 years ago
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Created
2021-10-14
37 commits to main branch, last one about a year ago
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
Created
2022-02-14
160 commits to main branch, last one 3 months ago
Summarization Papers
This repository has been archived
(exclude archived)
Created
2020-10-14
457 commits to main branch, last one about a year ago
中文法律LLaMA (LLaMA for Chinese legel domain)
Created
2023-04-12
40 commits to main branch, last one 3 months ago
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, informati...
Created
2017-07-10
641 commits to master branch, last one 3 years ago
Code associated with the Don't Stop Pretraining ACL 2020 paper
Created
2020-04-09
61 commits to master branch, last one 3 years ago
Live Training for Open-source Big Models
Created
2022-05-21
525 commits to master branch, last one about a year ago
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
Created
2023-02-21
190 commits to main branch, last one 8 months ago
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Created
2022-11-29
13 commits to main branch, last one about a year ago
MWPToolkit is an open-source framework for math word problem(MWP) solvers.
Created
2021-01-26
564 commits to master branch, last one 2 years ago
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Created
2023-05-31
26 commits to main branch, last one about a year ago
Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合
Created
2019-11-02
96 commits to master branch, last one 3 years ago
EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.
Created
2023-10-08
14 commits to main branch, last one about a year ago
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Created
2021-10-21
33 commits to main branch, last one 2 years ago
BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)
Created
2023-02-05
71 commits to master branch, last one 6 months ago
On Transferability of Prompt Tuning for Natural Language Processing
Created
2021-05-29
689 commits to main branch, last one 7 months ago
Bamboo-7B Large Language Model
Created
2024-03-25
35 commits to main branch, last one 8 months ago
The official code for "TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting (ICLR 2024)". TEMPO is one of the very first open source Time Series Foundation Models for fo...
Created
2024-04-01
35 commits to main branch, last one 21 days ago
[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
Created
2022-01-29
9 commits to main branch, last one 2 years ago
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
llms
gpt-3
gpt-4
chatgpt
reasoning
semantics
llms-reasoning
chain-of-thought
self-consistency
factual-consistency
logical-consistency
semantics-preserving
semantics-consistency
hypothetical-consistency
compositional-consistency
pretrained-language-model
self-consistency-learning
self-consistency-benchmark
self-consistent-generation
Created
2023-10-08
68 commits to main branch, last one 4 months ago
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
Created
2020-08-12
32 commits to master branch, last one 2 years ago
Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty
Created
2023-05-17
21 commits to main branch, last one 8 months ago
[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Created
2022-02-10
15 commits to main branch, last one 2 years ago
Implementation of "TransPolymer: a Transformer-based language model for polymer property predictions" in PyTorch
Created
2022-08-31
40 commits to master branch, last one about a year ago
Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces
Created
2021-03-17
12 commits to main branch, last one 3 years ago
ELECTRA기반 한국어 대화체 언어모델
Created
2021-04-13
40 commits to master branch, last one 3 years ago
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]
Created
2022-03-02
25 commits to main branch, last one 2 years ago