32 results found

463
6.6k
apache-2.0
59
Official release of InternLM2.5 base and chat models. 1M context support
Created 2023-07-06
235 commits to main branch, last one about a month ago
278
2.6k
apache-2.0
12
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Created 2023-09-21
173 commits to main branch, last one 4 months ago
144
1.5k
apache-2.0
17
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Created 2024-08-12
31 commits to main branch, last one about a month ago
59
704
mit
7
LongBench v2 and LongBench (ACL 2024)
Created 2023-07-29
76 commits to main branch, last one a day ago
53
657
apache-2.0
7
Large Context Attention
Created 2023-06-01
54 commits to main branch, last one 4 months ago
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
Created 2023-05-15
35 commits to main branch, last one 3 months ago
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
Created 2024-02-14
221 commits to main branch, last one about a month ago
33
430
apache-2.0
11
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Created 2024-08-31
54 commits to main branch, last one 5 days ago
Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in PyTorch
Created 2023-04-24
69 commits to main branch, last one about a month ago
29
312
mit
16
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Created 2024-03-03
41 commits to main branch, last one 8 months ago
Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
Created 2023-11-22
76 commits to main branch, last one 2 months ago
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
Created 2024-04-15
81 commits to main branch, last one 7 months ago
14
272
apache-2.0
6
LLM KV cache compression made easy
Created 2024-11-06
16 commits to main branch, last one 2 days ago
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Created 2024-04-04
17 commits to main branch, last one 3 months ago
15
230
apache-2.0
8
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
Created 2024-01-27
46 commits to main branch, last one 5 days ago
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
Created 2023-11-02
26 commits to main branch, last one 2 months ago
Awesome LLM Plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimodal, etc.
Created 2023-11-12
220 commits to main branch, last one 6 days ago
Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"
Created 2024-04-18
23 commits to main branch, last one 4 months ago
LongQLoRA: Extend Context Length of LLMs Efficiently
Created 2023-10-22
25 commits to master branch, last one about a year ago
6
140
apache-2.0
3
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Created 2024-10-22
13 commits to main branch, last one about a month ago
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
Created 2023-05-31
19 commits to main branch, last one 2 months ago
The official repo for "LLoCo: Learning Long Contexts Offline"
Created 2024-04-12
4 commits to main branch, last one 6 months ago
Implementation of Infini-Transformer in PyTorch
Created 2024-05-01
45 commits to main branch, last one 2 months ago
6
106
unknown
1
[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents.
Created 2024-06-05
53 commits to main branch, last one 26 days ago
Implementation of Perceiver AR, DeepMind's new long-context attention network based on the Perceiver architecture, in PyTorch
Created 2022-06-18
20 commits to main branch, last one about a year ago
11
86
unknown
6
[EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
Created 2024-10-03
3 commits to main branch, last one about a month ago
Counting-Stars (★)
Created 2024-03-13
192 commits to main branch, last one 3 months ago
Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
Created 2024-08-19
13 commits to master branch, last one 3 months ago
My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hierarchical methods)
Created 2023-05-18
10 commits to main branch, last one about a year ago
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
Created 2024-04-10
53 commits to main branch, last one 8 months ago