36 results found

482
6.8k
apache-2.0
58
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
Created 2023-07-06
245 commits to main branch, last one about a month ago
283
2.7k
apache-2.0
13
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Created 2023-09-21
173 commits to main branch, last one 7 months ago
161
1.6k
apache-2.0
18
[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Created 2024-08-12
31 commits to main branch, last one 5 months ago
79
830
mit
8
LongBench v2 and LongBench (ACL 2024)
Created 2023-07-29
79 commits to main branch, last one 2 months ago
53
697
apache-2.0
6
Large Context Attention
Created 2023-06-01
56 commits to main branch, last one 2 months ago
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
Created 2023-05-15
36 commits to main branch, last one 3 months ago
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
Created 2024-02-14
221 commits to main branch, last one 5 months ago
33
486
apache-2.0
12
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Created 2024-08-31
55 commits to main branch, last one 3 months ago
32
447
apache-2.0
13
LLM KV cache compression made easy
Created 2024-11-06
33 commits to main branch, last one 17 days ago
Implementation of Recurrent Memory Transformer, NeurIPS 2022 paper, in PyTorch
Created 2023-04-24
71 commits to main branch, last one 2 months ago
34
348
mit
16
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Created 2024-03-03
41 commits to main branch, last one 11 months ago
Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
Created 2023-11-22
76 commits to main branch, last one 6 months ago
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
Created 2024-04-15
81 commits to main branch, last one 11 months ago
29
267
other
15
✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
Created 2024-12-14
46 commits to main branch, last one 17 days ago
20
249
apache-2.0
8
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
Created 2024-01-27
46 commits to main branch, last one 3 months ago
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Created 2024-04-04
17 commits to main branch, last one 7 months ago
awesome llm plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimodal, etc.
Created 2023-11-12
251 commits to main branch, last one 8 days ago
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
Created 2023-11-02
26 commits to main branch, last one 6 months ago
Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"
Created 2024-04-18
23 commits to main branch, last one 8 months ago
LongQLoRA: Extend Context Length of LLMs Efficiently
Created 2023-10-22
25 commits to master branch, last one about a year ago
9
155
apache-2.0
3
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Created 2024-10-22
13 commits to main branch, last one 5 months ago
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
Created 2023-05-31
19 commits to main branch, last one 6 months ago
The official repo for "LLoCo: Learning Long Contexts Offline"
Created 2024-04-12
4 commits to main branch, last one 9 months ago
5
114
unknown
2
[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents.
Created 2024-06-05
53 commits to main branch, last one 4 months ago
Implementation of Infini-Transformer in PyTorch
Created 2024-05-01
47 commits to main branch, last one 3 months ago
14
101
unknown
6
[EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
Created 2024-10-03
4 commits to main branch, last one 2 months ago
Implementation of Perceiver AR, DeepMind's new long-context attention network based on Perceiver architecture, in PyTorch
Created 2022-06-18
20 commits to main branch, last one 2 years ago
Counting-Stars (★)
Created 2024-03-13
192 commits to main branch, last one 7 months ago
Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
Created 2024-08-19
14 commits to master branch, last one about a month ago
6
56
apache-2.0
0
WritingBench: A Comprehensive Benchmark for Generative Writing
Created 2025-03-10
13 commits to main branch, last one 17 days ago