32 results found Sort:
- Filter by Primary Language:
- Python (30)
- Jupyter Notebook (1)
- +
Official release of InternLM2.5 base and chat models. 1M context support
Created
2023-07-06
235 commits to main branch, last one about a month ago
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Created
2023-09-21
173 commits to main branch, last one 4 months ago
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Created
2024-08-12
31 commits to main branch, last one about a month ago
LongBench v2 and LongBench (ACL 2024)
Created
2023-07-29
76 commits to main branch, last one a day ago
Large Context Attention
Created
2023-06-01
54 commits to main branch, last one 4 months ago
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
Created
2023-05-15
35 commits to main branch, last one 3 months ago
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch
Created
2024-02-14
221 commits to main branch, last one about a month ago
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Created
2024-08-31
54 commits to main branch, last one 5 days ago
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
Created
2023-04-24
69 commits to main branch, last one about a month ago
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Created
2024-03-03
41 commits to main branch, last one 8 months ago
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
Created
2023-11-22
76 commits to main branch, last one 2 months ago
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
Created
2024-04-15
81 commits to main branch, last one 7 months ago
LLM KV cache compression made easy
Created
2024-11-06
16 commits to main branch, last one 2 days ago
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Created
2024-04-04
17 commits to main branch, last one 3 months ago
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
Created
2024-01-27
46 commits to main branch, last one 5 days ago
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
Created
2023-11-02
26 commits to main branch, last one 2 months ago
awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.
Created
2023-11-12
220 commits to main branch, last one 6 days ago
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
Created
2024-04-18
23 commits to main branch, last one 4 months ago
LongQLoRA: Extent Context Length of LLMs Efficiently
Created
2023-10-22
25 commits to master branch, last one about a year ago
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Created
2024-10-22
13 commits to main branch, last one about a month ago
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
Created
2023-05-31
19 commits to main branch, last one 2 months ago
The official repo for "LLoCo: Learning Long Contexts Offline"
Created
2024-04-12
4 commits to main branch, last one 6 months ago
Implementation of Infini-Transformer in Pytorch
Created
2024-05-01
45 commits to main branch, last one 2 months ago
[NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of existing MLLMs to comprehend long multimodal documents.
Created
2024-06-05
53 commits to main branch, last one 26 days ago
Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch
Created
2022-06-18
20 commits to main branch, last one about a year ago
[EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
Created
2024-10-03
3 commits to main branch, last one about a month ago
Counting-Stars (★)
Created
2024-03-13
192 commits to main branch, last one 3 months ago
Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
Created
2024-08-19
13 commits to master branch, last one 3 months ago
My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hierarchical methods)
Created
2023-05-18
10 commits to main branch, last one about a year ago
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
Created
2024-04-10
53 commits to main branch, last one 8 months ago