Search Results - RepositoryStats

20

218

mit

7

Implementation of Block Recurrent Transformer - Pytorch

memory recurrence deep-learning attention-mechanisms long-context-attention artificial-intelligence long-context-transformers

Created 2023-02-07

65 commits to main branch, last one 7 months ago

3

72

mit

4

[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

position-encoding long-context-transformers

Created 2024-10-24

22 commits to main branch, last one 4 months ago

1

31

apache-2.0

3

This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"

llm llms benchmark evaluation multimodal deep-learning multimodality computer-vision machine-learning foundation-models deep-neural-networks large-language-models long-context-modeling large-multimodal-models long-context-transformers visual-question-answering natural-language-processing multimodal-large-language-models

Created 2024-04-12

17 commits to main branch, last one 9 months ago