1 result found Sort:
Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
Created
2024-05-29
33 commits to main branch, last one 4 days ago