1 result found Sort:

Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
Created 2024-05-29
33 commits to main branch, last one 4 days ago