8 results found

289
2.0k
apache-2.0
15
MTEB: Massive Text Embedding Benchmark
Created 2022-04-05
2,107 commits to main branch, last one a day ago
38
518
apache-2.0
6
Crosslingual Generalization through Multitask Finetuning
Created 2022-10-18
33 commits to master branch, last one 3 months ago
EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning,...
Created 2023-12-05
174 commits to main branch, last one 7 months ago
3
99
other
8
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023
Created 2023-05-15
4 commits to main branch, last one 8 months ago
This repo supports various cross-lingual transfer learning & multilingual NLP models.
Created 2019-08-31
32 commits to master branch, last one 2 years ago
[EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Created 2022-10-27
243 commits to main branch, last one 4 months ago
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
Created 2024-02-16
24 commits to main branch, last one 9 months ago
This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st An...
Created 2021-12-15
18 commits to main branch, last one 9 months ago