Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond



New state-of-the-art results on cross-lingual transfer (XNLI, MLDoc) and bitext mining (BUCC), using a single encoder shared across 93 languages.



Link: https://arxiv.org/abs/1812.10464



#SOTA #NLP


