Junjie Hu 0001 Sebastian Ruder Aditya Siddhant Graham Neubig Orhan Firat Melvin Johnson XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization. 2020 abs/2003.11080 CoRR https://arxiv.org/abs/2003.11080 db/journals/corr/corr2003.html#abs-2003-11080