Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks Weicheng Ma author Kai Zhang author Renze Lou author Lili Wang author Soroush Vosoughi author 2021-08 text Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) Chengqing Zong editor Fei Xia editor Wenjie Li editor Roberto Navigli editor Association for Computational Linguistics Online conference publication ma-etal-2021-contributions 10.18653/v1/2021.acl-long.152 https://aclanthology.org/2021.acl-long.152/ 2021-08 1956 1966