DeepMistake at LSCDiscovery: Can a Multilingual Word-in-Context Model Replace Human Annotators?

Daniil Homskiy; Nikolay Arefyev

doi:10.18653/v1/2022.lchange-1.18

DeepMistake at LSCDiscovery: Can a Multilingual Word-in-Context Model Replace Human Annotators?

Abstract

In this paper we describe our solution of the LSCDiscovery shared task on Lexical Semantic Change Discovery (LSCD) in Spanish. Our solution employs a Word-in-Context (WiC) model, which is trained to determine if a particular word has the same meaning in two given contexts. We basically try to replicate the annotation of the dataset for the shared task, but replacing human annotators with a neural network. In the graded change discovery subtask, our solution has achieved the 2nd best result according to all metrics. In the main binary change detection subtask, our F1-score is 0.655 compared to 0.716 of the best submission, corresponding to the 5th place. However, in the optional sense gain detection subtask we have outperformed all other participants. During the post-evaluation experiments we compared different ways to prepare WiC data in Spanish for fine-tuning. We have found that it helps leaving only examples annotated as 1 (unrelated senses) and 4 (identical senses) rather than using 2x more examples including intermediate annotations.

Anthology ID:: 2022.lchange-1.18
Volume:: Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change
Month:: May
Year:: 2022
Address:: Dublin, Ireland
Editors:: Nina Tahmasebi, Syrielle Montariol, Andrey Kutuzov, Simon Hengchen, Haim Dubossarsky, Lars Borin
Venue:: LChange
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 173–179
Language:
URL:: https://aclanthology.org/2022.lchange-1.18/
DOI:: 10.18653/v1/2022.lchange-1.18
Bibkey:
Cite (ACL):: Daniil Homskiy and Nikolay Arefyev. 2022. DeepMistake at LSCDiscovery: Can a Multilingual Word-in-Context Model Replace Human Annotators?. In Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change, pages 173–179, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):: DeepMistake at LSCDiscovery: Can a Multilingual Word-in-Context Model Replace Human Annotators? (Homskiy & Arefyev, LChange 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.lchange-1.18.pdf
Video:: https://aclanthology.org/2022.lchange-1.18.mp4

PDF Cite Search Video Fix data