FReND: A French Resource of Negation Data

Hafida Le Cloirec - Ait Yahya, Olga Seminck, Pascal Amsili


Abstract
FReND is a freely available corpus of French language in which negations are hand-annotated. Negations are annotated by their cues and scopes. Comprising 590K tokens and over 8.9K negations, it is the largest dataset available for French. A variety of types of textual genres are covered: literature, blog posts, Wikipedia articles, political debates, clinical reports and newspaper articles. As the understanding of negation is not yet mastered by current state of the art AI-models, FReND is not only a valuable resource for linguistic research into negation, but also as training data for AI tasks such as negation detection.
Anthology ID:
2024.lrec-main.658
Volume:
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues:
LREC | COLING
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
7461–7468
Language:
URL:
https://aclanthology.org/2024.lrec-main.658
DOI:
Bibkey:
Cite (ACL):
Hafida Le Cloirec - Ait Yahya, Olga Seminck, and Pascal Amsili. 2024. FReND: A French Resource of Negation Data. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 7461–7468, Torino, Italia. ELRA and ICCL.
Cite (Informal):
FReND: A French Resource of Negation Data (Le Cloirec - Ait Yahya et al., LREC-COLING 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.lrec-main.658.pdf