Abstract
We demonstrate a computational method for analyzing complementizer variability within language and literature, focusing on Hebrew as a test case.The primary complementizers in Hebrew are še- and ki.We first run a large-scale corpus analysis to determine the relative preference for one or the other of these complementizers given the preceding verb.On top of this foundation, we leverage clustering methods to measure the degree of interchangeability between the complementizers for each verb.The resulting tables, which provide this information for all common complement-taking verbs in Hebrew, are a first-of-its-kind lexical resource which we provide to the NLP community.Upon this foundation, we demonstrate a computational method to analyze literary works for unusual and unexpected complementizer usages deserving of literary analysis.
Original language | English |
---|---|
Title of host publication | NLP4DH 2024 - 4th International Conference on Natural Language Processing for Digital Humanities, Proceedings of the Conference |
Editors | Mika Hamalainen, Emily Ohman, So Miyagawa, Khalid Alnajjar, Yuri Bizzoni |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 294-307 |
Number of pages | 14 |
ISBN (Electronic) | 9798891761810 |
State | Published - 2024 |
Event | 4th International Conference on Natural Language Processing for Digital Humanities, NLP4DH 2024 - Miami, United States Duration: 16 Nov 2024 → … |
Publication series
Name | NLP4DH 2024 - 4th International Conference on Natural Language Processing for Digital Humanities, Proceedings of the Conference |
---|
Conference
Conference | 4th International Conference on Natural Language Processing for Digital Humanities, NLP4DH 2024 |
---|---|
Country/Territory | United States |
City | Miami |
Period | 16/11/24 → … |
Bibliographical note
Publisher Copyright:© 2024 Association for Computational Linguistics.