Abstract
Systems that automatically define unfamiliar terms hold the promise of improving the accessibility of scientific texts, especially for readers who may lack prerequisite background knowledge. However, current systems assume a single “best” description per concept, which fails to account for the many ways a concept can be described. We present ACCoRD, an end-to-end system tackling the novel task of generating sets of descriptions of scientific concepts. Our system takes advantage of the myriad ways a concept is mentioned across the scientific literature to produce distinct, diverse descriptions of target concepts in terms of different reference concepts. In a user study, we find that users prefer (1) descriptions produced by our end-to-end system, and (2) multiple descriptions to a single “best” description. We release the ACCoRD corpus which includes 1,275 labeled contexts and 1,787 expert-authored concept descriptions to support research on our task.
| Original language | English |
|---|---|
| Title of host publication | EMNLP 2022 - 2022 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Demonstrations Session |
| Editors | Wanxiang Che, Ekaterina Shutova |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 200-213 |
| Number of pages | 14 |
| ISBN (Electronic) | 9781959429418 |
| DOIs | |
| State | Published - 2022 |
| Externally published | Yes |
| Event | 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, EMNLP 2022 - Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 11 Dec 2022 |
Publication series
| Name | EMNLP 2022 - 2022 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Demonstrations Session |
|---|
Conference
| Conference | 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, EMNLP 2022 |
|---|---|
| Country/Territory | United Arab Emirates |
| City | Abu Dhabi |
| Period | 7/12/22 → 11/12/22 |
Bibliographical note
Publisher Copyright:© 2022 Association for Computational Linguistics.