A generalized framework for revealing analogous themes across related topics

Marx Zvika*, Dagan Ido, Shamir Eli

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

Abstract

This work addresses the task of identifying thematic correspondences across subcorpora focused on different topics. We introduce an unsupervised algorithmic framework based on distributional data clustering, which generalizes previous initial works on this task. The empirical results reveal interesting commonalities of different religions. We evaluate the results through measuring the overlap of our clusters with clusters compiled manually by experts. The tested variants of our framework are shown to outperform alternative methods applicable to the task.

Conference

ConferenceHuman Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, HLT/EMNLP 2005, Co-located with the 2005 Document Understanding Conference, DUC and the 9th International Workshop on Parsing Technologies, IWPT
Country/TerritoryCanada
CityVancouver, BC
Period6/10/058/10/05

Fingerprint

Dive into the research topics of 'A generalized framework for revealing analogous themes across related topics'. Together they form a unique fingerprint.

Cite this