Improved unsupervised POS induction through prototype discovery

Omri Abend, Roi Reichart, Ari Rappoport

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

We present a novel fully unsupervised algorithm for POS induction from plain text, motivated by the cognitive notion of prototypes. The algorithm first identifies landmark clusters of words, serving as the cores of the induced POS categories. The rest of the words are subsequently mapped to these clusters. We utilize morphological and distributional representations computed in a fully unsupervised manner. We evaluate our algorithm on English and German, achieving the best reported results for this task.

Original languageAmerican English
Title of host publicationACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Conference Proceedings
EditorsJan Hajic, Sandra Carberry, Stephen Clark
PublisherAssociation for Computational Linguistics (ACL)
Pages1298-1307
Number of pages10
ISBN (Electronic)1932432663, 9781932432664
StatePublished - 2010
Externally publishedYes
Event48th Annual Meeting of the Association for Computational Linguistics, ACL 2010 - Uppsala, Sweden
Duration: 11 Jul 201016 Jul 2010

Publication series

NameProceedings of the Annual Meeting of the Association for Computational Linguistics
Volume2010-July
ISSN (Print)0736-587X

Conference

Conference48th Annual Meeting of the Association for Computational Linguistics, ACL 2010
Country/TerritorySweden
CityUppsala
Period11/07/1016/07/10

Bibliographical note

Funding Information:
∗Omri Abend is grateful to the Azrieli Foundation for the award of an Azrieli Fellowship.

Publisher Copyright:
© 2010 Association for Computational Linguistics.

Fingerprint

Dive into the research topics of 'Improved unsupervised POS induction through prototype discovery'. Together they form a unique fingerprint.

Cite this