Abstract
The run time complexity of state-of-the-art inference algorithms in graph-based dependency parsing is super-linear in the number of input words (n). Recently, pruning algorithms for these models have shown to cut a large portion of the graph edges, with minimal damage to the resulting parse trees. Solving the inference problem in run time complexity determined solely by the number of edges (m) is hence of obvious importance. We propose such an inference algorithm for first-order models, which encodes the problem as a minimum spanning tree (MST) problem in an undirected graph. This allows us to utilize state-of-the-art undirected MST algorithms whose run time is O(m) at expectation and with a very high probability. A directed parse tree is then inferred from the undirected MST and is subsequently improved with respect to the directed parsing model through local greedy updates, both steps running in O(n) time. In experiments with 18 languages, a variant of the first-order MSTParser (McDonald et al., 2005b) that employs our algorithm performs very similarly to the original parser that runs an O(n2) directed MST inference.
Original language | English |
---|---|
Title of host publication | 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 2104-2113 |
Number of pages | 10 |
ISBN (Electronic) | 9781510827585 |
DOIs | |
State | Published - 2016 |
Event | 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Berlin, Germany Duration: 7 Aug 2016 → 12 Aug 2016 |
Publication series
Name | 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers |
---|---|
Volume | 4 |
Conference
Conference | 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 |
---|---|
Country/Territory | Germany |
City | Berlin |
Period | 7/08/16 → 12/08/16 |
Bibliographical note
Publisher Copyright:© 2016 Association for Computational Linguistics.