Skip to main navigation Skip to search Skip to main content

Querying parse trees of stochastic context-free grammars

  • Sara Cohen*
  • , Benny Kimelfeld
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

Stochastic context-free grammars (SCFGs) have long been recognized as useful for a large variety of tasks including natural language processing, morphological parsing, speech recognition, information extraction, Web-page wrapping and even analysis of RNA. A string and an SCFG jointly represent a probabilistic interpretation of the meaning of the string, in the form of a (possibly infinite) probability space of parse trees. The problem of evaluating a query over this probability space is considered under the conventional semantics of querying a probabilistic database. For general SCFGs, extremely simple queries may have results that include irrational probabilities. But, for a large subclass of SCFGs (that includes all the standard studied subclasses of SCFGs) and the language of tree-pattern queries with projection (and child/descendant edges), it is shown that query results have rational probabilities with a polynomial-size bit representation and, more importantly, an efficient query-evaluation algorithm is presented.

Original languageEnglish
Title of host publicationDatabase Theory - ICDT 2010
Subtitle of host publication13th International Conference on Database Theory, Proceedings
PublisherAssociation for Computing Machinery (ACM)
Pages62-75
Number of pages14
ISBN (Print)9781605589473
DOIs
StatePublished - 23 Mar 2010
Event13th International Conference on Database Theory, ICDT 2010 - Lausanne, Switzerland
Duration: 23 Mar 201025 Mar 2010

Publication series

NameACM International Conference Proceeding Series

Conference

Conference13th International Conference on Database Theory, ICDT 2010
Country/TerritorySwitzerland
CityLausanne
Period23/03/1025/03/10

Keywords

  • probabilistic databases
  • querying
  • stochastic context free grammars

Fingerprint

Dive into the research topics of 'Querying parse trees of stochastic context-free grammars'. Together they form a unique fingerprint.

Cite this