A practically efficient algorithm for generating answers to keyword search over data graphs

Konstantin Golenberg, Yehoshua Sagiv

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In keyword search over a data graph, an answer is a non-redundant subtree that contains all the keywords of the query. A naive approach to producing all the answers by increasing height is to generalize Dijkstra's algorithm to enumerating all acyclic paths by increasing weight. The idea of freezing is introduced so that (most) non-shortest paths are generated only if they are actually needed for producing answers. The resulting algorithm for generating subtrees, called GTF, is subtle and its proof of correctness is intricate. Extensive experiments show that GTF outperforms existing systems, even ones that for efficiency's sake are incomplete (i.e., cannot produce all the answers). In particular, GTF is scalable and performs well even on large data graphs and when many answers are needed.

Original languageEnglish
Title of host publication19th International Conference on Database Theory, ICDT 2016
EditorsThomas Zeume, Wim Martens
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959770026
DOIs
StatePublished - 1 Mar 2016
Event19th International Conference on Database Theory, ICDT 2016 - Bordeaux, France
Duration: 15 Mar 201618 Mar 2016

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume48
ISSN (Print)1868-8969

Conference

Conference19th International Conference on Database Theory, ICDT 2016
Country/TerritoryFrance
CityBordeaux
Period15/03/1618/03/16

Bibliographical note

Publisher Copyright:
© 2016 Konstantin Golenberg and Yehoshua Sagiv.

Keywords

  • Efficiency
  • Keyword Search Over Data Graphs
  • Subtree Enumeration By Height
  • Top-K Answers

Fingerprint

Dive into the research topics of 'A practically efficient algorithm for generating answers to keyword search over data graphs'. Together they form a unique fingerprint.

Cite this