TY - GEN
T1 - Indexing for subtree similarity-search using edit distance
AU - Cohen, Sara
PY - 2013
Y1 - 2013
N2 - Given a tree Q and a large set of trees T = {T1,⋯, T n}, the subtree similarity-search problem is that of finding the subtrees of trees among T that are most similar to Q, using the tree edit distance metric. Determining similarity using tree edit distance has been proven useful in a variety of application areas. While subtree similarity-search has been studied in the past, solutions required traversal of all of T, which poses a severe bottleneck in processing time, as T grows larger. This paper proposes the first index structure for subtree similarity-search, provided that the unit cost function is used. Extensive experimentation and comparison to previous work shows the huge improvement gained when using the proposed index structure and processing algorithm.
AB - Given a tree Q and a large set of trees T = {T1,⋯, T n}, the subtree similarity-search problem is that of finding the subtrees of trees among T that are most similar to Q, using the tree edit distance metric. Determining similarity using tree edit distance has been proven useful in a variety of application areas. While subtree similarity-search has been studied in the past, solutions required traversal of all of T, which poses a severe bottleneck in processing time, as T grows larger. This paper proposes the first index structure for subtree similarity-search, provided that the unit cost function is used. Extensive experimentation and comparison to previous work shows the huge improvement gained when using the proposed index structure and processing algorithm.
KW - Edit distance
KW - Indexing
UR - http://www.scopus.com/inward/record.url?scp=84880564442&partnerID=8YFLogxK
U2 - 10.1145/2463676.2463716
DO - 10.1145/2463676.2463716
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84880564442
SN - 9781450320375
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 49
EP - 60
BT - SIGMOD 2013 - International Conference on Management of Data
T2 - 2013 ACM SIGMOD Conference on Management of Data, SIGMOD 2013
Y2 - 22 June 2013 through 27 June 2013
ER -