MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Approximate tree matching in the presence of variable length don't cares (1994) [33 citations — 6 self]

Download:
Download as a PDF | Download as a PS
by Kaizhong Zhang, Dennis Shasha, Jason T. L. Wang
Journal of Algorithms
http://www.cis.njit.edu/~discdb/papers/tmatch_vldc.ps
Add To MetaCart

Abstract:

Ordered labeled trees are trees in which the sibling order matters. This paper presents algorithms for three problems having to do with approximate matching for such trees with variable-length don't cares (VLDC's). In strings, a VLDC symbol in the pattern may substitute for zero or more symbols in the data string. For example, if "comer " is the pattern, then the " " would substitute for the substring "put" when matching the data string "computer". Approximate VLDC matching in strings means that after the best possible substitution, the pattern still need not be the same as the data string for a match to be allowed. For example, "comer " matches "counter " within distance 1 (representing the cost of removing the "m " from "comer " and having the " " substitute for "unt"). We generalize approximate VLDC string matching to three algorithms for approximate VLDC matching on trees. The time complexity of our algorithms is O(jP j \Theta jDj \Theta min(depth(P); leaves(P)) \Theta min(depth(D); leaves(D))) (where jP j and jDj are the number of nodes respectively of the pattern P and the data tree D), the same as for the best approximate tree matching algorithm without VLDC's previously reported in [25].

Citations

185 The Theory and Computation of Evolutionary Distances – Sellers
169 Simple fast algorithms for the editing distance between trees and related problems – Zhang, Shacha - 1989
122 The tree-to-tree correction problem – Tai - 1979
120 Finding Approximate Patterns in Strings – Ukkonen - 1985
114 Pattern matching in trees – Hoffmann, O'Donnell - 1982
100 Comparing multiple rna secondary structures using tree comparison – Shapiro, Zhang - 1990
89 Fast parallel and serial approximate string matching – LANDAU, VISHKIN - 1989
86 Code generation using tree matching and dynamic programming – Aho, Ganapathi, et al. - 1989
72 An improved algorithm for approximate string matching – Galil, Park - 1990
66 Extracting semantic hierarchies from a large online dictionary – Chodorow, Byrd, et al. - 1985
56 A system for approximate tree matching – Wang, Zhang, et al. - 1994
51 Approximate matching of regular expressions – Myers, Miller - 1989
50 Semantically significant patterns in dictionary definitions – Markowitz, Ahlswede, et al. - 1986
36 Efficient tree pattern matching – Kosaraju - 1989
31 Fast algorithms for unit cost editing distance between trees – Shasha, Zhang - 1990
30 Faster Tree Pattern Matching – Dubiner, Galil, et al. - 1990
22 W.A.: Some biological sequence metrics – Waterman, Smith, et al. - 1976
18 A Tree System Approach For Fingerprint Pattern Recognition – Moayer, Woo - 1986
18 Distance transform for images represented by quadtrees – Samet - 1982
11 dictionary grammars and dictionary entry parsing – Neff, Boguraev - 1989
8 Creating and querying hierarchical lexical data bases – Neff, Byrd, et al. - 1988
6 Three-dimensional structure of a transfer rna in two crystal forms – Kim - 1976
4 Locating syntactic patterns in text corpora – Chodorow, Klavans - 1990
3 Fast serial and parallel algorithms for approximate tree matching with VLDC's – Zhang, Shasha, et al. - 1992
2 LQL user notes: An informal guide to the lexical query language – Byrd - 1990
2 Reference manual for ATBE: A tool for approximate tree matching – Wang, Jeong, et al. - 1991