by Kaizhong Zhang, Dennis Shasha, Jason T. L. Wang
Journal of Algorithms
http://www.cis.njit.edu/~discdb/papers/tmatch_vldc.ps
Abstract:
Ordered labeled trees are trees in which the sibling order matters. This paper presents algorithms for three problems having to do with approximate matching for such trees with variable-length don't cares (VLDC's). In strings, a VLDC symbol in the pattern may substitute for zero or more symbols in the data string. For example, if "com*er" is the pattern, then the "*" would substitute for the substring "put" when matching the data string "computer". Approximate VLDC matching in strings means that after the best possible substitution, the pattern still need not be the same as the data string for a match to be allowed. For example, "com*er" matches "counter" within distance 1 (representing the cost of removing the "m" from "com*er" and having the "*" substitute for "unt"). We generalize approximate VLDC string matching to three algorithms for approximate VLDC matching on trees. The time complexity of our algorithms is $O(|P| \times |D| \times \min(\mathrm{depth}(P), \mathrm{leaves}(P)) \times \min(\mathrm{depth}(D), \mathrm{leaves}(D)))$ (where $|P|$ and $|D|$ are the number of nodes of the pattern P and the data tree D, respectively), the same as for the best approximate tree matching algorithm without VLDC's previously reported in [25].
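The string case described in the abstract can be illustrated with a small dynamic program. This is only a sketch of approximate VLDC matching on strings, not the tree algorithms of the paper. It assumes unit costs for insertion, deletion, and relabeling, writes the VLDC symbol as "*", and uses a hypothetical function name, `vldc_distance`.

```python
def vldc_distance(pattern: str, data: str, vldc: str = "*") -> int:
    """Minimum number of unit-cost edits needed to match pattern to data,
    where each VLDC symbol may absorb zero or more data symbols for free.
    Unit costs and the "*" symbol are assumptions made for this sketch."""
    m, n = len(pattern), len(data)
    # dist[i][j] = cost of matching pattern[:i] against data[:j]
    dist = [[0] * (n + 1) for _ in range(m + 1)]
    for j in range(1, n + 1):
        dist[0][j] = j  # empty pattern: every data symbol is inserted
    for i in range(1, m + 1):
        is_vldc = pattern[i - 1] == vldc
        # empty data: a VLDC matches the empty string, other symbols are deleted
        dist[i][0] = dist[i - 1][0] + (0 if is_vldc else 1)
        for j in range(1, n + 1):
            if is_vldc:
                # VLDC matches nothing, or absorbs data[j-1] and stays active
                dist[i][j] = min(dist[i - 1][j], dist[i][j - 1])
            else:
                relabel = dist[i - 1][j - 1] + (pattern[i - 1] != data[j - 1])
                dist[i][j] = min(relabel, dist[i - 1][j] + 1, dist[i][j - 1] + 1)
    return dist[m][n]


print(vldc_distance("com*er", "computer"))  # 0: "*" substitutes for "put"
print(vldc_distance("com*er", "counter"))   # 1: remove "m", "*" substitutes for "unt"
```

The table has (|pattern| + 1) × (|data| + 1) cells and each cell is filled in constant time, so this string sketch runs in time proportional to the product of the two lengths.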
Citations
185 | The Theory and Computation of Evolutionary Distances – Sellers
169 | Simple fast algorithms for the editing distance between trees and related problems – Zhang, Shasha - 1989
122 | The tree-to-tree correction problem – Tai - 1979
120 | Finding Approximate Patterns in Strings – Ukkonen - 1985
114 | Pattern matching in trees – Hoffmann, O'Donnell - 1982
100 | Comparing multiple RNA secondary structures using tree comparison – Shapiro, Zhang - 1990
89 | Fast parallel and serial approximate string matching – Landau, Vishkin - 1989
86 | Code generation using tree matching and dynamic programming – Aho, Ganapathi, et al. - 1989
72 | An improved algorithm for approximate string matching – Galil, Park - 1990
66 | Extracting semantic hierarchies from a large online dictionary – Chodorow, Byrd, et al. - 1985
56 | A system for approximate tree matching – Wang, Zhang, et al. - 1994
51 | Approximate matching of regular expressions – Myers, Miller - 1989
50 | Semantically significant patterns in dictionary definitions – Markowitz, Ahlswede, et al. - 1986
36 | Efficient tree pattern matching – Kosaraju - 1989
31 | Fast algorithms for unit cost editing distance between trees – Shasha, Zhang - 1990
30 | Faster Tree Pattern Matching – Dubiner, Galil, et al. - 1990
22 | Some biological sequence metrics – Waterman, Smith, et al. - 1976
18 | A Tree System Approach For Fingerprint Pattern Recognition – Moayer, Woo - 1986
18 | Distance transform for images represented by quadtrees – Samet - 1982
11 | Dictionary grammars and dictionary entry parsing – Neff, Boguraev - 1989
8 | Creating and querying hierarchical lexical data bases – Neff, Byrd, et al. - 1988
6 | Three-dimensional structure of a transfer RNA in two crystal forms – Kim - 1976
4 | Locating syntactic patterns in text corpora – Chodorow, Klavans - 1990
3 | Fast serial and parallel algorithms for approximate tree matching with VLDC's – Zhang, Shasha, et al. - 1992
2 | LQL user notes: An informal guide to the lexical query language – Byrd - 1990
2 | Reference manual for ATBE: A tool for approximate tree matching – Wang, Jeong, et al. - 1991