Results 1 - 2 of 2
NoDoSE - A tool for Semi-Automatically Extracting Structured and Semistructured Data from Text Documents.
- SIGMOD Record, 1998
"... Often interesting structured or semistructured data is not in database systems but in HTML pages, text files, or on paper. The data in these formats is not usable by standard query processing engines and hence users need a way of extracting data from these sources into a DBMS or of writing wrappers ..."
Cited by 168 (2 self)
This paper describes both the NoDoSE architecture, which can be used as a test bed for structure mi...
NoDoSE - A Tool for Semi-Automatically Extracting Structured and
"... Often interesting structured or semistructured data is not in database systems but in HTML pages, text files, or on paper. The data in these formats is not usable by standard query processing engines and hence users need a way of extracting data from these sources into a DBMS or of writing wrappers ..."
This paper describes both the NoDoSE architecture, which can be used as a test bed for structure mining algorithms in general, and the mining algorithms that have been developed by the author. The prototype, which is written in Java, is described, and experiences parsing a variety of documents are reported.