| G. Gottlob and C. Koch, "Monadic datalog and the expressive power of languages for web information extraction." Journal of the ACM, vol. 51, 2004. |
....the notion of tag proximity w.r.t. a similar proximity in strings which are a specific traversal of HTML trees. So far, little research addresses the information extraction from tree documents [3; 6; 8] Some researchers study languages for wrapping tree structures and their expressive power [6] ; other researchers develop learning algorithms for extraction from tree structures [3; 8] Interestingly, wrapper builders in [3] fit the local view approach, while tree automata in [8] follow the global view approach. In the grammatical inference, certain results has been successfully ....
Georg Gottlob and Christoph Koch. Monadic datalog and the expressive power of languages for web information extraction. In Proc. ACM PODS, pages 17--28, 2002.
....XSLT transformations [18] However, most of the work investigated MSO logic as a basis for XML querying. It turned out that it de nes a very robust class of queries with a lot of equivalent characterizations by other query mechanisms like attributed grammars [23] automata [20, 1, 21] and datalog [12]. Although MSO logic is a robust and powerful query language it can not express all kinds Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage ....
G. Gottlob and C. Koch. Monadic Datalog and the Expressive Power of Languages for Web Information Extraction. In PODS 2001.
....However, the Lixto TS goes beyond the scope of mere Web data extraction tools. In fact data extraction is merely one although a very important step in the processing model for which we employ the Lixto extraction engine. Lixto Visual Wrapper allows for very expressive extraction programs [8] , that can be generated by means of a visual builder interface. Therefore it integrates very nicely with the overall visual programming approach. Related work has been carried out in the field of automating the retrieval of Web pages and Web data. While crawler technology is directed towards ....
G. Gottlob and C. Koch. Monadic datalog and the expressive power of languages for Web Information Extraction. In Proc. of PODS -- Best paper award, 2002.
No context found.
G. Gottlob and C. Koch, "Monadic datalog and the expressive power of languages for web information extraction." Journal of the ACM, vol. 51, 2004.
No context found.
G. Gottlob and Ch. Koch. Monadic Datalog and the expressive power of languages for web information retrieval. J. ACM, 51(1):74--113, 2004.
No context found.
G. Gottlob and Ch. Koch. Monadic Datalog and the expressive power of languages for web information retrieval. J. ACM, 51(1):74--113, 2004.
No context found.
GOTTLOB, G., AND KOCH, C. Monadic datalog and the expressive power of languages for Web Information Extraction. In Proc. of PODS (2002).
No context found.
G. Gottlob and C. Koch. Monadic datalog and the expressive power of languages for Web Information Extraction. In Proc. of PODS, 2002.
No context found.
G. Gottlob and C. Koch. Monadic datalog and the expressive power of languages for Web Information Extraction. In Proc. of PODS, 2002.
No context found.
G. Gottlob and C. Koch. Monadic Datalog and the Expressive Power of Languages for Web Information Extraction. In PODS 2001, pages 17--28, 2002.
No context found.
G. Gottlob, C. Koch, Monadic Datalog and the Expressive Power of Languages for Web Information Extraction, in: Proceedings of 21st ACM SIGMODSIGACT -SIGART Symposium on Principles of Database Systems (PODS 2002.
No context found.
G. Gottlob and C. Koch. Monadic datalog and the expressive power of languages for web information extraction. In Proceedings of the ACM Symposium on Principle of Databases Systems, pages 17--28, 2002.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC