MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  A Multithreaded Java Framework for Information Extraction in the Context of Enterprise Application Integration

Download:
Download as a PDF
by Stefan Kuhlins, Axel Korthaus
http://www.wifo.uni-mannheim.de/~kuhlins/paper/framework.pdf
Add To MetaCart

Abstract:

In this paper, we present a new multithreaded framework for information extraction with Java in heterogeneous enterprise application environments, which frees the developer from having to deal with the error-prone task of low-level thread programming. The power of this framework is demonstrated by an example of extracting product prices from web sites, but the framework is useful for numerous other purposes, too. Strong points of the framework are its performance, continuous feedback, and adherence to maximum response times. The description of the framework uses UML modeling techniques for visualizing multithreading. Moreover, we tackle Java problems of stopping running threads. 1.

Citations

822 Mediators in the architecture of future information systems – Wiederhold - 1992
281 A scalable comparison-shopping agent for the World-Wide Web – Doorenbos, Etzioni, et al. - 1997
142 Managing semantic heterogeneity in databases: A theoretical perspective – Hull - 1997
49 The bargainfinder agent: Comparison price shopping on the internet – Krulwich - 1996
27 Information Extraction from World Wide Web - A Survey – Eikvil - 1999
16 Concurrent Programming – Lea - 2000
12 Toolkits for generating wrappers – a survey of software toolkits for automated data extraction from Web sites – Kuhlins, Tredwell - 2002
8 A Wrapper Architecture for Legacy Data Sources – Roth, Schwartz - 1997
6 Modeling Java Threads in UML – Schader, Korthaus - 1998
4 Mastering Regular Expressions, 2nd edition, O'Reilly & Associates – Friedl - 2002
2 The Java Developers Almanac 1.4, Volume 1: Examples and Quick Reference, e93. Stopping a Thread, http://javaalmanac.com/egs/java.lang/StopThread.html – Chan - 2002
1 Toward) an Extensible Wrapper Repository Standard – Kushmerick - 1998
1 Gleaning Answers from the Web. Position paper – Kushmerick - 2002