by Stefan Kuhlins, Axel Korthaus
Download: http://www.wifo.uni-mannheim.de/~kuhlins/paper/framework.pdf
Abstract:
In this paper, we present a new multithreaded framework for information extraction with Java in heterogeneous enterprise application environments. The framework frees the developer from the error-prone task of low-level thread programming. Its power is demonstrated by an example that extracts product prices from web sites, but the framework is useful for numerous other purposes, too. Its strong points are performance, continuous feedback, and adherence to maximum response times. The description of the framework uses UML modeling techniques to visualize multithreading. Moreover, we tackle the problems that Java poses when stopping running threads.
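The three properties named in the abstract (parallel extraction, continuous feedback, and a maximum response time) can be illustrated with a minimal sketch. It is not the framework from the paper: it assumes the standard `java.util.concurrent` classes instead, and the class name `DeadlineExtractionSketch`, the method `extractAll`, and the shop strings are hypothetical. Running threads are stopped cooperatively through interruption (`shutdownNow`) rather than the deprecated `Thread.stop`.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.CompletionService;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorCompletionService;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;

// Hypothetical illustration; not the API of the framework described in the paper.
class DeadlineExtractionSketch {

    /**
     * Runs every task in its own pool thread, hands each result to onResult
     * as soon as it is available, and stops waiting after maxMillis.
     */
    static List<String> extractAll(List<Callable<String>> tasks,
                                   long maxMillis,
                                   Consumer<String> onResult) {
        List<String> results = new ArrayList<>();
        if (tasks.isEmpty()) {
            return results;
        }
        ExecutorService pool = Executors.newFixedThreadPool(tasks.size());
        CompletionService<String> completed = new ExecutorCompletionService<>(pool);
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(maxMillis);
        try {
            for (Callable<String> task : tasks) {
                completed.submit(task);
            }
            for (int i = 0; i < tasks.size(); i++) {
                long remaining = Math.max(0, deadline - System.nanoTime());
                // Wait for the next finished task, but never past the deadline.
                Future<String> done = completed.poll(remaining, TimeUnit.NANOSECONDS);
                if (done == null) {
                    break; // maximum response time reached
                }
                try {
                    String value = done.get();
                    results.add(value);
                    onResult.accept(value); // continuous feedback to the caller
                } catch (ExecutionException e) {
                    // One failing source does not abort the remaining ones.
                }
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the caller's interrupt status
        } finally {
            pool.shutdownNow(); // interrupts workers that are still running
        }
        return results;
    }

    public static void main(String[] args) {
        List<Callable<String>> tasks = new ArrayList<>();
        tasks.add(() -> "shop-a: 19.99");
        tasks.add(() -> { Thread.sleep(50); return "shop-b: 21.50"; });
        tasks.add(() -> { Thread.sleep(10_000); return "shop-c: too slow"; });

        List<String> prices = extractAll(tasks, 500, r -> System.out.println("got " + r));
        System.out.println(prices.size() + " of " + tasks.size() + " sources answered in time");
    }
}
```

Interruption only stops tasks that respond to it, such as the `Thread.sleep` calls used here as stand-ins for network access. A task blocked in non-interruptible I/O would need its own timeout.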
Citations
822 | Mediators in the architecture of future information systems – Wiederhold - 1992
281 | A scalable comparison-shopping agent for the World-Wide Web – Doorenbos, Etzioni, et al. - 1997
142 | Managing semantic heterogeneity in databases: A theoretical perspective – Hull - 1997
49 | The bargainfinder agent: Comparison price shopping on the internet – Krulwich - 1996
27 | Information Extraction from World Wide Web - A Survey – Eikvil - 1999
16 | Concurrent Programming – Lea - 2000
12 | Toolkits for generating wrappers – a survey of software toolkits for automated data extraction from Web sites – Kuhlins, Tredwell - 2002
8 | A Wrapper Architecture for Legacy Data Sources – Roth, Schwartz - 1997
6 | Modeling Java Threads in UML – Schader, Korthaus - 1998
4 | Mastering Regular Expressions, 2nd edition, O'Reilly & Associates – Friedl - 2002
2 | The Java Developers Almanac 1.4, Volume 1: Examples and Quick Reference, e93. Stopping a Thread, http://javaalmanac.com/egs/java.lang/StopThread.html – Chan - 2002
1 | (Toward) an Extensible Wrapper Repository Standard – Kushmerick - 1998
1 | Gleaning Answers from the Web. Position paper – Kushmerick - 2002