Download:
|
by Xiaolan Zhu, Susan Gauch, Lutz Gerhard, Nicholas Kral, Er Pretschner
In Proceedings of the 8 th International Conference On Information Knowledge Management (CIKM
http://homer.ittc.ukans.edu/website/publications/papers/cikm99.ps
Add To MetaCart
Abstract:
Centralized search process requires that the whole collection reside at a single site. This imposes a burden on both the system storage of the site and the network traffic near the site. It thus comes to require the search process to be distributed. Recently, more and more Web sites provide the ability to search their local collection of Web pages. Query brokering systems are used to direct queries to the promising sites and merge the results from these sites. Creation of meta-information of the sites plays an important role in such systems. In this article, we introduce an ontology-based web site mapping method used to produce conceptual meta-information, the Vector Space approach, and present a serial of experiments comparing it with Nave-Bayes approach. We found that the Vector Space approach produces better accuracy in ontology-based web site mapping.
Citations
|
3215
|
C4.5: Programs for Machine Learning
– Quinlan
- 1993
|
|
2217
|
J.: Introduction to Modern Information Retrieval
– Salton, Macgill
- 1983
|
|
555
|
Generalized Fisheye Views
– Furnas
- 1986
|
|
351
|
Cone Trees: animated 3D visualizations of hierarchical information
– Robertson, Mackinlay, et al.
- 1991
|
|
296
|
The INQUERY retrieval system
– Callan, Croft, et al.
- 1992
|
|
254
|
Enabling technology for knowledge sharing
– Neches, Fikes, et al.
- 1991
|
|
232
|
An Analysis of Bayesian Classifiers
– Langley, Iba, et al.
- 1992
|
|
201
|
Treemaps: A space-filling approach to the visualization of hierarchical information structures
– Johnson, Shneiderman
- 1991
|
|
171
|
Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering. http://www.cs.cmu.edu/ mccallum/bow
– McCallum
- 1996
|
|
112
|
STARTS: Stanford Proposal for Internet MetaSearching
– Gravano, Chang, et al.
- 1997
|
|
87
|
Hypursuit: A hierarchical network search engine that exploits content-link hypertext clustering
– Weiss
- 1996
|
|
73
|
Building classifiers using bayesian networks
– Friedman, Goldszmidt
- 1996
|
|
33
|
A corpus analysis approach for automatic query expansion
– Gauch, Wang
- 1997
|
|
33
|
WebCutter: A system for dynamic and tailorable site mapping
– Maarek, Jacovi, et al.
- 1997
|
|
30
|
Adaptive agents for information gathering from multiple, distributed information sources
– Fan, Gauch
- 1999
|
|
30
|
Learning Collection Fusion Strategies for Information Retrieval
– Towell, Voorhees, et al.
- 1995
|
|
29
|
Internet Agents: Spiders, Wanderers, Brokers, and Bots
– Cheong
- 1996
|
|
25
|
Value Bars: An Information Visualization and Navigation Tool for Multiattribute Listings (Demo Summary
– Chimera
- 1992
|
|
25
|
Resource Selection in Café: an Architecture for Networked Information Retrieval
– Chower, Nicholas
- 1996
|
|
24
|
Experience with the InfoSleuth agent architecture
– Nodine, Perry, et al.
- 1998
|
|
11
|
Agent sourcebook
– Caglayan, Harrison
- 1997
|
|
10
|
Information Fusion with ProFusion
– Gauch, Wang
- 1996
|
|
10
|
Searching and browsing on map displays
– LIN
- 1995
|
|
8
|
Using Statistical Properties of Text to Create Metadata
– Crowder, Nicholas
- 1996
|
|
6
|
Database Merging Strategies for Searching Public and Private Collections
– Voorhees
- 1997
|
|
2
|
Resource Selection in CAF: an Architecture for Network Information Retrieval
– Crowder
- 1996
|