MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Web taxonomy integration using support vector machines (2004) [11 citations — 1 self]

Download:
Download as a PDF
by Dell Zhang
In WWW ’04: Proceedings of the 13th international conference on World Wide Web
http://www.www2004.org/proceedings/docs/1p472.pdf
Add To MetaCart

Abstract:

We address the problem of integrating objects from a source taxonomy into a master taxonomy. This problem is not only currently pervasive on the web, but also important to the emerging semantic web. A straightforward approach to automating this process would be to train a classifier for each category in the master taxonomy, and then classify objects from the source taxonomy into these categories. In this paper we attempt to use a powerful classification method, Support Vector Machine (SVM), to attack this problem. Our key insight is that the availability of the source taxonomy data could be helpful to build better classifiers in this scenario, therefore it would be beneficial to do transductive learning rather than inductive learning, i.e., learning to optimize classification performance on a particular set of test examples. Noticing that the categorizations of the master and source taxonomies often have some semantic overlap, we propose a method, Cluster Shrinkage (CS), to further enhance the classification by exploiting such implicit knowledge. Our experiments with real-world web data show substantial improvements in the performance of taxonomy integration.

Citations

4514 Statistical Learning Theory – Vapnik - 1998
1256 Modern information retrieval – Baeza-Yates, Ribiero-Neto - 1999
961 Text Categorization with Support Vector Machines – Joachims - 1997
559 Relevance feedback in information retrieval, The – Rocchio - 1971
536 An Introduction to Support Vector Machines – Cristianini, Shawe-Taylor - 2000
477 A comparison of event models for Naive Bayes text classification – McCallum, Nigam - 1998
416 A re-examination of text categorization methods – Yang, Liu - 1999
319 Inductive Learning Algorithms and Representations for Text Categorization – Dumais, Platt, et al. - 1998
319 Ontologies. A Silver Bullet for Knowledge Management and E-Commerce. 2 nd Edition – Fensel - 2003
304 Transductive inference for text classification using support vector machines – Joachims - 1999
242 PROMPT: Algorithm and tool for automated ontology merging and alignment – Noy, Musen - 2000
179 A.: Learning to Map between Ontologies on the Semantic Web – Doan, Madhavan, et al. - 2002
148 Hierarchical classification of Web content – Dumais, Chen - 2000
88 Ontomorph: A translation system for symbolic knowledge – Chalupsky - 2000
87 FCA-Merge: Bottom-up merging of ontologies – Stumme, Maedche - 2001
79 Anchor-prompt: using non-local context for semantic matching – Noy, Musen
68 The Chimaera Ontology Environment – McGuinness, Fikes, et al.
59 Unsing taxonomy, discriminants, and signatures for navigating in text databases – Charkabarti, Dom, et al. - 1997
58 Semi-automatic integration of knowledge sources – Mitra, Wiederhold, et al. - 1999
47 On integrating catalogs – Agrawal, Srikant
39 Facilitating the exchange of explicit knowledge through ontology mappings – Lacher, Groh - 2001
22 Athena: Mining-based interactive management of text databases – Agrawal, Bayardo, et al. - 2000
7 Rule Induction for Concept Hierarchy Alignment – Ichise, Takeda, et al. - 2001