Results 11 - 20
of
20
A Branch and Bound Algorithm to Scale Alignment of Large Ontologies
"... Abstract Increasingly, ontologies are being developed and exposed on the Web to support a variety of applications, including biological knowledge sharing, enhanced search and discovery, and decision support. This proliferation of new Web knowledge sources is resulting in a growing need for integrat ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract Increasingly, ontologies are being developed and exposed on the Web to support a variety of applications, including biological knowledge sharing, enhanced search and discovery, and decision support. This proliferation of new Web knowledge sources is resulting in a growing need for integration and enrichment of these sources. Automated and semi-automated solutions to aligning ontologies are emerging that address this growing need with very promising results. However, only very recently, solutions for scalability of ontology alignment have begun to emerge. The goal of this research is to investigate scalability issues in alignment of large-scale ontologies. We present an alignment algorithm that bounds processing by selecting optimal subtrees to align and show that this significantly improves efficiency without major reduction in precision. We apply the algorithm in conjunction with our approach that includes modeling ontology alignment in a Support Vector Machine.
Review of Ontology Matching Approaches and Challenges
, 2015
"... Ontology mapping aims to solve the semantic heterogeneity problems such as ambiguous entity names, different entity granularity, incomparable categorization, and various instances of different ontologies. The mapping helps to search or query data from different sources. Ontology mapping is necessary ..."
Abstract
- Add to MetaCart
Ontology mapping aims to solve the semantic heterogeneity problems such as ambiguous entity names, different entity granularity, incomparable categorization, and various instances of different ontologies. The mapping helps to search or query data from different sources. Ontology mapping is necessary in many applications such as data integration, ontology evolution, data warehousing, e-commerce and data exchange in various domains such as purchase order, health, music and e-commerce. It is performed by ontology matching approaches that find semantic correspondences between ontology entities. In this paper, we review state of the art ontology matching approaches. We describe the approaches according to instance-based, schema-based, instance and schema-based, usage-based, element-level, and structure-level. The analysis of the existing approaches will assist us in revealing some challenges in ontology mapping such as handling ontology matching errors, user involvement and reusing previous match operations. We explain the way of handling the challenges using new strategy in order to increase the performance.
Submitted to the International Workshop on Vaccine and Drug Ontology Studies (VDOS 2013). draft material-- please do not cite Aligning Pharmacologic Classes Between MeSH and ATC
"... Objective: To align pharmacologic classes in ATC and MeSH with lexical and instance-based techniques. Methods: Lexical alignment: we map the names of ATC classes to MeSH through the UMLS, leveraging normalization and additional synonymy. Instance-based alignment: we associate ATC and MeSH classes th ..."
Abstract
- Add to MetaCart
(Show Context)
Objective: To align pharmacologic classes in ATC and MeSH with lexical and instance-based techniques. Methods: Lexical alignment: we map the names of ATC classes to MeSH through the UMLS, leveraging normalization and additional synonymy. Instance-based alignment: we associate ATC and MeSH classes through the drugs they share, using the Jaccard coefficient to measure class-class similarity. We use a metric to distinguish between equivalence and inclusion mappings. Results: We found 226 lexical mappings, as well as 360 instance-based mappings, with a limited overlap (62). From the 360 instance-based methods we classify 113 as equivalence mappings and 247 as inclusion mappings. A limited failure analysis is presented. Conclusion: Our instance-based approach to aligning pharmacologic classes has the prospect of effectively supporting the creation of a mapping of pharmacologic classes between ATC and MeSH. This exploratory investigation needs to be evaluated in order to adapt the thresholds for similarity.
Discovering Cross-Ontology Subsumption Relationships by Using Ontological Annotations on Biomedical Literature
"... Cross-ontology concept subsumption relationships facilitate the integration of ontologies by explicitly defining the generalization of a concept over other concepts in different ontologies. However, existing methods for discovering these relationships show poor performances and one of the problems i ..."
Abstract
- Add to MetaCart
(Show Context)
Cross-ontology concept subsumption relationships facilitate the integration of ontologies by explicitly defining the generalization of a concept over other concepts in different ontologies. However, existing methods for discovering these relationships show poor performances and one of the problems is the lack of instance data in ontologies that can be used to identify cross-ontology subsumptions reliably. To address the problem, we present a novel method, SURD (SUbsumption Relation Discovery), which uses annotations on biomedical text corpora for populating ontologies with instances. Subsumption relationships between pairs of concepts are then determined based on their shared instances. SURD shows good performance when applied to biomedical ontologies, achieving precision values of 0.786 and 0.729 for cross-ontology subsumptions between the ontology pairs GRO-UMLS Metathesaurus and GENIA-UMLS Metathesaurus respectively. As a practical application, we used SURD’s subsumptions for automated ontological corpus annotation and achieved F-measures of 0.693 and 0.783 on the GRO and GENIA corpora respectively. These results are superior to the results of using subsumption relations inferred from equivalence relations (F-measures of 0.569 and 0.645) and subsumption relations identified with Hearst patterns (F-measures of 0.002 and 0.096). 1
Aligning Pharmacologic Classes Between MeSH and ATC
"... Objective: To align pharmacologic classes in ATC and MeSH with lexical and instance-based techniques. Methods: Lexical alignment: we map the names of ATC classes to MeSH through the UMLS, leveraging normalization and additional synonymy. Instance-based alignment: we associate ATC and MeSH classes th ..."
Abstract
- Add to MetaCart
(Show Context)
Objective: To align pharmacologic classes in ATC and MeSH with lexical and instance-based techniques. Methods: Lexical alignment: we map the names of ATC classes to MeSH through the UMLS, leveraging normalization and additional synonymy. Instance-based alignment: we associate ATC and MeSH classes through the drugs they share, using the Jaccard coefficient to measure class-class similarity. We use a metric to distinguish between equivalence and inclusion mappings. Results: We found 221 lexical mappings, as well as 343 instance-based mappings, with a limited overlap (61). From the 343 instance-based mappings we classify 113 as equivalence mappings and 230 as inclusion mappings. A limited failure analysis is presented. Conclusion: Our instance-based approach to aligning pharmacologic classes has the prospect of effectively supporting the creation of a mapping of pharmacologic classes between ATC and MeSH. This exploratory investigation needs to be evaluated in order to adapt the thresholds for similarity.
Table of Contents Editorial Welcome
, 2010
"... www.educationaldatamining.org/JEDM/index.php?option=com_content&view=category&layout=blog&id ..."
Abstract
- Add to MetaCart
www.educationaldatamining.org/JEDM/index.php?option=com_content&view=category&layout=blog&id
Data integration
"... Semantic heterogeneity Finally, two powerful matchers, V-DOC and GMO, are employed to discover alignments in the W logies antic sing. Web f the knowledge from thesauri or third parties ’ ontologies. The rest 20 % requires manual contribution to perfect portions of the results. Past work mainly focus ..."
Abstract
- Add to MetaCart
(Show Context)
Semantic heterogeneity Finally, two powerful matchers, V-DOC and GMO, are employed to discover alignments in the W logies antic sing. Web f the knowledge from thesauri or third parties ’ ontologies. The rest 20 % requires manual contribution to perfect portions of the results. Past work mainly focuses on human collaboration [40,65] and matching visualization [1,9,17,36], which significantly facilitate matching understanding and refinement. In this paper, we aim at proposing automatic approaches for ontology matching.
An Evolution-based Approach forAssessing Ontology Mappings- A Case Study in the Life Sciences
"... Abstract: Ontology matching has been widely studied. However, the resulting on-tology mappings can be rather unstable when the participating ontologies or util-ized secondary sources (e.g., instance sources, thesauri) evolve. We propose an evolution-based approach for assessing ontology mappings by ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract: Ontology matching has been widely studied. However, the resulting on-tology mappings can be rather unstable when the participating ontologies or util-ized secondary sources (e.g., instance sources, thesauri) evolve. We propose an evolution-based approach for assessing ontology mappings by annotating their cor-respondences by information about similarity values for past ontology versions. These annotations allow us to assess the stability of correspondences over time and they can thus be used to determine better and more robust ontology mappings. The approach is generic in that it can be applied independently from the utilized match technique. We define different stability measures and show results of a first evaluation for the life science domain. 1
and
"... This paper introduces a method for mining multiple-choice assessment data for similarity of the concepts represented by the multiple choice responses. The resulting similarity matrix can be used to visualize the distance between concepts in a lower-dimensional space. This gives an instructor a visua ..."
Abstract
- Add to MetaCart
This paper introduces a method for mining multiple-choice assessment data for similarity of the concepts represented by the multiple choice responses. The resulting similarity matrix can be used to visualize the distance between concepts in a lower-dimensional space. This gives an instructor a visualization of the relative difficulty of concepts among the students in the class. It may also be used to cluster concepts, to understand unknown responses in the context of previously identified concepts.