See this document in CiteSeerX!

Schema Matching using Duplicates  (Make Corrections)  (3 citations)
Alexander Bilke Technische Universit at Berlin, Germany -berlin.de ...



  Home/Search   Context   Related

 
View or download:
www2.informatik.huberlin....ICDE05.pdf
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  www2.informatik.hu...publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Most data integration applications require a matching between the schemas of the respective data sets. We show how the existence of duplicates within these data sets can be exploited to automatically identify matching attributes. We describe an algorithm that first discovers duplicates among data sets with unaligned schemas and then uses these duplicates to perform schema matching between schemas with opaque column names. (Update)

Cited by:   More
Efficiently Computing Inclusion Dependencies for Schema.. - Jana Bauckmann Ulf   (Correct)
Automatic Data Fusion with HumMer - Alexander Bilke Jens (2005)   (Correct)
Tuning Schema Matching Software - Using Synthetic Scenarios (2005)   (Correct)

Active bibliography (related documents):   More   All
0.6:   Record Linkage: Current Practice and Future Directions - Gu, Baxter, Vickers..   (Correct)
0.4:   Bootstrapping Ontology Alignment Methods with APFEL - Ehrig, Staab, Sure (2005)   (Correct)
0.4:   A Duplicate Detection Benchmark for - Xml And Relational   (Correct)

Similar documents based on text:
0.0:   Unknown -   (Correct)

Related documents from co-citation:   More   All
2:   Reconciling Schemas of Disparate Data Sources: A Maching-Learning Approach - Doan, Domingos et al. - 2001

BibTeX entry:   (Update)

Alexander Bilke and Felix Naumann. Schema matching using duplicates. In Proc. of ICDE-05. http://citeseer.ist.psu.edu/749754.html   More

@misc{ bilke-schema,
  author = "A. Bilke and F. Naumann",
  title = "Schema matching using duplicates",
  text = "Alexander Bilke and Felix Naumann. Schema matching using duplicates. In
    Proc. of ICDE-05.",
  url = "citeseer.ist.psu.edu/749754.html" }
Citations (may not include all citations):
482   Combinatorial Optimization: Algorithms and Complexity (context) - Papadimitriou, Steiglitz - 1982
372   An algorithm for suffix stripping (context) - Porter - 1980
141   A survey of approaches to automatic schema matching - Rahm, Bernstein - 2001
93   Integration of heterogeneous databases without common domain.. - Cohen - 1998
79   Reconciling schemas of disparate data sources: A machine-lea.. - Doan, Domingos et al. - 2001
78   Generic schema matching with Cupid - Madhavan, Bernstein et al. - 2001
66   A theory for record linkage (context) - Fellegi, Sunter - 1969
56   Category translation: Learning to understand information on .. - Perkowitz, Etzioni - 1995
43   Real world data is dirty Data cleansing and mergepurge probl.. - andez, world et al. - 1998
41   Translating web data - Popa, Velegrakis et al. - 2002
39   COMA - A system for flexible combination of schema matching .. - Do, Rahm - 2002
36   Similarity Flooding: A versatile graph matching algorithm an.. (context) - Melnik, Garcia-Molina et al. - 2002
28   The field matching problem: Algorithms and applications - Monge, Elkan - 1996
28   Entity identification in database integration - Lim, Srivastasa et al. - 1993
24   Applying model management to classical meta data problems - Bernstein - 2003
23   Matching and record linkage - Winkler - 1995
23   Efficient algorithms for finding maximum matching in graphs (context) - Galil - 1986
23   A comparison of string distance metrics for name-matching ta.. - Cohen, Ravikumar et al. - 2003
21   Block edit models for approximate string matching - Lopresti, Tomkins - 1997
21   Data cleaning: Problems and current approaches (context) - Rahm, Do - 2000
19   Adaptive duplicate detection using learnable string similari.. - Bilenko, Mooney - 2003
16   Learning object identification rules for information integra.. - Tejada, Knoblock et al. - 2001
15   Comparison of schema matching evaluations - Do, Melnik et al. - 2002
13   TAILOR: A record linkage toolbox - Elfeky, Verykios et al. - 2002
12   Eliminating fuzzy duplicates in data warehouses - Ananthakrishna, Chaudhuri et al. - 2002
8   Attribute classification using feature analysis (context) - Naumann, Ho et al. - 2002
8   Text joins in an RDBMS for web data integration - Gravano, Ipeiriotis et al. - 2003
8   iMAP: Discovering complex semantic matches between database .. - Dhamankar, Lee et al. - 2004
4   Instancebased attribute identification in database integrati.. (context) - Chua, Chiang et al. - 2003
4   Object matching for data integration: A profile-based approa.. (context) - Doan, Lu et al. - 2003
3   Automatic record matching in cooperative information systems (context) - Bertolazzi, Santis et al. - 2003

Documents on the same site (http://www2.informatik.hu-berlin.de/mac/publications.html):   More
A Data Model and Query Language to Explore Enhanced Links.. - Mihaila, Naumann, al. (2005)   (Correct)
Completeness of Information Sources - Naumann, Freytag, Leser (2000)   (Correct)
Quality-driven Integration of Heterogeneous Information.. - Naumann, Leser, Freytag (1999)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC