
2005-04-01 Conference publication DOI: 10.18452/9202
Schema Matching using Duplicates
dc.contributor.author: Bilke, Alexander
dc.contributor.author: Naumann, Felix
dc.date.accessioned: 2017-06-17T00:20:55Z
dc.date.available: 2017-06-17T00:20:55Z
dc.date.created: 2006-06-29
dc.date.issued: 2005-04-01
dc.identifier.isbn: 0-7695-2285-8
dc.identifier.uri: http://edoc.hu-berlin.de/18452/9854
dc.description.abstract: Most data integration applications require a matching between the schemas of the respective data sets. We show how the existence of duplicates within these data sets can be exploited to automatically identify matching attributes. We describe an algorithm that first discovers duplicates among data sets with unaligned schemas and then uses these duplicates to perform schema matching between schemas with opaque column names. Discovering duplicates among data sets with unaligned schemas is more difficult than in the usual setting, because it is not clear which fields in one object should be compared with which fields in the other. We have developed a new algorithm that efficiently finds the most likely duplicates in such a setting. Now, our schema matching algorithm is able to identify corresponding attributes by comparing data values within those duplicate records. An experimental study on real-world data shows the effectiveness of this approach. [eng]
dc.language.iso: eng
dc.publisher: Humboldt-Universität zu Berlin, Mathematisch-Naturwissenschaftliche Fakultät II
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/
dc.subject: Data Integration [eng]
dc.subject: Schema Mapping [eng]
dc.subject: Schema Integration [eng]
dc.subject.ddc: 004 Computer science
dc.title: Schema Matching using Duplicates
dc.type: conferenceObject
dc.identifier.urn: urn:nbn:de:kobv:11-10065426
dc.identifier.doi: http://dx.doi.org/10.18452/9202
local.edoc.type-name: Conference publication
local.edoc.container-type: conference
local.edoc.container-type-name: Conference
local.edoc.container-year: 2005
dc.description.version: Peer Reviewed
dc.description.event: Proceedings of the 21st International Conference on Data Engineering, ICDE 2005, 5-8 April 2005, Tokyo, Japan, 2005, pp 69-80, 21. ICDE 2005, Tokyo, Japan, 05.04.2005 - 08.04.2005
dcterms.bibliographicCitation.url: http://csdl.computer.org/dl/proceedings/icde/2005/2285/00/22850069.pdf
dcterms.bibliographicCitation.booktitle: 21. ICDE 2005
dcterms.bibliographicCitation.booktitle: Proceedings of the 21st International Conference on Data Engineering, ICDE 2005, 5-8 April 2005, Tokyo, Japan
dcterms.bibliographicCitation.originalpublishername: IEEE Computer Society
dcterms.bibliographicCitation.originalpublisherplace: Tokyo
dcterms.bibliographicCitation.pagestart: 69
dcterms.bibliographicCitation.pageend: 80
bua.department: Mathematisch-Naturwissenschaftliche Fakultät II
