| edoc server of Humboldt-Universität zu Berlin |
| Publication type: | Workshop or conference contribution |
| Author(s): | Felix Naumann; Matthias Häussler |
| Title: | Declarative Data Merging with Conflict Resolution |
| Published in: | Seventh International Conference on Information Quality (IQ 2002), 2002, pp. 212-224 |
| Event: | 7th IQ 2002, MIT Sloan School of Management, Cambridge, MA, USA, 8 November 2002 - 10 November 2002 |
| Publisher: | IQ http://www.iqconference.org/ |
| Place of publication: | Cambridge, MA, USA |
| First published: | 1 November 2002 |
| Published on edoc: | 4 July 2006 |
| Status: | published, peer-reviewed |
| Full text: | pdf (urn:nbn:de:kobv:11-10065728) |
| URL of first publication: | http://www.iqconference.org/ICIQ/iqdownload.aspx?ICIQYear=2002&File=DeclarativeDataMergingWithConflictResolution.pdf |
| Subject area(s): | Computer Science |
| Keywords (eng): | Data Integration, SQL, Data Cleansing, Databases, Data Fusion, Complex Queries, Information Integration, Data Consolidation |
| Institution: | Humboldt-Universität zu Berlin, Mathematisch-Naturwissenschaftliche Fakultät II |
| Abstract (eng): | Database integration is a growing and increasingly important field in both research and industry. Integration requires many steps, from initial schema integration and schema mapping, to data scrubbing and cleansing, and finally to data merging. While much research has concentrated on the first steps, performed at the schema level, only a few publications address the actual, practical merging of data in an integrated database or in a query against multiple databases. When merging data, especially data from autonomous sources, there is a large potential for decreasing the quality of the merged data, even below the level of the original sources. The main reasons for decreased quality are data conflicts among the sources. To address this problem, we define resolution functions for merging conflicting data. We present several alternatives for merging relational data sources with common queries: through grouping & aggregating and through partitioning & joining. The resulting queries use resolution functions and can be used to migrate data from multiple sources to a target database, or to define an integrating view on multiple sources. We describe and analyze the advantages of the different approaches, and describe our practical solution in the framework of a schema mapping and data transformation tool. |