Show simple item record

2005-01-01Konferenzveröffentlichung DOI: 10.18452/9201
(Almost) Hands-Off Information Integration for the Life Sciences
dc.contributor.authorLeser, Ulf
dc.contributor.authorNaumann, Felix
dc.date.accessioned2017-06-17T00:20:44Z
dc.date.available2017-06-17T00:20:44Z
dc.date.created2006-06-29
dc.date.issued2005-01-01
dc.identifier.otherhttp://www-db.cs.wisc.edu/cidr/cidr2005/cidr05cd-rom.zip
dc.identifier.urihttp://edoc.hu-berlin.de/18452/9853
dc.description.abstractData integration in complex domains, such as the life sciences, involves either manual data curation, offering highest information quality at highest price, or follows a schema integration and mapping approach, leading to moderate information quality at a moderate price. We suggest a radically differ-ent integration approach, called ALADIN, for the life sciences application domain. The predominant feature of the ALADIN system is an architecture that allows almost automatic integration of new data sources into the system, i.e., it offers data in-tegration at almost no cost. We suggest a novel combination of data and text mining, schema matching, and duplicate detection to combat the reduction in information quality that seems inevitable when demanding a high degree of automatism. These heuristics can also lead to the detection of previously unknown or unseen rela-tionships between objects, thus directly supporting the discovery-based work of life science research-ers. We argue that such a system is a valuable con-tribution in two areas. First, it offers challenging and new problems for database research. Second, the ALADIN system would be a valuable knowl-edge resource for life science research.eng
dc.language.isoeng
dc.publisherHumboldt-Universität zu Berlin, Mathematisch-Naturwissenschaftliche Fakultät II
dc.subjectData Integrationeng
dc.subjectSchema Matchingeng
dc.subjectDuplicate Detectioneng
dc.subjectSchema Managementeng
dc.subject.ddc004 Informatik
dc.title(Almost) Hands-Off Information Integration for the Life Sciences
dc.typeconferenceObject
dc.identifier.urnurn:nbn:de:kobv:11-10065418
dc.identifier.doihttp://dx.doi.org/10.18452/9201
local.edoc.container-title2. CIDR 2005
local.edoc.container-title2. CIDR 2005
local.edoc.container-titleCIDR 2005, Second Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 4-7, 2005, Online Proceedings
local.edoc.fp-subtypepaper
local.edoc.type-nameKonferenzveröffentlichung
local.edoc.institutionMathematisch-Naturwissenschaftliche Fakultät II
local.edoc.container-typeconference
local.edoc.container-type-nameKonferenz
local.edoc.container-urlhttp://www-db.cs.wisc.edu/cidr/cidr2005/index.html
local.edoc.container-publisher-nameCIDR
local.edoc.container-publisher-placeAsilomar, CA, USA
local.edoc.container-eventCIDR 2005, Second Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 4-7, 2005, Online Proceedings, 2005, pp 131-143, 2. CIDR 2005, Asilomar, CA, USA, 04.01.2005 - 07.01.2005
local.edoc.container-year2005
local.edoc.container-firstpage131
local.edoc.container-lastpage143
dc.description.versionPeer Reviewed

Show simple item record