Abteilung Datenbanken Leipzig (https://old.dbs.uni-leipzig.de)

Web Data Integration

Most proposed approaches on data integration rely on the notion of a global schema to provide a unified and consistent view of the underlying data sources. While it has been successful for data warehouses, the effort to integrate new sources usually is high. This makes it difficult for such approaches to scale to many sources. Furthermore, for virtual data integration it is challenging to obtain a good data quality.

Our work on web data integration focusses on dynamic information fusion of data sources available on the web. Similar to the idea of mashups, we want to achieve a fast development of data integration applications by reusing existing services and entity search engines within a workflow-like data integration. Integration workflows are defined using a script language supporting powerful generic operators.

Projects

Our work on web data integration contains the following projects:

Project Members

Publications

PDF [12]

Google Scholar [13]
[14]Thor, A. [15]; Rahm, E. [16]
CloudFuice: A flexible Cloud-based Data Integration System [17]
Proc. of 10th Intl. Conference on Web Engineering (ICWE), 2011
2011-06 [18]
PDF [19]

Google Scholar [20]
[21]Rahm, E. [22]; Thor, A. [23]; Aumueller, D. [24]
Dynamic Fusion of Web Data [25]
Proc. 5th Intl. XML Database Symposium (XSym), 2007
2007-09 [26]
PDF [27]

Google Scholar [28]
publication icon [29]Thor, Andreas [30]; Aumueller, David [31]; Rahm, Erhard [32]
Data Integration Support for Mashups [33]
Proc. 6th Intl. Workshop on Information Integration on the Web (IIWeb), 2007
2007-07 [34]
PDF [35]

Google Scholar [36]
[37]Kirsten, T. [38]; Thor, A. [39]; Rahm, E. [40]
Instance-based matching of large life science ontologies [41]
Proc. of 4th Intl. Workshop on Data Integration in the Life Sciences (DILS), 2007
2007-06 [42]
PDF [43]

Google Scholar [44]
[45]Thor, A. [46]; Kirsten, T. [47]; Rahm, E. [48]
Instance-based matching of hierarchical ontologies [49]
Proc. of 12. GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web (BTW), 2007
2007-03 [50]
PDF [51]

Google Scholar [52]
[53]Köpcke, H. [54]; Rahm, E. [55]
Analyse von Zitierungshäufigkeiten für die Datenbankkonferenz BTW [56]
Datenbank-Spektrum, 7. Jahrgang, Heft 20
2007-02 [57]
PDF [58]

Google Scholar [59]
[60]Thor, A. [61]; Rahm, E. [62]
MOMA - A Mapping-based Object Matching System [63]
Proc. 3rd Conference on Innovative Data Systems Research (CIDR), 2007
2007-01 [64]
PDF [65]
further information [66]
Google Scholar [67]
[68]Kirsten, Toralf [69]; Rahm, Erhard [70]
BioFuice: Mapping-based data integration in bioinformatics [71]
Proc. of 3rd Int. Workshop on Data Integration in the Life Sciences (DILS), Springer LNCS 4075, 2006
2006-07 [72]
PDF [73]
further information [74]
Google Scholar [75]
[76]Rahm, E. [77]; Thor, A. [78]
Citation analysis of database publications [79]
ACM Sigmod Record 24(4), 2005
2005-12 [80]
PDF [81]

Google Scholar [82]
[83]Rahm, E. [84]; Thor, A. [85]; Aumueller, D. [86]; Do, H.H. [87]; Golovin, N. [88]; Kirsten, T. [89]
iFuice - Information Fusion utilizing Instance Correspondences and Peer Mappings [90]
Proc. 8th Intl. Workshop on the Web and Databases (WebDB), 2005
2005-06 [91]


Google Scholar [92]
publication icon [93]Kirsten, T. [94]; Rahm, E. [95]
BioFuice: A decentralized Approach to integrate molecular-biological Data [96]
Proc 4th Research Festival for Life Sciences, Leipzig, Dec. 2005
2005 [97]

URL:
https://old.dbs.uni-leipzig.de/de/projekte/DATAINT/index.html