
Data integration with WETSUIT

WETSUIT (Web EnTity Search and fUsIon Tool)

WETSUIT is a new, powerful open-source mashup tool for searching and integrating web data from diverse sources and domain-specific entity search engines. It supports adaptive search strategies that query sets of relevant entities with minimal communication overhead. Mashups are composed from a set of high-level operators based on Scala, a Java-compatible language. The operator implementation supports a high degree of parallel processing; in particular, entities are streamed between all data transformation operations, so intermediate results can be presented quickly.
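The streaming idea described above can be illustrated with a minimal Scala sketch. It is not WETSUIT's API: `Entity`, `search`, `filterCited`, `normalize`, and `pipeline` are hypothetical names invented for this illustration, and the search step merely fabricates placeholder entities instead of contacting a search engine. The sketch shows only how lazily chained operators pass each entity through the whole pipeline one at a time; it does not attempt to show parallel execution.

```scala
// Hypothetical sketch: none of these names come from WETSUIT itself.
final case class Entity(source: String, title: String, citations: Int)

object StreamingSketch {
  // Stand-in for an entity search engine; yields placeholder results lazily.
  def search(source: String, titles: Seq[String]): Iterator[Entity] =
    titles.iterator.zipWithIndex.map { case (t, i) => Entity(source, t, 10 * (i + 1)) }

  // Operators are plain Iterator transformations, so an entity flows through
  // the whole chain as soon as the consumer requests it.
  def filterCited(min: Int)(in: Iterator[Entity]): Iterator[Entity] =
    in.filter(_.citations >= min)

  def normalize(in: Iterator[Entity]): Iterator[Entity] =
    in.map(e => e.copy(title = e.title.trim.toLowerCase))

  // Compose the operators; nothing is computed until the result is consumed.
  def pipeline(titles: Seq[String]): Iterator[Entity] =
    normalize(filterCited(20)(search("demo", titles)))

  def main(args: Array[String]): Unit =
    pipeline(Seq(" Paper A ", "Paper B", "Paper C")).foreach(println)
}
```

Because every operator consumes and produces an `Iterator`, the first matching entity can be displayed before the remaining input has been processed, which is the property that makes early presentation of intermediate results possible.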

Demonstration Mashups

  • Online Citation Service lets you look up Google Scholar citation counts for any author or venue listed at DBLP. The references to be analyzed can also be supplied as a CSV or BibTeX file.
  • SimPubFinder lets you find the papers that cite the publications listed in a BibTeX or CSV input file.

Source Code and Documentation

  • will be published soon

Publications

Endrullis, S.; Thor, A.; Rahm, E.
WETSUIT: An Efficient Mashup Tool for Searching and Fusing Web Entities
Proc. 38th Intl. Conference on Very Large Data Bases (VLDB) / Proceedings of the VLDB Endowment 5(12), 2012 (demo)
2012-08

Endrullis, S.; Thor, A.; Rahm, E.
Entity Search Strategies for Mashup Applications
Proc. 28th Intl. Conference on Data Engineering (ICDE), 2012
2012-04

Thor, A.; Rahm, E.
CloudFuice: A flexible Cloud-based Data Integration System
Proc. 11th Intl. Conference on Web Engineering (ICWE), 2011
2011-06

Endrullis, S.; Thor, A.; Rahm, E.
Evaluation of Query Generators for Entity Search Engines
Proc. Intl. Workshop on Using Search Engine Technology for Information Management (USETIM), 2009
2009-08

Thor, A.; Aumueller, D.; Rahm, E.
Data Integration Support for Mashups
Proc. 6th Intl. Workshop on Information Integration on the Web (IIWeb), 2007
2007-07