RHEEMix in the data jungle

  • Data analytics are moving beyond the limits of a single platform. In this paper, we present the cost-based optimizer of Rheem, an open-source cross-platform system that copes with these new requirements. The optimizer allocates the subtasks of data analytic tasks to the most suitable platforms. Our main contributions are: (i) a mechanism based on graph transformations to explore alternative execution strategies; (ii) a novel graph-based approach to determine efficient data movement plans among subtasks and platforms; and (iii) an efficient plan enumeration algorithm, based on a novel enumeration algebra. We extensively evaluate our optimizer under diverse real tasks. We show that our optimizer can perform tasks more than one order of magnitude faster when using multiple platforms than when using a single platform.

Download full text files

  • zde22.pdf (English, 1383 KB)
    SHA-512: 804a40def2da20280444231c6dfc8af6700092c641af2f24d226db0f1226d69c4267cd1fd6e73e21258575657d78dc1a03b55f21193a8d90961a1daf3ed15d77

Metadata
Author details:Sebastian Kruse, Zoi Kaoudi, Bertty Contreras-Rojas, Sanjay Chawla, Felix Naumann, Jorge-Arnulfo Quiané-Ruiz
URN:urn:nbn:de:kobv:517-opus4-519443
DOI:https://doi.org/10.25932/publishup-51944
Title of parent work (German):Zweitveröffentlichungen der Universität Potsdam : Reihe der Digital Engineering Fakultät
Subtitle (English):a cost-based optimizer for cross-platform systems
Publication series (Volume number):Zweitveröffentlichungen der Universität Potsdam : Reihe der Digital Engineering Fakultät (22)
Publication type:Postprint
Language:English
Date of first publication:2020/05/18
Publication year:2020
Publishing institution:Universität Potsdam
Release date:2024/04/22
Tag:cross-platform; data processing; polystore; query optimization
Issue:6
Number of pages:26
Source:The VLDB Journal 29, 1287–1310 (2020). https://doi.org/10.1007/s00778-020-00612-x
Organizational units:Digital Engineering Fakultät / Hasso-Plattner-Institut für Digital Engineering GmbH
DDC classification:0 Computer science, information science, general works / 00 Computer science, knowledge, systems / 000 Computer science, information science, general works
Peer review:Refereed
Publishing method:Open Access / Green Open-Access
License (German):CC-BY - Attribution 4.0 International
External remark:Bibliographic entry of the original publication/source