Challenging SQL-on-Hadoop performance with Apache Druid
العنوان: | Challenging SQL-on-Hadoop performance with Apache Druid |
---|---|
المؤلفون: | Correia, José, Costa, Carlos A. P., Santos, Maribel Yasmina |
المساهمون: | Universidade do Minho |
بيانات النشر: | Springer Verlag, 2019. |
سنة النشر: | 2019 |
مصطلحات موضوعية: | Big Data, Big Data Warehouse, SQL-on-Hadoop, Druid, OLAP, Science & Technology |
الوصف: | In Big Data, SQL-on-Hadoop tools usually provide satisfactory performance for processing vast amounts of data, although new emerging tools may be an alternative. This paper evaluates if Apache Druid, an innovative column-oriented data store suited for online analytical processing workloads, is an alternative to some of the well-known SQL-on-Hadoop technologies and its potential in this role. In this evaluation, Druid, Hive and Presto are benchmarked with increasing data volumes. The results point Druid as a strong alternative, achieving better performance than Hive and Presto, and show the potential of integrating Hive and Druid, enhancing the potentialities of both tools. |
Description (Translated): | This work is supported by COMPETE: POCI-01-0145-FEDER-007043 and FCT - Fundacao para a Ciencia e Tecnologia within Project UID/CEC/00319/2013 and by European Structural and Investment Funds in the FEDER component, COMPETE 2020 (Funding Reference: POCI-01-0247-FEDER-002814). |
وصف الملف: | application/pdf |
اللغة: | English |
Relation: | 9783030204846; 1865-1348 |
DOI: | 10.1007/978-3-030-20485-3_12 |
الاتاحة: | http://hdl.handle.net/1822/66785 |
Rights: | open access |
رقم الانضمام: | rcaap.com.repositorium.repositorium.sdum.uminho.pt.1822.66785 |
قاعدة البيانات: | RCAAP |
كن أول من يترك تعليقا!