GHive: A Demonstration of GPU-Accelerated Query Processing in Apache Hive

Haotian Liu, Bo Tang, Jiashu Zhang, Yangshen Deng, Xinying Zheng, Qiaomu Shen, Xiao Yan, Dan Zeng, Zunyao Mao, Chaozu Zhang, Zhengxin You, Zhihao Wang, Runzhe Jiang, Fang Wang, Man Lung Yiu, Huan Li, Mingji Han, Qian Li, Zhenghai Luo

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

3 Citationer (Scopus)

Abstract

As a distributed, fault-tolerant data warehouse system for large-scale data analytics, Apache Hive has been used for various applications in many organizations (e.g., Facebook, Amazon, and Huawei). Exploiting the large degrees of parallelism of GPU to improve the performance of online analytical processing (OLAP) in database system is a common practice in the industry. Meanwhile, it is a common practice to exploit the large degrees of parallelism of GPU to improve the performance of online analytical processing (OLAP) in database systems. This demo presents GHive, which enables Apache Hive to accelerate OLAP queries by jointly utilizing CPU and GPU in intelligent and efficient ways. The takeaways for SIGMOD attendees include: (1) the superior performance of GHive compared with vanilla Hive that only uses CPU; (2) intuitive visualizations of execution statistics for Hive and GHive to understand where the acceleration of GHive comes from; (3) detailed profiling of the time taken by each operator on CPU and GPU to show the advantages of GPU execution.

OriginalsprogEngelsk
TitelSIGMOD 2022 - Proceedings of the 2022 International Conference on Management of Data
Antal sider4
ForlagAssociation for Computing Machinery
Publikationsdato10 jun. 2022
Sider2417-2420
ISBN (Elektronisk)9781450392495
DOI
StatusUdgivet - 10 jun. 2022
Begivenhed2022 ACM SIGMOD International Conference on the Management of Data, SIGMOD 2022 - Virtual, Online, USA
Varighed: 12 jun. 202217 jun. 2022

Konference

Konference2022 ACM SIGMOD International Conference on the Management of Data, SIGMOD 2022
Land/OmrådeUSA
ByVirtual, Online
Periode12/06/202217/06/2022
SponsorACM SIGMOD
NavnProceedings of the ACM SIGMOD International Conference on Management of Data
ISSN0730-8078

Bibliografisk note

Publisher Copyright:
© 2022 ACM.

Fingeraftryk

Dyk ned i forskningsemnerne om 'GHive: A Demonstration of GPU-Accelerated Query Processing in Apache Hive'. Sammen danner de et unikt fingeraftryk.

Citationsformater