Enabling the CUDA Unified Memory model in Edge, Cloud and HPC offloaded GPU kernels

Raffaele Montella, Diana Di Luccio, Ciro Giuseppe De Vita, Gennaro Mellone, Marco Lapegna, Giuliano Laccetti, Sokol Kosta, Giulio Giunta

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

1 Citation (Scopus)

Abstract

The use of hardware accelerators, based on code and data offloading devoted to overcoming the CPU limitations in cores, is one of the main distinctive trends in high-end computing and related applications in the last decade. However, while code offloading is convenient for performance improvement, becoming a commonly used paradigm, memory access and management are a source of bottlenecks due to the need to interact with different address spaces. In this regard, NVidia introduced the CUDA Unified Memory model to avoid explicit memory copies between the machine hosting the accelerator device and the device itself and vice-versa. This paper shows a novel design and implementation of the support to the CUDA Unified Memory in open-source GPGPU virtualization services. The performance evaluation demonstrates that the overhead due to the virtualization and remoting is acceptable considering the possibility of sharing CUDA-enabled GPUs between various and heterogeneous machines hosted at the edge, in cloud infrastructures, or as accelerator nodes in an HPC scenario. A prototype implementation of the proposed solution is available as open-source.

Original languageEnglish
Title of host publicationProceedings - 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022
EditorsMaria Fazio, Dhabaleswar K. Panda, Radu Prodan, Valeria Cardellini, Burak Kantarci, Omer Rana, Massimo Villari
Number of pages8
PublisherIEEE Signal Processing Society
Publication date2022
Pages834-841
ISBN (Print)978-1-6654-9957-6
ISBN (Electronic)978-1-6654-9956-9
DOIs
Publication statusPublished - 2022
Event22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022 - Taormina, Italy
Duration: 16 May 202219 May 2022

Conference

Conference22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022
Country/TerritoryItaly
CityTaormina
Period16/05/202219/05/2022
SeriesProceedings - 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022

Bibliographical note

Funding Information:
This work is supported in part by the grant ADMIRE (Adaptive Multi-tier intelligent data manager for Exascale - H2020-JTI-EuroHPC-2019-1), in part by the grant MytilAI (Modelling mytilus farming with Artificial Intelligence technologies - CUP I65F21000040002), and it is conducted in the framework of the research agreement “High-Performance Computing at the Edge” between the Department of Mathematics and Applications of the University of Naples Federico II and the Department of Sciences and Technologies of the University of Naples Parthenope.

Publisher Copyright:
© 2022 IEEE.

Keywords

  • CUDA
  • GPU
  • offloading
  • remoting
  • unified memory access
  • virtualization

Fingerprint

Dive into the research topics of 'Enabling the CUDA Unified Memory model in Edge, Cloud and HPC offloaded GPU kernels'. Together they form a unique fingerprint.
  • CUDA virtualization and remoting for GPGPU based acceleration offloading at the edge

    Mentone, A., Di Luccio, D., Landolfi, L., Kosta, S. & Montella, R., 10 Nov 2019, The 12th International Conference on Internet and Distributed Computing Systems . Montella, R., Ciaramella, A., Fortino, G., Guerrieri, A. & Liotta, A. (eds.). Springer, Vol. 11874. p. 414-423 10 p. (Lecture Notes in Computer Science).

    Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

    Open Access
    File
    4 Citations (Scopus)
    272 Downloads (Pure)
  • A virtualized software based on the NVIDIA cuFFT library for image denoising: performance analysis

    Galletti, A., Marcellino, L., Montella, R., Santopietro, V. & Kosta, S., 17 Sept 2017, In: Procedia Computer Science. 113, p. 496 - 501 6 p.

    Research output: Contribution to journalConference article in JournalResearchpeer-review

    Open Access
    File
    1 Citation (Scopus)
    173 Downloads (Pure)

Cite this