Enabling the CUDA Unified Memory model in Edge, Cloud and HPC offloaded GPU kernels

Raffaele Montella, Diana Di Luccio, Ciro Giuseppe De Vita, Gennaro Mellone, Marco Lapegna, Giuliano Laccetti, Sokol Kosta, Giulio Giunta

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

1 Citationer (Scopus)

Abstract

The use of hardware accelerators, based on code and data offloading devoted to overcoming the CPU limitations in cores, is one of the main distinctive trends in high-end computing and related applications in the last decade. However, while code offloading is convenient for performance improvement, becoming a commonly used paradigm, memory access and management are a source of bottlenecks due to the need to interact with different address spaces. In this regard, NVidia introduced the CUDA Unified Memory model to avoid explicit memory copies between the machine hosting the accelerator device and the device itself and vice-versa. This paper shows a novel design and implementation of the support to the CUDA Unified Memory in open-source GPGPU virtualization services. The performance evaluation demonstrates that the overhead due to the virtualization and remoting is acceptable considering the possibility of sharing CUDA-enabled GPUs between various and heterogeneous machines hosted at the edge, in cloud infrastructures, or as accelerator nodes in an HPC scenario. A prototype implementation of the proposed solution is available as open-source.

OriginalsprogEngelsk
TitelProceedings - 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022
RedaktørerMaria Fazio, Dhabaleswar K. Panda, Radu Prodan, Valeria Cardellini, Burak Kantarci, Omer Rana, Massimo Villari
Antal sider8
ForlagIEEE Signal Processing Society
Publikationsdato2022
Sider834-841
ISBN (Trykt)978-1-6654-9957-6
ISBN (Elektronisk)978-1-6654-9956-9
DOI
StatusUdgivet - 2022
Begivenhed22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022 - Taormina, Italien
Varighed: 16 maj 202219 maj 2022

Konference

Konference22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022
Land/OmrådeItalien
ByTaormina
Periode16/05/202219/05/2022
NavnProceedings - 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing, CCGrid 2022

Bibliografisk note

Publisher Copyright:
© 2022 IEEE.

Fingeraftryk

Dyk ned i forskningsemnerne om 'Enabling the CUDA Unified Memory model in Edge, Cloud and HPC offloaded GPU kernels'. Sammen danner de et unikt fingeraftryk.
  • CUDA virtualization and remoting for GPGPU based acceleration offloading at the edge

    Mentone, A., Di Luccio, D., Landolfi, L., Kosta, S. & Montella, R., 10 nov. 2019, The 12th International Conference on Internet and Distributed Computing Systems . Montella, R., Ciaramella, A., Fortino, G., Guerrieri, A. & Liotta, A. (red.). Springer, Bind 11874. s. 414-423 10 s. (Lecture Notes in Computer Science).

    Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

    Åben adgang
    Fil
    4 Citationer (Scopus)
    269 Downloads (Pure)
  • A virtualized software based on the NVIDIA cuFFT library for image denoising: performance analysis

    Galletti, A., Marcellino, L., Montella, R., Santopietro, V. & Kosta, S., 17 sep. 2017, I: Procedia Computer Science. 113, s. 496 - 501 6 s.

    Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

    Åben adgang
    Fil
    1 Citationer (Scopus)
    173 Downloads (Pure)

Citationsformater