Action Recognition from RGB-D Data: Comparison and Fusion of Spatio-Temporal Handcrafted Features and Deep Strategies

Maryam Asadi-Aghbolaghi, Hugo Bertiche, Vicent Roig, Shohreh Kasaei, Sergio Escalera

Publication: Contribution to book/anthology/report/conference proceeding; Conference article in proceeding; Research; peer-reviewed

17 Citations (Scopus)

Abstract

In this work, multimodal fusion of RGB-D data is analyzed for action recognition by using scene flow as early fusion and integrating the results of all modalities in a late fusion fashion. Recently, there has been a migration from traditional handcrafted features to deep learning. However, handcrafted features are still widely used owing to their high performance and low computational complexity. In this research, multimodal dense trajectories (MMDT) are proposed to describe RGB-D videos. Dense trajectories are pruned based on scene flow data. In addition, the 2DCNN is extended to a multimodal version (MM2DCNN) by adding one more stream (scene flow) as input and then fusing the outputs of all models. We evaluate and compare the results from each modality and their fusion on two action datasets. The experimental results show that the new representation improves accuracy. Furthermore, the fusion of handcrafted and learning-based features boosts the final performance, achieving state-of-the-art results.
Original language: English
Title: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
Number of pages: 10
Publisher: IEEE Communications Society
Publication date: 29 Oct. 2017
Pages: 3179-3188
Article number: 8265587
ISBN (Print): 978-1-5386-1035-0
DOI
Status: Published - 29 Oct. 2017
Published externally: Yes
Event: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW) - Venice, Italy
Duration: 22 Oct. 2017 to 29 Oct. 2017

Conference

Conference: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW)
Location: Venice, Italy
Period: 22/10/2017 to 29/10/2017
