Multimodal Sentiment and Personality Perception Under Speech: A Comparison of Transformer-based Architectures

Ádám Fodor, Rachid R. Saboundji, Julio C.S. Jacques Junior, Sergio Escalera, David Gallardo, Andras Lorincz

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

2 Citationer (Scopus)

Abstract

Human-machine, human-robot interaction, and collaboration appear in diverse fields, from homecare to Cyber-Physical Systems. Technological development is fast, whereas real-time methods for social communication analysis that can measure small changes in sentiment and personality states, including visual, acoustic and language modalities are lagging, particularly when the goal is to build robust, appearance invariant, and fair methods. We study and compare methods capable of fusing modalities while satisfying real-time and invariant appearance conditions. We compare state-of-the-art transformer architectures in sentiment estimation and introduce them in the much less explored field of personality perception. We show that the architectures perform differently on automatic sentiment and personality perception, suggesting that each task may be better captured/modeled by a particular method. Our work calls attention to the attractive properties of the linear versions of the transformer architectures. In particular, we show that the best results are achieved by fusing the different architectures’ preprocessing methods. However, they pose extreme conditions in computation power and energy consumption for real-time computations for quadratic transformers due to their memory requirements. In turn, linear transformers pave the way for quantifying small changes in sentiment estimation and personality perception for real-time social communications for machines and robots.

OriginalsprogEngelsk
BogserieProceedings of Machine Learning Research
Vol/bind173
Sider (fra-til)218-241
Antal sider24
ISSN2640-3498
StatusUdgivet - 2021
BegivenhedChaLearn LAP Challenge on Understanding Social Behavior in Dyadic and Small Group Interactions Workshop, DYAD 2021, held in conjunction with the International Conference on Computer Vision, ICCV 2021 - Virtual, Online
Varighed: 16 okt. 2021 → …

Konference

KonferenceChaLearn LAP Challenge on Understanding Social Behavior in Dyadic and Small Group Interactions Workshop, DYAD 2021, held in conjunction with the International Conference on Computer Vision, ICCV 2021
ByVirtual, Online
Periode16/10/2021 → …

Bibliografisk note

Publisher Copyright:
© 2022 Fodor, R.R. Saboundji, J.C.S.J. Junior, S. Escalera, D. Gallardo & A. Lorincz.

Fingeraftryk

Dyk ned i forskningsemnerne om 'Multimodal Sentiment and Personality Perception Under Speech: A Comparison of Transformer-based Architectures'. Sammen danner de et unikt fingeraftryk.

Citationsformater