Abstract
We present a hybrid approach to individualized head-related transfer function (HRTF) modeling which requires only 3 anthropometric measurements and an image of the pinna. A prediction algorithm based on variational autoencoders synthesizes a pinna-related response from the image, which is used to filter a measured head-andtorso response. The interaural time difference is then manipulated to match that of the HUTUBS dataset subject minimizing the predicted localization error. The results are evaluated using spectral distortion and an auditory localization model. While the latter is inconclusive regarding the efficacy of the structural model, the former metric shows promising results with encoding HRTFs. Index Terms: Hardware - Digital signal processing; Computing methodologies - Neural networks; Applied computing - Sound and music computing
Original language | English |
---|---|
Title of host publication | Proceedings - 2021 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, VRW 2021 |
Number of pages | 6 |
Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
Publication date | Mar 2021 |
Pages | 80-85 |
Article number | 9419096 |
ISBN (Electronic) | 9780738113678 |
DOIs | |
Publication status | Published - Mar 2021 |
Event | 2021 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, VRW 2021 - Virtual, Lisbon, Portugal Duration: 27 Mar 2021 → 3 Apr 2021 |
Conference
Conference | 2021 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, VRW 2021 |
---|---|
Country/Territory | Portugal |
City | Virtual, Lisbon |
Period | 27/03/2021 → 03/04/2021 |
Bibliographical note
Publisher Copyright:© 2021 IEEE.
Keywords
- Digital signal processing
- Neural networks
- Sound and music computing