Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

Sarthak Yadav, Sergios Theodoridis, Lars Kai Hansen, Zheng-Hua Tan

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Fingerprint

Dive into the research topics of 'Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners'. Together they form a unique fingerprint.

Keyphrases

Computer Science

Engineering