Audio-Visual Classification of Sports Types

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

7 Citationer (Scopus)
345 Downloads (Pure)

Abstract

In this work we propose a method for classification of sports types from combined audio and visual features extracted from thermal video. From audio Mel Frequency Cepstral Coefficients (MFCC) are extracted, and PCA are
applied to reduce the feature space to 10 dimensions. From the visual modality short trajectories are constructed to represent the motion of players. From these, four motion features are extracted and combined directly with audio features for classification. A k-nearest neighbour classifier is applied for classification of 180 1-minute video sequences from three sports types. Using 10-fold cross validation a correct classification rate of 96.11% is obtained with multimodal features, compared to 86.67% and 90.00% using only visual or audio features, respectively.
OriginalsprogEngelsk
Titel2015 IEEE International Conference on Computer Vision Workshops (ICCVW)
ForlagIEEE
Publikationsdatodec. 2015
Sider768-773
ISBN (Trykt)978-1-4673-8390-5
DOI
StatusUdgivet - dec. 2015
BegivenhedInternational Conference on Computer Vision - Santiago, Chile
Varighed: 12 dec. 201518 dec. 2015

Konference

KonferenceInternational Conference on Computer Vision
Land/OmrådeChile
BySantiago
Periode12/12/201518/12/2015

Fingeraftryk

Dyk ned i forskningsemnerne om 'Audio-Visual Classification of Sports Types'. Sammen danner de et unikt fingeraftryk.

Citationsformater