A Context-Aware Loss Function for Action Spotting in Soccer Videos

Anthony Cioppa; Adrien Deliege; Silvio Giancola; Bernard Ghanem; Marc Van Droogenbroeck; Rikke Gade; Thomas B. Moeslund

doi:10.1109/CVPR42600.2020.01314

A Context-Aware Loss Function for Action Spotting in Soccer Videos

Anthony Cioppa, Adrien Deliege, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck, Rikke Gade, Thomas B. Moeslund

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

49 Citationer (Scopus)

82 Downloads (Pure)

Abstract

In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps. In this paper, we propose a novel loss function that specifically considers the temporal context naturally present around each action, rather than focusing on the single annotated frame to spot. We benchmark our loss on a large dataset of soccer videos, SoccerNet, and achieve an improvement of 12.8% over the baseline. We show the generalization capability of our loss for generic activity proposals and detection on ActivityNet, by spotting the beginning and the end of each activity. Furthermore, we provide an extended ablation study and display challenging cases for action spotting in soccer videos. Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation.

Originalsprog	Engelsk
Titel	2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Antal sider	11
Forlag	IEEE
Publikationsdato	jun. 2020
Sider	13123-13133
ISBN (Trykt)	978-1-7281-7169-2
ISBN (Elektronisk)	978-1-7281-7168-5
DOI	https://doi.org/10.1109/CVPR42600.2020.01314
Status	Udgivet - jun. 2020
Begivenhed	2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) - Seattle, USA Varighed: 14 jun. 2020 → 19 jun. 2020

Konference

Konference	2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Land/Område	USA
By	Seattle
Periode	14/06/2020 → 19/06/2020

Navn	I E E E Conference on Computer Vision and Pattern Recognition. Proceedings
ISSN	1063-6919

Adgang til dokumentet

10.1109/CVPR42600.2020.01314

Open Access manuscriptAccepteret manuskript, 900 KBLicens: CC BY 4.0

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

Link to publication in Scopus

Citationsformater

@inproceedings{168323c9927e431aa0c2499bf21b0b85,

title = "A Context-Aware Loss Function for Action Spotting in Soccer Videos",

abstract = "In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps. In this paper, we propose a novel loss function that specifically considers the temporal context naturally present around each action, rather than focusing on the single annotated frame to spot. We benchmark our loss on a large dataset of soccer videos, SoccerNet, and achieve an improvement of 12.8% over the baseline. We show the generalization capability of our loss for generic activity proposals and detection on ActivityNet, by spotting the beginning and the end of each activity. Furthermore, we provide an extended ablation study and display challenging cases for action spotting in soccer videos. Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation.",

author = "Anthony Cioppa and Adrien Deliege and Silvio Giancola and Bernard Ghanem and Droogenbroeck, {Marc Van} and Rikke Gade and Moeslund, {Thomas B.}",

year = "2020",

month = jun,

doi = "10.1109/CVPR42600.2020.01314",

language = "English",

isbn = "978-1-7281-7169-2",

series = "I E E E Conference on Computer Vision and Pattern Recognition. Proceedings",

publisher = "IEEE",

pages = "13123--13133",

booktitle = "2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)",

address = "United States",

note = "2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ; Conference date: 14-06-2020 Through 19-06-2020",

}

Cioppa, A, Deliege, A, Giancola, S, Ghanem, B, Droogenbroeck, MV, Gade, R & Moeslund, TB 2020, A Context-Aware Loss Function for Action Spotting in Soccer Videos. i 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, I E E E Conference on Computer Vision and Pattern Recognition. Proceedings, s. 13123-13133, 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, Washington, USA, 14/06/2020. https://doi.org/10.1109/CVPR42600.2020.01314

A Context-Aware Loss Function for Action Spotting in Soccer Videos. / Cioppa, Anthony; Deliege, Adrien; Giancola, Silvio et al.
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2020. s. 13123-13133 (I E E E Conference on Computer Vision and Pattern Recognition. Proceedings).

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

TY - GEN

T1 - A Context-Aware Loss Function for Action Spotting in Soccer Videos

AU - Cioppa, Anthony

AU - Deliege, Adrien

AU - Giancola, Silvio

AU - Ghanem, Bernard

AU - Droogenbroeck, Marc Van

AU - Gade, Rikke

AU - Moeslund, Thomas B.

PY - 2020/6

Y1 - 2020/6

N2 - In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps. In this paper, we propose a novel loss function that specifically considers the temporal context naturally present around each action, rather than focusing on the single annotated frame to spot. We benchmark our loss on a large dataset of soccer videos, SoccerNet, and achieve an improvement of 12.8% over the baseline. We show the generalization capability of our loss for generic activity proposals and detection on ActivityNet, by spotting the beginning and the end of each activity. Furthermore, we provide an extended ablation study and display challenging cases for action spotting in soccer videos. Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation.

AB - In video understanding, action spotting consists in temporally localizing human-induced events annotated with single timestamps. In this paper, we propose a novel loss function that specifically considers the temporal context naturally present around each action, rather than focusing on the single annotated frame to spot. We benchmark our loss on a large dataset of soccer videos, SoccerNet, and achieve an improvement of 12.8% over the baseline. We show the generalization capability of our loss for generic activity proposals and detection on ActivityNet, by spotting the beginning and the end of each activity. Furthermore, we provide an extended ablation study and display challenging cases for action spotting in soccer videos. Finally, we qualitatively illustrate how our loss induces a precise temporal understanding of actions and show how such semantic knowledge can be used for automatic highlights generation.

UR - http://www.scopus.com/inward/record.url?scp=85094809735&partnerID=8YFLogxK

U2 - 10.1109/CVPR42600.2020.01314

DO - 10.1109/CVPR42600.2020.01314

M3 - Article in proceeding

SN - 978-1-7281-7169-2

T3 - I E E E Conference on Computer Vision and Pattern Recognition. Proceedings

SP - 13123

EP - 13133

BT - 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

PB - IEEE

T2 - 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Y2 - 14 June 2020 through 19 June 2020

ER -

A Context-Aware Loss Function for Action Spotting in Soccer Videos

Abstract

Konference

Adgang til dokumentet

AUB Link

Andre filer og links

Fingeraftryk

Citationsformater