Abstract
The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, focusing on retrieving all timestamps related to global actions in soccer, (2) ball action spotting, focusing on retrieving all timestamps related to the soccer ball change of state, and (3) dense video captioning, focusing on describing the broadcast with natural language and anchored timestamps. The second theme, field understanding, relates to the single task of (4) camera calibration, focusing on retrieving the intrinsic and extrinsic camera parameters from images. The third and last theme, player understanding, is composed of three low-level tasks related to extracting information about the players: (5) re-identification, focusing on retrieving the same players across multiple views, (6) multiple object tracking, focusing on tracking players and the ball through unedited video streams, and (7) jersey number recognition, focusing on recognizing the jersey number of players from tracklets. Compared to the previous editions of the SoccerNet challenges, tasks (2-3-7) are novel, including new annotations and data, task (4) was enhanced with more data and annotations, and task (6) now focuses on end-to-end approaches. Our report indicates performance trends across tasks: (1) Action spotting is nearing saturation, while (2) ball action spotting improved significantly with advanced end-to-end models. (3) Dense video captioning also saw substantial enhancements aligned with Large Language Models advancements. (4) Camera calibration, redefined end-to-end, demonstrated a significant performance boost. In contrast, (5) player re-identification showed only minor improvements, reflecting decreasing interest. The new (6) multiple object tracking task exhibited notable advances, underscoring the maturity of current techniques. (7) Jersey number recognition received the most focus, achieving impressive results. More information on the tasks, challenges, and leaderboards are available on https://www.soccer-net.org. Baselines and development kits can be found on https://github.com/SoccerNet.
Original language | English |
---|---|
Article number | 24 |
Journal | Sports Engineering |
Volume | 27 |
Issue number | 2 |
ISSN | 1369-7072 |
DOIs | |
Publication status | Published - Dec 2024 |
Bibliographical note
Publisher Copyright:© The Author(s), under exclusive licence to the International Sports Engineering Association 2024.
Keywords
- Artificial intelligence
- Challenges
- Computer vision
- Datasets
- Soccer
- Video understanding