Abstract

State-of-the-art (SoTA) detection-based tracking methods mostly accomplish the detection and the identification feature learning tasks separately. Only a few efforts include the joint learning of detection and identification features. This work proposes two novel one-stage trackers by introducing implicit and explicit attention to the tracking research topic. For our tracking system based on implicit attention, we further introduce a novel fusion of feature maps combining information from different abstraction levels. For our tracking system based on explicit attention, we introduce utilization of an additional auxiliary function. These systems outperform the SoTA tracking systems in terms of MOTP (Multi-Object Tracking Precision) and IDF1 score when evaluated on public benchmark datasets including MOT15, MOT16, and MOT17. High MOTP score indicates precise detection of bounding boxes of objects, while high IDF1 score indicates accurate ID detections, which is very crucial for surveillance and security systems. Therefore, proposed systems are good choice for event-detections in surveillance feeds as we are capable of detecting correct ID and precise location
Original languageEnglish
Title of host publicationIntelligent Systems and Applications : Proceedings of the 2021 Intelligent Systems Conference (IntelliSys) Volume 1
EditorsKohei Arai
Number of pages19
Volume2
PublisherSpringer
Publication date2022
Pages736-754
ISBN (Print)978-3-030-82195-1
ISBN (Electronic)978-3-030-82196-8
DOIs
Publication statusPublished - 2022
EventIntelligent Systems Conference 2021 - Amsterdam, Netherlands
Duration: 2 Sept 20213 Sept 2021
https://saiconference.com/IntelliSys

Conference

ConferenceIntelligent Systems Conference 2021
Country/TerritoryNetherlands
CityAmsterdam
Period02/09/202103/09/2021
Internet address
SeriesLecture Notes in Networks and Systems
Volume295
ISSN2367-3370

Keywords

  • Computer vision
  • Deep neural networks
  • One-stage trackers
  • Tracking-by-detection

Fingerprint

Dive into the research topics of 'Attention-Enabled Object Detection to Improve One-Stage Tracker'. Together they form a unique fingerprint.

Cite this