YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

Abstract

We introduce YOLO11-JDE, a fast and accurate multi-
object tracking (MOT) solution that combines real-time ob-
ject detection with self-supervised Re-Identification (Re-
ID). By incorporating a dedicated Re-ID branch into
YOLO11s, our model performs Joint Detection and Embed-
ding (JDE), generating appearance features for each detec-
tion. The Re-ID branch is trained in a fully self-supervised
setting while simultaneously training for detection, elimi-
nating the need for costly identity-labeled datasets. The
triplet loss, with hard positive and semi-hard negative min-
ing strategies, is used for learning discriminative embed-
dings. Data association is enhanced with a custom tracking
implementation that successfully integrates motion, appear-
ance, and location cues. YOLO11-JDE achieves competi-
tive results on MOT17 and MOT20 benchmarks, surpass-
ing existing JDE methods in terms of FPS and using up to
ten times fewer parameters. Thus, making our method a
highly attractive solution for real-world applications. The
code is publicly available at https://github.com/
inakierregueab/YOLO11-JDE.
OriginalsprogEngelsk
TitelWinter Conference on Applications of Computer Vision Workshops
ForlagIEEE (Institute of Electrical and Electronics Engineers)
Publikationsdato2025
StatusUdgivet - 2025
BegivenhedWinter Conference on Applications of Computer Vision Workshops, - Tucson , USA
Varighed: 28 feb. 20254 mar. 2025

Konference

KonferenceWinter Conference on Applications of Computer Vision Workshops,
Land/OmrådeUSA
ByTucson
Periode28/02/202504/03/2025

Fingeraftryk

Dyk ned i forskningsemnerne om 'YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID'. Sammen danner de et unikt fingeraftryk.

Citationsformater