Abstract
We introduce YOLO11-JDE, a fast and accurate multi-
object tracking (MOT) solution that combines real-time ob-
ject detection with self-supervised Re-Identification (Re-
ID). By incorporating a dedicated Re-ID branch into
YOLO11s, our model performs Joint Detection and Embed-
ding (JDE), generating appearance features for each detec-
tion. The Re-ID branch is trained in a fully self-supervised
setting while simultaneously training for detection, elimi-
nating the need for costly identity-labeled datasets. The
triplet loss, with hard positive and semi-hard negative min-
ing strategies, is used for learning discriminative embed-
dings. Data association is enhanced with a custom tracking
implementation that successfully integrates motion, appear-
ance, and location cues. YOLO11-JDE achieves competi-
tive results on MOT17 and MOT20 benchmarks, surpass-
ing existing JDE methods in terms of FPS and using up to
ten times fewer parameters. Thus, making our method a
highly attractive solution for real-world applications. The
code is publicly available at https://github.com/
inakierregueab/YOLO11-JDE.
object tracking (MOT) solution that combines real-time ob-
ject detection with self-supervised Re-Identification (Re-
ID). By incorporating a dedicated Re-ID branch into
YOLO11s, our model performs Joint Detection and Embed-
ding (JDE), generating appearance features for each detec-
tion. The Re-ID branch is trained in a fully self-supervised
setting while simultaneously training for detection, elimi-
nating the need for costly identity-labeled datasets. The
triplet loss, with hard positive and semi-hard negative min-
ing strategies, is used for learning discriminative embed-
dings. Data association is enhanced with a custom tracking
implementation that successfully integrates motion, appear-
ance, and location cues. YOLO11-JDE achieves competi-
tive results on MOT17 and MOT20 benchmarks, surpass-
ing existing JDE methods in terms of FPS and using up to
ten times fewer parameters. Thus, making our method a
highly attractive solution for real-world applications. The
code is publicly available at https://github.com/
inakierregueab/YOLO11-JDE.
Originalsprog | Engelsk |
---|---|
Titel | Winter Conference on Applications of Computer Vision Workshops |
Forlag | IEEE (Institute of Electrical and Electronics Engineers) |
Publikationsdato | 2025 |
Status | Udgivet - 2025 |
Begivenhed | Winter Conference on Applications of Computer Vision Workshops, - Tucson , USA Varighed: 28 feb. 2025 → 4 mar. 2025 |
Konference
Konference | Winter Conference on Applications of Computer Vision Workshops, |
---|---|
Land/Område | USA |
By | Tucson |
Periode | 28/02/2025 → 04/03/2025 |