YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

We introduce YOLO11-JDE, a fast and accurate multi-
object tracking (MOT) solution that combines real-time ob-
ject detection with self-supervised Re-Identification (Re-
ID). By incorporating a dedicated Re-ID branch into
YOLO11s, our model performs Joint Detection and Embed-
ding (JDE), generating appearance features for each detec-
tion. The Re-ID branch is trained in a fully self-supervised
setting while simultaneously training for detection, elimi-
nating the need for costly identity-labeled datasets. The
triplet loss, with hard positive and semi-hard negative min-
ing strategies, is used for learning discriminative embed-
dings. Data association is enhanced with a custom tracking
implementation that successfully integrates motion, appear-
ance, and location cues. YOLO11-JDE achieves competi-
tive results on MOT17 and MOT20 benchmarks, surpass-
ing existing JDE methods in terms of FPS and using up to
ten times fewer parameters. Thus, making our method a
highly attractive solution for real-world applications. The
code is publicly available at https://github.com/
inakierregueab/YOLO11-JDE.
Original languageEnglish
Title of host publicationWinter Conference on Applications of Computer Vision Workshops
PublisherIEEE (Institute of Electrical and Electronics Engineers)
Publication date2025
Publication statusPublished - 2025
EventWinter Conference on Applications of Computer Vision Workshops, - Tucson , United States
Duration: 28 Feb 20254 Mar 2025

Conference

ConferenceWinter Conference on Applications of Computer Vision Workshops,
Country/TerritoryUnited States
CityTucson
Period28/02/202504/03/2025

Fingerprint

Dive into the research topics of 'YOLO11-JDE: Fast and Accurate Multi-Object Tracking with Self-Supervised Re-ID'. Together they form a unique fingerprint.

Cite this