Improving a real-time object detector with compact temporal information

Martin Ahrnbom, Morten Bornø Jensen, Kalle Åström, Mikael Nilsson, Håkan Ardö, Thomas B. Moeslund

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

2 Citations (Scopus)
548 Downloads (Pure)

Abstract

Neural networks designed for real-time object detection have recently improved significantly, but in practice, look- ing at only a single RGB image at the time may not be ideal. For example, when detecting objects in videos, a foreground detection algorithm can be used to obtain compact temporal data, which can be fed into a neural network alongside RGB images. We propose an approach for doing this, based on an existing object detector, that re-uses pretrained weights for the processing of RGB images. The neural network was tested on the VIRAT dataset with annotations for object de- tection, a problem this approach is well suited for. The ac- curacy was found to improve significantly (up to 66%), with a roughly 40% increase in computational time.
Original languageEnglish
Title of host publication2017 IEEE International Conference on Computer Vision Workshop (ICCVW): Computer Vision for Road Scene Understanding and Autonomous Driving workshop
Number of pages8
PublisherIEEE
Publication date2017
Pages190-197
ISBN (Electronic)978-1-5386-1034-3
DOIs
Publication statusPublished - 2017
Event2017 IEEE International Conference on Computer Vision Workshop (ICCVW): Computer Vision for Road Scene Understanding and Autonomous Driving workshop - PALAZZO DEL CINEMA - VENICE CONVENTION CENTER, Venice, Italy
Duration: 22 Oct 201729 Oct 2017

Conference

Conference2017 IEEE International Conference on Computer Vision Workshop (ICCVW)
LocationPALAZZO DEL CINEMA - VENICE CONVENTION CENTER
Country/TerritoryItaly
CityVenice
Period22/10/201729/10/2017
SeriesIEEE International Conference on Computer Vision Workshops (ICCVW)
ISSN2473-9944

Fingerprint

Dive into the research topics of 'Improving a real-time object detector with compact temporal information'. Together they form a unique fingerprint.

Cite this