The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos

Chris Holmberg Bahnsen, Andreas Møgelmose, Thomas B. Moeslund

Research output: Working paper/PreprintWorking paperResearch

216 Downloads (Pure)

Abstract

This tech report gives an introduction to two annotation toolboxes that enable the creation of pixel and polygon-based masks as well as bounding boxes around objects of interest. Both toolboxes support the annotation of sequential images in the RGB and thermal modalities. Each annotated object is assigned a classification tag, a unique ID, and one or more optional meta data tags. The toolboxes are written in C++ with the OpenCV and Qt libraries and are operated by using the visual interface and the extensive range of keyboard shortcuts. Pre-built binaries are available for Windows and MacOS and the tools can be built from source under Linux as well. So far, tens of thousands of frames have been annotated using the toolboxes.
Original languageEnglish
PublisherarXiv
Publication statusPublished - 11 Sept 2018

Bibliographical note

Bahnsen, C. H., Møgelmose, A., & Moeslund, T. B. (2018). The AAU Multimodal Annotation Toolboxes:
Annotating Objects in Images and Videos. arXiv.org

Cite this