The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos

Chris Holmberg Bahnsen; Andreas Møgelmose; Thomas B. Moeslund

Research output: Working paper/Preprint › Working paper › Research

Abstract

This technical report introduces two annotation toolboxes that enable the creation of pixel-based and polygon-based masks as well as bounding boxes around objects of interest. Both toolboxes support the annotation of sequential images in the RGB and thermal modalities. Each annotated object is assigned a classification tag, a unique ID, and one or more optional metadata tags. The toolboxes are written in C++ with the OpenCV and Qt libraries and are operated through a visual interface and an extensive range of keyboard shortcuts. Pre-built binaries are available for Windows and macOS, and the tools can also be built from source under Linux. So far, tens of thousands of frames have been annotated using the toolboxes.

Original language	English
Publisher	arXiv
Publication status	Published - 11 Sept 2018

Bibliographical note

Bahnsen, C. H., Møgelmose, A., & Moeslund, T. B. (2018). The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos. arXiv.org

Access to Document

Bahnsen_AAU_Annotation_Toolboxes (Submitted manuscript, 1 MB). Licence: CC BY 4.0

Cite this

@techreport{a8d047eec1d548ccaa3d979b9b0a1730,

title = "The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos",

abstract = "This tech report gives an introduction to two annotation toolboxes that enable the creation of pixel and polygon-based masks as well as bounding boxes around objects of interest. Both toolboxes support the annotation of sequential images in the RGB and thermal modalities. Each annotated object is assigned a classification tag, a unique ID, and one or more optional meta data tags. The toolboxes are written in C++ with the OpenCV and Qt libraries and are operated by using the visual interface and the extensive range of keyboard shortcuts. Pre-built binaries are available for Windows and MacOS and the tools can be built from source under Linux as well. So far, tens of thousands of frames have been annotated using the toolboxes.",

author = "Bahnsen, {Chris Holmberg} and M{\o}gelmose, Andreas and Moeslund, {Thomas B.}",

note = "Bahnsen, C. H., M{\o}gelmose, A., \& Moeslund, T. B. (2018). The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos. arXiv.org",

year = "2018",

month = sep,

day = "11",

language = "English",

publisher = "arXiv",

type = "WorkingPaper",

institution = "arXiv",

}

TY - UNPB

T1 - The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos

AU - Bahnsen, Chris Holmberg

AU - Møgelmose, Andreas

AU - Moeslund, Thomas B.

N1 - Bahnsen, C. H., Møgelmose, A., & Moeslund, T. B. (2018). The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos. arXiv.org

PY - 2018/9/11

Y1 - 2018/9/11

N2 - This tech report gives an introduction to two annotation toolboxes that enable the creation of pixel and polygon-based masks as well as bounding boxes around objects of interest. Both toolboxes support the annotation of sequential images in the RGB and thermal modalities. Each annotated object is assigned a classification tag, a unique ID, and one or more optional meta data tags. The toolboxes are written in C++ with the OpenCV and Qt libraries and are operated by using the visual interface and the extensive range of keyboard shortcuts. Pre-built binaries are available for Windows and MacOS and the tools can be built from source under Linux as well. So far, tens of thousands of frames have been annotated using the toolboxes.

M3 - Working paper

BT - The AAU Multimodal Annotation Toolboxes: Annotating Objects in Images and Videos

PB - arXiv

ER -