Depth Value Pre-Processing for Accurate Transfer Learning Based RGB-D Object Recognition

Andreas Aakerberg, Kamal Nasrollahi, Christoffer Bøgelund Rasmussen, Thomas B. Moeslund

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

12 Citations (Scopus)
954 Downloads (Pure)

Abstract

Object recognition is one of the important tasks in computer vision which has found enormous applications.Depth modality is proven to provide supplementary information to the common RGB modality for objectrecognition. In this paper, we propose methods to improve the recognition performance of an existing deeplearning based RGB-D object recognition model, namely the FusionNet proposed by Eitel et al. First, we showthat encoding the depth values as colorized surface normals is beneficial, when the model is initialized withweights learned from training on ImageNet data. Additionally, we show that the RGB stream of the FusionNetmodel can benefit from using deeper network architectures, namely the 16-layered VGGNet, in exchange forthe 8-layered CaffeNet. In combination, these changes improves the recognition performance with 2.2% incomparison to the original FusionNet, when evaluating on the Washington RGB-D Object Dataset.
Original languageEnglish
Title of host publicationInternational Joint Conference on Computational Intelligence
PublisherSCITEPRESS Digital Library
Publication date2017
Pages121-128
ISBN (Print)978-989-758-274-5
DOIs
Publication statusPublished - 2017
EventInternational Joint Conference on Computational Intelligence - Funchal, Portugal
Duration: 1 Nov 20173 Nov 2017
Conference number: 9
http://www.ijcci.org/

Conference

ConferenceInternational Joint Conference on Computational Intelligence
Number9
Country/TerritoryPortugal
CityFunchal
Period01/11/201703/11/2017
Internet address

Keywords

  • Deep Learning
  • Surface Normals
  • Computer Vision
  • Artificial Vision
  • RGB-D
  • Convolutional Neural Networks
  • TransferLearning

Fingerprint

Dive into the research topics of 'Depth Value Pre-Processing for Accurate Transfer Learning Based RGB-D Object Recognition'. Together they form a unique fingerprint.

Cite this