Contextual-guided Bag-of-visual-words model for Multi-class object categorization

Mehdi Mirza-Mohammadi*, Sergio Escalera, Petia Radeva

*Corresponding author for this work

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

9 Citations (Scopus)

Abstract

Bag-of-words model (BOW) is inspired by the text classification problem, where a document is represented by an unsorted set of contained words. Analogously, in the object categorization problem, an image is represented by an unsorted set of discrete visual words (BOVW). In these models, relations among visual words are performed after dictionary construction. However, close object regions can have far descriptions in the feature space, being grouped as different visual words. In this paper, we present a method for considering geometrical information of visual words in the dictionary construction step. Object interest regions are obtained by means of the Harris-Affine detector and then described using the SIFT descriptor. Afterward, a contextual-space and a feature-space are defined, and a merging process is used to fuse feature words based on their proximity in the contextual-space. Moreover, we use the Error Correcting Output Codes framework to learn the new dictionary in order to perform multi-class classification. Results show significant classification improvements when spatial information is taken into account in the dictionary construction step.

Original languageEnglish
Title of host publicationComputer Analysis of Images and Patterns - 13th International Conference, CAIP 2009, Proceedings
Number of pages9
Publication date2009
Pages748-756
ISBN (Print)3642037666, 9783642037665
DOIs
Publication statusPublished - 2009
Externally publishedYes
Event13th International Conference on Computer Analysis of Images and Patterns, CAIP 2009 - Munster, Germany
Duration: 2 Sept 20094 Sept 2009

Conference

Conference13th International Conference on Computer Analysis of Images and Patterns, CAIP 2009
Country/TerritoryGermany
CityMunster
Period02/09/200904/09/2009
SponsorUniversity of Münster, International Association for Pattern Recognition, Olympus Soft Imaging Solutions GmbH, Philips
SeriesLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5702 LNCS
ISSN0302-9743

Fingerprint

Dive into the research topics of 'Contextual-guided Bag-of-visual-words model for Multi-class object categorization'. Together they form a unique fingerprint.

Cite this