Latent Classification Models for Binary Data

Helge Langseth, Thomas Dyhre Nielsen

Research output: Contribution to journal › Journal article › Research › peer-review


Abstract

One of the simplest, and yet most consistently well-performing, families of classifiers is the naive Bayes model (a special class of Bayesian network models). However, these models rely on the (naive) assumption that all the attributes used to describe an instance are conditionally independent given the class of that instance. To relax this independence assumption, we have in previous work proposed a family of models called latent classification models (LCMs). LCMs are defined for continuous domains and generalize the naive Bayes model by using latent variables to model class-conditional dependencies between the attributes. In addition to providing good classification accuracy, the LCM model has several appealing properties, including a relatively small parameter space that makes it less susceptible to over-fitting. In this paper we take a first step towards generalizing LCMs to hybrid domains by proposing an LCM model for domains with binary attributes. We present algorithms for learning the proposed model, and we describe a variational approximation-based inference procedure. Finally, we empirically compare the accuracy of the proposed model to that of other classifiers on a number of different domains, including the problem of recognizing symbols in black-and-white images.
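
To illustrate the idea described in the abstract, the sketch below samples from a toy latent-variable model for binary attributes: a class variable determines the mean of a continuous latent vector, and each binary attribute is a Bernoulli whose logit depends linearly on that latent vector, so the attributes become dependent given the class. This is only an illustrative generative structure; the function name and the parameters mu, W and b are hypothetical and are not the model, learning algorithm, or variational inference procedure presented in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_lcm_like(n, n_classes=2, n_latent=3, n_attrs=10):
    """Sample from a toy latent classification-style model for binary data.

    Illustrative structure only: class C -> latent Gaussian vector Z with a
    class-dependent mean -> binary attributes X, each Bernoulli with a logit
    that is a linear function of Z.  The shared latent layer induces
    class-conditional dependencies between the attributes, relaxing the
    naive Bayes independence assumption.
    """
    # hypothetical parameters, drawn at random for the demonstration
    mu = rng.normal(size=(n_classes, n_latent))   # class-dependent latent means
    W = rng.normal(size=(n_latent, n_attrs))      # latent-to-attribute loadings
    b = rng.normal(size=n_attrs)                  # attribute biases

    C = rng.integers(n_classes, size=n)           # class labels
    Z = mu[C] + rng.normal(size=(n, n_latent))    # latent layer
    logits = Z @ W + b
    probs = 1.0 / (1.0 + np.exp(-logits))         # sigmoid link to Bernoulli
    X = (rng.random((n, n_attrs)) < probs).astype(int)
    return C, X

C, X = sample_lcm_like(5)
print(C)
print(X)
```
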
Original language: English
Journal: Pattern Recognition
Volume: 42
Issue number: 11
Pages (from-to): 2724-2736
ISSN: 0031-3203
DOI:
Publication status: Published - 2009
