The AI&M procedure for learning from incomplete data

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

We investigate methods for parameter learning from incomplete data that is
not missing at random. Likelihood-based methods then require the optimization of
a profile likelihood that takes all possible missingness mechanisms into account.
Optimizing this profile likelihood poses two main difficulties: multiple (local) maxima, and its very high-dimensional parameter space. In this paper a new method is presented for optimizing the profile likelihood that addresses the second difficulty: in the proposed AI\&M (adjusting imputation and maximization) procedure the optimization is performed by operations in the space of data completions, rather than
directly in the parameter space of the profile likelihood. We apply the AI\&M method to
learning parameters for Bayesian networks. The method is compared against
conservative inference, which takes into account each possible data completion,
and against EM. The results indicate that likelihood-based inference is still feasible
in the case of unknown missingness mechanisms, and that conservative inference is
unnecessarily weak. On the other hand, our results also provide evidence that the
EM algorithm is still quite effective when the data is not missing at random.
Original languageEnglish
Title of host publicationProceedings of the 22nd Conference on Uncertainty in Artificial Intelligence (UAI-06)
Number of pages8
PublisherAssociation for Uncertainty in Artificial Intelligence
Publication date2006
Pages225-232
ISBN (Print)0974903922
Publication statusPublished - 2006
EventUncertainty in Artificial Intelligence - Cambridge, United States
Duration: 13 Jun 200616 Jun 2006
Conference number: 22

Conference

ConferenceUncertainty in Artificial Intelligence
Number22
Country/TerritoryUnited States
CityCambridge
Period13/06/200616/06/2006

Fingerprint

Dive into the research topics of 'The AI&M procedure for learning from incomplete data'. Together they form a unique fingerprint.

Cite this