Abstract
We investigate methods for parameter learning from incomplete data that is
not missing at random. Likelihood-based methods then require the optimization of
a profile likelihood that takes all possible missingness mechanisms into account.
Optimizing this profile likelihood poses two main difficulties: multiple (local) maxima, and its very high-dimensional parameter space. In this paper, a new method is presented for optimizing the profile likelihood that addresses the second difficulty: in the proposed AI&M (adjusting imputation and maximization) procedure, the optimization is performed by operations in the space of data completions, rather than
directly in the parameter space of the profile likelihood. We apply the AI&M method to
learning parameters for Bayesian networks. The method is compared against
conservative inference, which takes into account each possible data completion,
and against EM. The results indicate that likelihood-based inference is still feasible
in the case of unknown missingness mechanisms, and that conservative inference is
unnecessarily weak. On the other hand, our results also provide evidence that the
EM algorithm is still quite effective when the data is not missing at random.
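The EM baseline the abstract compares against treats missingness as ignorable (i.e., missing at random), which is exactly the assumption the paper's MNAR setting drops. As a minimal illustration of that baseline only, and not of the AI&M procedure, the sketch below runs EM on a two-node binary network X → Y in which X is sometimes unobserved; the function name and parameterization are illustrative choices, not taken from the paper.

```python
def em_bernoulli_pair(data, iters=50):
    """EM for a two-node network X -> Y (both binary).

    data: list of (x, y) pairs; x may be None (missing), y is always observed.
    Missingness is treated as ignorable (MAR) -- the standard EM assumption,
    not the MNAR setting the paper targets.
    Returns (pX, pY) with pX = P(X=1) and pY[x] = P(Y=1 | X=x).
    """
    pX, pY = 0.5, [0.5, 0.5]
    for _ in range(iters):
        # E-step: expected sufficient statistics under current parameters
        nX1 = 0.0                        # expected count of X=1
        nXY = [[0.0, 0.0], [0.0, 0.0]]   # nXY[x][y]: expected joint counts
        for x, y in data:
            if x is None:
                # posterior responsibility r = P(X=1 | y)
                w1 = pX * (pY[1] if y == 1 else 1 - pY[1])
                w0 = (1 - pX) * (pY[0] if y == 1 else 1 - pY[0])
                r = w1 / (w1 + w0)
            else:
                r = float(x)
            nX1 += r
            nXY[1][y] += r
            nXY[0][y] += 1 - r
        # M-step: maximize the expected complete-data log-likelihood
        pX = nX1 / len(data)
        pY = [nXY[x][1] / (nXY[x][0] + nXY[x][1]) for x in (0, 1)]
    return pX, pY
```

On fully observed data the responsibilities equal the observed values, so EM reduces to ordinary maximum-likelihood counting in a single pass; with missing entries it instead distributes fractional counts according to the current posterior over X.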
| Original language | English |
|---|---|
| Title | Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence (UAI-06) |
| Number of pages | 8 |
| Publisher | Association for Uncertainty in Artificial Intelligence |
| Publication date | 2006 |
| Pages | 225-232 |
| ISBN (Print) | 0974903922 |
| Status | Published - 2006 |
| Event | Uncertainty in Artificial Intelligence - Cambridge, USA. Duration: 13 Jun 2006 → 16 Jun 2006. Conference number: 22 |
Conference

| Conference | Uncertainty in Artificial Intelligence |
|---|---|
| Number | 22 |
| Country/Territory | USA |
| City | Cambridge |
| Period | 13/06/2006 → 16/06/2006 |