EPEM: Efficient Parameter Estimation for Multiple Class Monotone Missing Data

Nguyen, Thu; Nguyen, Duy H. M.; Nguyen, Huy; Nguyen, Binh T.; Wade, Bruce A.

arXiv:2009.11360 (cs)

[Submitted on 23 Sep 2020]

Abstract: The problem of monotone missing data has been broadly studied during the last two decades and has many applications in fields such as bioinformatics and statistics. Commonly used imputation techniques require multiple iterations through the data before converging. Moreover, those approaches may introduce extra noise and bias into the subsequent modeling. In this work, we derive exact formulas and propose a novel algorithm, EPEM, to compute the maximum likelihood estimators (MLEs) for a multiple-class, monotone missing dataset when the covariance matrices of all classes are assumed to be equal. We then illustrate an application of the proposed method in Linear Discriminant Analysis (LDA). Because the computation is exact, EPEM does not require multiple iterations through the data as imputation approaches do, and thus promises to be much less time-consuming. Empirical results support this: EPEM reduced error rates significantly and required a short computation time compared to several imputation-based approaches. We also release all code and data from our experiments in a GitHub repository as a contribution to the research community working on this problem.

Comments:	version 1
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2009.11360 [cs.LG]
	(or arXiv:2009.11360v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2009.11360

Submission history

From: Duy Minh Ho Nguyen
[v1] Wed, 23 Sep 2020 20:07:53 UTC (468 KB)
