Variance estimation by multivariate imputation methods in complex survey designs

Kim, J.-M.; Lee, K.-J.; Kim, W.

doi:10.3233/MAS-170394

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, J.-M.	-
dc.contributor.author	Lee, K.-J.	-
dc.contributor.author	Kim, W.	-
dc.date.available	2019-03-08T11:38:04Z	-
dc.date.issued	2017	-
dc.identifier.issn	1574-1699	-
dc.identifier.uri	https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/6083	-
dc.description.abstract	In this paper, we consider variance estimation of the sample mean when the missing data have been imputed with multivariate imputation methods. Modern multivariate imputation methods for missing data are complicated and computationally expensive. These methods do not require the normality assumption to impute the missing values. Under this assumption-free condition, we compare the variance estimation performance of six modern multivariate imputation methods, including copula imputation, random forest imputation, principal component analysis imputation, and k-nearest neighbors imputation, in complex sampling designs such as stratified sampling and cluster sampling, and with resampling approaches to variance estimation by the jackknife and bootstrap methods in stratified sampling. We conducted simulation studies using National Health and Nutrition Survey data with 5% and 15% missing completely at random (MCAR) rates. Based on the mean squared errors of the sample mean from our 500-repetition resampling simulation study in complex survey designs, the random forest (RF) imputation method appears to outperform the other imputation methods overall in percent relative efficiency (RE(%)) when the data have high skewness at the 5% missing rate and when the data have high excess kurtosis at the 15% missing rate, whereas the principal component analysis (PCA) imputation method appears to outperform the other imputation methods when the data have high skewness at the 5% and 15% missing rates. In particular, the multivariate imputation methods appear to be efficient in terms of RE(%) in the cluster sampling design when the data have high skewness or excess kurtosis at the 15% missing rate. © 2017 IOS Press and the authors.	-
dc.format.extent	13	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	IOS Press	-
dc.title	Variance estimation by multivariate imputation methods in complex survey designs	-
dc.type	Article	-
dc.identifier.doi	10.3233/MAS-170394	-
dc.identifier.bibliographicCitation	Model Assisted Statistics and Applications, v.12, no.3, pp 195 - 207	-
dc.description.isOpenAccess	N	-
dc.identifier.scopusid	2-s2.0-85029451777	-
dc.citation.endPage	207	-
dc.citation.number	3	-
dc.citation.startPage	195	-
dc.citation.title	Model Assisted Statistics and Applications	-
dc.citation.volume	12	-
dc.type.docType	Article	-
dc.publisher.location	Netherlands	-
dc.subject.keywordAuthor	bootstrap	-
dc.subject.keywordAuthor	copula imputation	-
dc.subject.keywordAuthor	jackknife	-
dc.subject.keywordAuthor	Missing at random (MAR)	-
dc.description.journalRegisteredClass	scopus	-
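
The workflow summarized in the abstract (delete values completely at random, impute them, then estimate the variance of the sample mean by jackknife or bootstrap in a stratified design) can be illustrated with a minimal Python sketch. Everything in it is an assumption for illustration rather than the authors' procedure: the data are synthetic lognormal draws, the stratum weights are made up, the function names are hypothetical, and a plain within-stratum mean imputer stands in for the six multivariate imputation methods the paper compares. Finite population corrections are ignored, and the bootstrap is the naive within-stratum version without rescaling.

```python
import random
import statistics


def mcar_mask(values, rate, rng):
    """Return a copy of values with each entry set to None with probability `rate` (MCAR)."""
    return [None if rng.random() < rate else v for v in values]


def mean_impute(values):
    """Fill None entries with the mean of the observed entries.

    Stand-in imputer only (assumption); assumes at least one observed value.
    """
    observed = [v for v in values if v is not None]
    fill = statistics.fmean(observed)
    return [fill if v is None else v for v in values]


def stratified_mean(strata, weights):
    """Stratified estimator of the mean: sum over strata of W_h * stratum sample mean."""
    return sum(w * statistics.fmean(s) for s, w in zip(strata, weights))


def stratified_jackknife_variance(strata, weights):
    """Delete-one jackknife within strata, without finite population correction."""
    theta = stratified_mean(strata, weights)
    total = 0.0
    for h, s in enumerate(strata):
        n_h = len(s)
        ss = 0.0
        for i in range(n_h):
            reduced = [list(x) for x in strata]
            del reduced[h][i]  # drop unit i of stratum h
            ss += (stratified_mean(reduced, weights) - theta) ** 2
        total += (n_h - 1) / n_h * ss
    return total


def stratified_bootstrap_variance(strata, weights, reps, rng):
    """Naive bootstrap: resample n_h units with replacement inside each stratum."""
    estimates = []
    for _ in range(reps):
        resampled = [rng.choices(s, k=len(s)) for s in strata]
        estimates.append(stratified_mean(resampled, weights))
    return statistics.variance(estimates)


if __name__ == "__main__":
    rng = random.Random(0)
    weights = [0.5, 0.3, 0.2]  # hypothetical stratum weights W_h
    # Skewed synthetic data, 40 units per stratum (assumption).
    complete = [[rng.lognormvariate(0.0, 1.0) for _ in range(40)] for _ in weights]
    for rate in (0.05, 0.15):
        imputed = [mean_impute(mcar_mask(s, rate, rng)) for s in complete]
        jk = stratified_jackknife_variance(imputed, weights)
        bs = stratified_bootstrap_variance(imputed, weights, 500, rng)
        print(f"missing rate {rate:.0%}: jackknife={jk:.5f}, bootstrap={bs:.5f}")
```

For a stratified mean, the delete-one jackknife above reduces algebraically to the textbook estimator, the sum over strata of W_h^2 s_h^2 / n_h, which gives a convenient check on the implementation. Treating imputed values as if they were observed, as this sketch does, generally understates the variance, which is part of why variance estimation after imputation needs separate study.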

Appears in Collections: College of Business & Economics > Department of Applied Statistics > 1. Journal Articles

Related Researcher

Kim, Won Kuk: College of Business & Economics (Department of Applied Statistics)
