Flash News
Welcome to IARC

Publisher Login

Latest News
Welcome to IARC- JCR Report

Submit your Journal to get IARC-JCRR Indexing and Impact Factor

Impact Factor calculated by IARC on the basis of Journal Citation Reference (JCR) Report.


Contact: iarcdpi@gmail.com


A Study on Missing Data Management
Mitra M., Samanta R. K.

Published in: International Journal of Computer Sciences and Engineering
Volume- 5, Issue-2, pp.30-33, Feb 2017
DPI :-> 16.10022.IJCSE.2017.V5I2.3033.1246

Missing data, a persistent problem in most scientific research, should be handled very carefully, as role of data are vital in every analysis. Mishandling missing values may cause distorted analysis or may generate biased results. Valid and reliable models require good data preparation. Dozens of techniques have been proposed by methodologists to address the problem. Appropriate method should be taken into consideration for a particular study in order to achieve efficient and valid analysis. In this study we discuss different methods to handle missing data and compare three imputation methods: Arithmetic Mean Imputation, Regression Imputation and Multiple Imputation using EMB algorithm, performed on three data sets from UCI repository under the assumption of MAR based on Root Mean Square Error (RMSE) as an evaluation criteria.

Key-Words / Index Term
UCI database, Missing At Random (MAR), Missing Completely At Random (MCAR), Missing Not At Random (MNAR), Multiple Imputation, Expectation Maximization with Bootstrap approach (EMB), Root Mean Square Error (RMSE)

How to cite this article
Mitra M., Samanta R. K. , “A Study on Missing Data Management ”, International Journal of Computer Sciences and Engineering, 5, Issue-2, pp.30-33, Feb 2017. DPI:16.10022.IJCSE.V5.I2.1246