
Missing Values in Data Analysis: Ignore or Impute?

Ng, Chong Guan, and Yusoff M.S.B., (2011) Missing Values in Data Analysis: Ignore or Impute? Education in Medicine Journal, 3 (1). pp. 6-11. ISSN 2180-1932


Official URL: http://saifulbahri.com/eimj/2011/06/20/volume-3-issue-1-jan-june-2011/

Affiliations

University of Malaya. Faculty of Medicine
Universiti Sains Malaysia. School of Medical Sciences

Abstract

Objective: Missing values are commonly encountered in data analysis in all types of research, and various methods have been introduced to handle them. This study compares the results of complete data analysis, the missing indicator method, mean substitution and single imputation in dealing with missing values.
Methods: A total of 202 patients discharged from the psychiatric ward of University Malaya Medical Centre (UMMC) between 27 August 2007 and 15 April 2008 were recruited. General psychopathology was measured with the Brief Psychiatric Rating Scale (BPRS-24). Information on age, gender, race, marital status and psychiatric diagnosis was collected. On follow-up, patients who had an early readmission (<6 months) were identified. A logistic regression model predicting early readmission from all the variables was fitted. The highest 10% of BPRS scores (n=20) were deleted to simulate a missing at random (MAR) situation. Four different statistical methods were then used to deal with the missing values.
Results: BPRS score was significantly associated with early readmission (p<0.01) in the original complete dataset. The associations based on complete data analysis, the missing indicator method and mean substitution were biased and not statistically significant. Single imputation gave the closest significant estimate of the association (p<0.1).
Conclusion: Ignoring missing values results in biased estimates in data analysis. Single imputation produced an unbiased estimate of the association in the MAR situation.
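
The four approaches compared in the abstract can be illustrated with a minimal sketch on synthetic data. This is not the authors' code or dataset: the scores, the readmission model, the field names (`bprs`, `readmitted`, `bprs_missing`) and the helper functions are all hypothetical, and the single imputation shown here is assumed to be a simple regression of the score on the outcome, which the abstract does not specify. The logistic regression step itself is left out.

```python
import math
import random
import statistics


def delete_highest(scores, fraction=0.10):
    """Set the highest `fraction` of scores to None (the deletion step)."""
    n_delete = round(len(scores) * fraction)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    to_delete = set(ranked[:n_delete])
    return [None if i in to_delete else s for i, s in enumerate(scores)]


def complete_case(rows):
    """Complete data analysis: drop every record with a missing score."""
    return [r for r in rows if r["bprs"] is not None]


def missing_indicator(rows, fill=0.0):
    """Missing indicator method: fill with a constant and add a 0/1 flag."""
    return [
        {**r,
         "bprs": fill if r["bprs"] is None else r["bprs"],
         "bprs_missing": int(r["bprs"] is None)}
        for r in rows
    ]


def mean_substitution(rows):
    """Mean substitution: replace missing scores with the observed mean."""
    mean = statistics.fmean(r["bprs"] for r in rows if r["bprs"] is not None)
    return [{**r, "bprs": mean if r["bprs"] is None else r["bprs"]} for r in rows]


def single_imputation(rows, predictor="readmitted"):
    """Single imputation (assumed form): least-squares prediction of the
    score from one other variable, fitted on the observed records."""
    observed = [r for r in rows if r["bprs"] is not None]
    xs = [r[predictor] for r in observed]
    ys = [r["bprs"] for r in observed]
    x_bar, y_bar = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx > 0 else 0.0  # fall back to the mean if x is constant
    intercept = y_bar - slope * x_bar
    return [
        {**r, "bprs": (intercept + slope * r[predictor]) if r["bprs"] is None else r["bprs"]}
        for r in rows
    ]


# Synthetic stand-in for the 202 patients (values are made up).
random.seed(0)
scores = [random.gauss(40, 10) for _ in range(202)]
readmitted = [int(random.random() < 1 / (1 + math.exp(-(s - 45) / 5))) for s in scores]
rows = [{"bprs": s, "readmitted": y}
        for s, y in zip(delete_highest(scores), readmitted)]

print(len(complete_case(rows)), "records remain after complete data analysis")
```

Each function returns a dataset on which the same logistic regression could then be fitted, so the estimated association between score and readmission can be compared across methods.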

Item Type: Journal
Keywords: missing values, imputation, complete data analysis, indicator, mean
Subjects: R Medicine, Dentistry, Pharmacy, Nursing; L Education; B Philosophy. Psychology. Religion, Islam
ID Code: 11773

