Missing value estimation methods for DNA microarrays
Top 1% of 2001 papers
Abstract
We present a comparative study of several methods for the estimation of missing values in gene microarray data. We implemented and evaluated three methods: a Singular Value Decomposition (SVD) based method (SVDimpute), weighted K-nearest neighbors (KNNimpute), and row average. We evaluated the methods using a variety of parameter settings and over different real data sets, and assessed the robustness of the imputation methods to the amount of missing data over the range of 1 to 20% missing values. We show that KNNimpute appears to provide a more robust and sensitive method for missing value estimation than SVDimpute, and both SVDimpute and KNNimpute surpass the commonly used row average method (as well as filling missing values with zeros). We report results of the comparative experiments and provide recommendations and tools for accurate estimation of missing microarray data under a variety of conditions.
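To make the contrast between two of the compared methods concrete, the following is a minimal sketch of row-average imputation and a weighted K-nearest-neighbors imputation. It is not the authors' KNNimpute tool. The function names (`row_average_impute`, `knn_impute`), the use of `None` to mark missing entries, the root-mean-square distance over co-observed columns, the inverse-distance weights, and the `eps` term are all assumptions made for illustration. SVDimpute is not sketched here.

```python
import math


def row_average_impute(matrix):
    """Fill each missing entry (None) with the mean of the observed values in its row."""
    filled = []
    for row in matrix:
        observed = [v for v in row if v is not None]
        mean = sum(observed) / len(observed) if observed else 0.0
        filled.append([mean if v is None else v for v in row])
    return filled


def _distance(a, b):
    """Root-mean-square difference over columns observed in both rows.

    Returns None when the two rows share no observed column.
    (Assumed metric for this sketch.)
    """
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not pairs:
        return None
    return math.sqrt(sum((x - y) ** 2 for x, y in pairs) / len(pairs))


def knn_impute(matrix, k=2, eps=1e-9):
    """Fill each missing entry with an inverse-distance weighted average
    of that column's values in the k most similar rows (genes)."""
    filled = [list(row) for row in matrix]
    for i, row in enumerate(matrix):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate neighbors must have the target column observed.
            candidates = []
            for m, other in enumerate(matrix):
                if m == i or other[j] is None:
                    continue
                d = _distance(row, other)
                if d is not None:
                    candidates.append((d, other[j]))
            candidates.sort(key=lambda c: c[0])
            nearest = candidates[:k]
            if not nearest:
                # No usable neighbor: fall back to the row average.
                observed = [v for v in row if v is not None]
                filled[i][j] = sum(observed) / len(observed) if observed else 0.0
                continue
            # eps avoids division by zero when a neighbor is identical.
            weights = [1.0 / (d + eps) for d, _ in nearest]
            weighted_sum = sum(w * v for w, (_, v) in zip(weights, nearest))
            filled[i][j] = weighted_sum / sum(weights)
    return filled
```

In this sketch rows stand for genes and columns for experiments. A row whose observed values closely match another row borrows that row's value for the missing column, whereas the row average ignores the other genes entirely.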