Identification of risk factors in epidemiologic study based on ROC curve and network
Citations Over Time
Abstract
This article proposes a new non-parametric approach for identification of risk factors and their correlations in epidemiologic study, in which investigation data may have high variations because of individual differences or correlated risk factors. First, based on classification information of high or low disease incidence, we estimate Receptor Operating Characteristic (ROC) curve of each risk factor. Then, through the difference between ROC curve of each factor and diagonal, we evaluate and screen for the important risk factors. In addition, based on the difference of ROC curves corresponding to any pair of factors, we define a new type of correlation matrix to measure their correlations with disease, and then use this matrix as adjacency matrix to construct a network as a visualization tool for exploring the structure among factors, which can be used to direct further studies. Finally, these methods are applied to analysis on water pollutants and gastrointestinal tumor, and analysis on gene expression data in tumor and normal colon tissue samples.
Related Papers
- → Language Model Pre-training Method in Machine Translation Based on Named Entity Recognition(2020)14 cited
- → Near-exact distributions for the independence and sphericity likelihood ratio test statistics(2009)24 cited
- → Combination of Features for Multilingual Speaker Identification with the Constraint of Limited Data(2013)9 cited
- → Hotelling’s T2 tests in paired and independent survey samples: An efficiency comparison(2016)5 cited
- → Maximizing the Index of Trees with Given Domination Number(2014)1 cited