Effect of method of deduplication on estimation of differential gene expression using RNA-seq
PeerJ2017Vol. 5, pp. e3091–e3091
Citations Over TimeTop 10% of 2017 papers
Anna V. Klepikova, Artem S. Kasianov, Mikhail S. Chesnokov, Н. Л. Лазаревич, Aleksey A. Penin, Maria D. Logacheva
Abstract
The use of unique molecular identifiers greatly improves accuracy of RNA-seq analysis, especially for highly expressed genes. We developed a set of scripts that enable handling of MI and their incorporation into RNA-seq analysis pipelines. Deduplication without MI affects results of differential gene expression analysis, producing a high proportion of false negative results. The absence of duplicate read removal is biased towards false positives. In those cases where using MI is not possible, we recommend using paired-end sequencing layout.
Related Papers
- → Identification of Salt Stress Response Genes in Rosa chinensis Leaves by Comparative RNA-seq Analysis of Transcriptome Dynamics(2018)4 cited
- → Collembola Regulate Biological Processes of a Plant: Evidence from a Rna-Seq-Based Transcriptome Analysis(2023)1 cited
- → Supplementary Table 10 from Characterizing the Impact of Smoking and Lung Cancer on the Airway Transcriptome Using RNA-Seq(2023)
- → Supplementary Table 3 from Characterizing the Impact of Smoking and Lung Cancer on the Airway Transcriptome Using RNA-Seq(2023)
- → Supplementary Table 9 from Characterizing the Impact of Smoking and Lung Cancer on the Airway Transcriptome Using RNA-Seq(2023)