0 citations0 references

Erratum: “Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers” [Med. Phys. 45 (7), 3449‐3459 (2018)]

Medical Physics2019Vol. 46(2), pp. 1080–1087

Citations Over TimeTop 21% of 2019 papers

Abstract

In our study (Deist et al. 20181) we collected 12 datasets from previous studies on post-radiotherapy outcomes (e.g., survival, toxicity etc.). The dataset outcomes were modelled with 6 classifiers using the other dataset parameters. In 1 out of 12 datasets we incorrectly identified the outcome parameter column. The manuscript needs minor updates in text and updates for all figures for correctness, but the overall conclusion remains completely unchanged. The dataset in question (set E in the published manuscript) concerned non-small cell lung cancer patients and the correct outcome was 2-year overall survival. However, we incorrectly built prediction models using vital status at last follow-up as the outcome. The effects on the manuscript are described below, followed by an erratum list. Additionally, an updated version of the manuscript (with Track Changes) is provided as a supplementary file. The effects on the results are negligible and the conclusion remains completely unchanged. However, minor updates throughout the manuscript are necessary for correctness. The erroneous processing of the dataset (originally set E) has been corrected and the analysis as described in the manuscript has been repeated. For this dataset this means that 500 new models have been generated in the experiment (5 folds * 100 repetitions), resulting in updated performance metrics for this set. In the manuscript we describe individual and aggregate results over all datasets and therefore several numbers in text and tables, and all figures need updating. Additionally, since we order the anonymized sets based on discrimination performance the erroneously processed set (originally set E) is now labelled set I, consequently causing changes in the labels of the other sets as well. With the correct outcome being modelled, the Areas Under the Curve (AUCs) drop for all classifiers for the dataset in question. Random forest (rf) remains the best ranking classifier, and elastic net logistic regression (glmnet) still ranks second. The ranking of the other 4 classifiers is now different for this dataset. In the published manuscript, we reported best classification performance across 12 datasets for rf and glmnet. This conclusion remains unchanged after the corrigendum. No effect on the median ranking of the other classifiers is observed, but the interquartile ranges have changed slightly as illustrated in the updated scatter-boxplot below. Furthermore, since we report results on a large number of datasets the effect of the erroneous processing of one dataset is limited, which is a confirmation of the robustness of the analysis performed and reported in the manuscript. Figure 2. [Color figure can be viewed at wileyonlinelibrary.com] Figure 3. Original [Color figure can be viewed at wileyonlinelibrary.com] Figure 4*. [Color figure can be viewed at wileyonlinelibrary.com] Figure. 5*. [Color figure can be viewed at wileyonlinelibrary.com] *Since we order the anonymized sets based on discrimination performance the erroneously processed set (originally set E) is now labelled set I, consequently causing changes in the labels of the other sets as well. Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.

Citations Over TimeTop 21% of 2019 papers

Abstract

Related Papers