Automated COVID-19 diagnosis and prognosis with medical imaging and who is publishing: a systematic review
| Field | Value | Language |
| dc.contributor.author | Gillman, A.G. | en |
| dc.contributor.author | Lunardo, F. | en |
| dc.contributor.author | Prinable, J. | en |
| dc.contributor.author | Belous, G. | en |
| dc.contributor.author | Nicolson, A. | en |
| dc.contributor.author | Min, H. | en |
| dc.contributor.author | Terhorst, A. | en |
| dc.contributor.author | Dowling, J.A. | en |
| dc.date.accessioned | 2022-07-04T00:46:10Z | |
| dc.date.available | 2022-07-04T00:46:10Z | |
| dc.date.issued | 2022 | |
| dc.identifier.uri | https://hdl.handle.net/2123/29119 | |
| dc.description.abstract | Objectives: To conduct a systematic survey of published techniques for automated diagnosis and prognosis of COVID-19 diseases using medical imaging, assessing the validity of reported performance and investigating the proposed clinical use-case. To conduct a scoping review into the authors publishing such work. Methods: The Scopus database was queried and studies were screened for article type, and minimum source normalized impact per paper and citations, before manual relevance assessment and a bias assessment derived from a subset of the Checklist for Artificial Intelligence in Medical Imaging (CLAIM). The number of failures of the full CLAIM was adopted as a surrogate for risk-of-bias. Methodological and performance measurements were collected from each technique. Each study was assessed by one author. Comparisons were evaluated for significance with a two-sided independent t-test. Findings: Of 1002 studies identified, 390 remained after screening and 81 after relevance and bias exclusion. The ratio of exclusion for bias was 71%, indicative of a high level of bias in the field. The mean number of CLAIM failures per study was 8.3 ± 3.9 [1,17] (mean ± standard deviation [min,max]). 58% of methods performed diagnosis versus 31% prognosis. Of the diagnostic methods, 38% differentiated COVID-19 from healthy controls. For diagnostic techniques, area under the receiver operating curve (AUC) = 0.924 ± 0.074 [0.810,0.991] and accuracy = 91.7% ± 6.4 [79.0,99.0]. For prognostic techniques, AUC = 0.836 ± 0.126 [0.605,0.980] and accuracy = 78.4% ± 9.4 [62.5,98.0]. CLAIM failures did not correlate with performance, providing confidence that the highest results were not driven by biased papers. Deep learning techniques reported higher AUC (p < 0.05) and accuracy (p < 0.05), but no difference in CLAIM failures was identified. Interpretation: A majority of papers focus on the less clinically impactful diagnosis task, contrasted with prognosis, with a significant portion performing a clinically unnecessary task of differentiating COVID-19 from healthy. Authors should consider the clinical scenario in which their work would be deployed when developing techniques. Nevertheless, studies report superb performance in a potentially impactful application. Future work is warranted in translating techniques into clinical tools. | en |
| dc.language.iso | en | en |
| dc.rights | Other | |
| dc.subject | COVID-19 | en |
| dc.subject | Coronavirus | en |
| dc.title | Automated COVID-19 diagnosis and prognosis with medical imaging and who is publishing: a systematic review | en |
| dc.type | Article | en |
| dc.identifier.doi | 10.1007/s13246-021-01093-0 | |
| dc.relation.other | Commonwealth Scientific and Industrial Research Organisation, CSIRO | en |
| usyd.faculty | Faculty of Medicine and Health | en |
Associated file/s
There are no files associated with this item.
Associated collections