key: cord-0766395-ag3cdtcq authors: Davey, M G; Davey, M S; Ryan, É J; Boland, M R; McAnena, P F; Lowery, A J; Kerin, M J title: Is radiomic MRI a feasible alternative to OncotypeDX(®) recurrence score testing? A systematic review and meta-analysis date: 2021-10-11 journal: BJS Open DOI: 10.1093/bjsopen/zrab081 sha: 911b4f63726a0f42b005f196da90cbc33a16fb58 doc_id: 766395 cord_uid: ag3cdtcq BACKGROUND: OncotypeDX(®) recurrence score (RS) aids therapeutic decision-making in oestrogen-receptor-positive (ER+) breast cancer. Radiomics is an evolving field that aims to examine the relationship between radiological features and the underlying genomic landscape of disease processes. The aim of this study was to perform a systematic review of current evidence evaluating the comparability of radiomics and RS. METHODS: A systematic review was performed as per PRISMA guidelines. Studies comparing radiomic MRI tumour analyses and RS were identified. Sensitivity, specificity and area under curve (AUC) delineating low risk (RS less than 18) versus intermediate–high risk (equal to or greater than 18) and low–intermediate risk (RS less than 30) and high risk (RS greater than 30) were recorded. Log rate ratios (lnRR) and standard error were determined from AUC and 95 per cent confidence intervals. RESULTS: Nine studies including 1216 patients met inclusion criteria; the mean age at diagnosis was 52.9 years. Mean RS was 16 (range 0–75); 401 patients with RS less than 18, 287 patients with RS 18–30 and 100 patients with RS greater than 30. Radiomic analysis and RS were comparable for differentiating RS less than 18 versus RS 18 or greater (RR 0.93 (95 per cent c.i. 0.85 to 1.01); P = 0.010, heterogeneity (I(2))=0%) as well as RS less than 30 versus RS 30 or greater (RR 0.76 (95 per cent c.i. 0.70 to 0.83); P < 0.001, I(2)=0%). MRI sensitivity and specificity for RS less than 18 versus 18 or greater was 0.89 (95 per cent c.i. 0.85 to 0.93) and 0.72 (95 per cent c.i. 0.66 to 0.78) respectively, and 0.79 (95 per cent c.i. 0.72 to 0.86) and 0.74 (95 per cent c.i. 0.68 to 0.80) for RS less than 30 versus 30 or greater. CONCLUSION: Radiomic tumour analysis is comparable to RS in differentiating patients into clinically relevant subgroups. For patients requiring MRI, radiomics may complement and enhance RS for prognostication and therapeutic decision making in ER+ breast cancer. Breast cancer is a ubiquitous disease responsible for 11.6 per cent of new cancer diagnoses and is the second most common cause of cancer-related mortality, following lung carcinoma 1 . Historically, breast carcinoma was considered a homogeneous entity, with large-scale surgical resection and conventional cytotoxic chemotherapy prescription the cornerstone of controlling disease recurrence 2 . In the wake of the molecular era, heterogeneous biological properties have facilitated substratification of the disease into several distinct subtypes, with various multimodal treatment strategies being indicated, depending on the molecular and genetic processes driving tumorigenesis 3 . Although whole-genome sequencing is required to evaluate comprehensively intrinsic biological tumour characteristics, routine immunohistochemical appraisal of oestrogen receptor (ER), progesterone receptor (PgR) and human epidermal growth factor receptor-2 (HER2/neu) has been rapidly incorporated into clinical practice, and reliably separates breast cancer into four molecularly diverse subtypes 3 . Luminal molecular subtypes, often perceived to carry favourable prognoses and to be indolent in nature, are best treated with adjuvant endocrine hormonal therapy, and judicious systemic chemotherapy prescription 4, 5 . In the era of personalized medicine, multiomic analyses are now the norm for oncological patient management 6, 7 . Multigene molecular panels, such as the OncotypeDX V R (Genomic Health Inc. V R , Redwood City, California, US) 21-gene recurrence score (RS), have clearly segregated tumour types into those likely to benefit from systemic chemotherapy and those that would not 8 . However, despite validation and incorporation into European Society of Medical Oncology V R , National Institute of Clinical Excellence V R , American Society of Clinical Oncology V R and National Comprehensive Cancer Network V R guidelines [9] [10] [11] [12] , the assay carries several disadvantages, such as its cost and lengthy turnover time 13, 14 . Radiomics is an evolving field that aims to examine the relationship between radiological features and the underlying genomic landscape of disease processes. This involves a radiologist acquiring imaging and performing segmentation, before quantitative analyses of medical imaging are performed using artificial intelligence software, with the aim of enhancing existing data available to clinicians through non-intuitive mathematical evaluation of disease properties, many of which are imperceptible to the human eye 15, 16 . In recent times, alternatives to RS testing have been proposed, such as the application of radiomic tumour analysis into clinical practice to aid therapeutic decision making and prognostication in certain cases. A recent hypothesis suggests that radiomics tumour analysis may have a role in augmenting multiomic cancer care, and studies evaluating the value of radiomics in comparing genomic and radiomic profiles of cancers may shed some light on to this proposition. Accordingly, the aim of the present systematic review and meta-analysis was to determine whether radiomic evaluation offers a feasible alternative to RS testing in substratifying patients diagnosed with earlystage, oestrogen-receptor-positive (ERþ) breast cancer. This systematic review was performed in accordance to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines 17 and the Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy 18 . As this is a systematic review of current literature, local institutional ethical approval was not required. population was patients diagnosed with ERþ/human epidermal growth factor receptor-2 negative (HER2-) breast cancer with 0 to 3 metastatic lymph nodes who had undergone preoperative MRI, surgical resection, and subsequent RS testing on their resected breast cancer specimen. Radiomic analyses were carried out using preoperative MRI of tumour tissue, and the discriminative ability of radiomic software was compared with the current standard RS testing for prognostication and therapeutic decision making in early ERþ breast cancer. Primary outcomes included the evaluation of the clinical utility of breast MRI to stratify cancers into RS categories for the prediction of the likelihood of recurrence (indicated through recurrence score). Sensitivity, specificity and area under the curve (AUC) scores from receiver operating characteristic (ROC) curve analyses were also used to establish the reliability of MRI in determining RS. Secondary outcomes included correlating RS testing and MRI findings to determine their predictive value in delineating clinically actionable versus non-actionable RS groups (i.e., those indicated for combined adjuvant chemoendocrine therapy (actionable) versus adjuvant endocrine as monotherapy (non-actionable)). An electronic search of the PubMed Medline, EMBASE and Scopus databases for relevant studies was performed on 30 November 2020. The search was performed for the following headings: (Breast Cancer) AND (Radiomics) AND (Biomarker). Included studies were limited to manuscripts published in the English language and studies were not restricted based on their year of publication. All duplicate studies were removed before titles were screened, and studies deemed appropriate had their abstracts and full texts reviewed. This search was performed by two independent reviewers. Studies deemed appropriate for inclusion were those applying radiomic artificial intelligence computational algorithms and methods (machine-learning techniques, conventional neural networks and deep-learning techniques) in order to predict RS group from preoperative imaging. Studies relying upon subjective judgement of independent radiologists were excluded. Studies meeting the following inclusion criteria were included: studies with patients with histologically confirmed ERþ/HER2breast cancer with 0 to 3 positive metastatic lymph nodes; studies in which RS testing had been performed on resected tumour specimens; studies in which breast MRI was performed before operation; studies detailing ROC curve analyses assessing the reliability of radiomic testing in differentiating RS groups (for example, RS 18 or less versus RS greater than 18, RS 29 or less versus RS greater than 30, etc.) or clinicopathological data with RS. Studies meeting any of the following exclusion criteria were excluded from this study: studies detailing RS testing and other imaging modalities (for example, CT, ultrasound sonography imaging, mammographic imaging, etc.); review articles; studies including less than five patients or case reports; editorial articles; and conference abstracts. The literature search was performed by two independent reviewers (M.G.D. and M.S.D) using the previously discussed predesigned search strategy. Duplicate studies were removed. Retrieved manuscripts were reviewed independently by both reviewers to ensure all inclusion criteria were met, before extracting the following data: first author name, year of publication, study design, country, level of evidence, study title, number of patients, RS data and AUC scores from ROC analyses. Data specific to AUC (expressed as rate ratios (RR), 95 per cent confidence intervals and P values) were directly extracted from tables and study text. Assessment of the quality of included radiomic studies was performed using the radiomics quality score, as outlined by Lambin and colleagues 19 . Where discrepancies in opinion occurred between the reviewers, a third reviewer was asked to arbitrate (M.R.B). Data pertaining to RS testing and clinicopathological data were presented as proportions using descriptive statistics. Statistical analysis was performed according to the Cochrane guidelines 18 . Study-specific estimates of sensitivity and specificity were calculated from study data. Summary ROC analysis was used to illustrate the relationship between sensitivity and specificity of MRI radiomic analysis and RS testing, and to convey the diagnostic test performance of MRI and RS testing. In cases where sensitivity and specificity were not outlined by authors, estimated diagnostic test sensitivity and specificity were extracted from ROC analyses with most accurate sensitivity prioritized. AUC were expressed as RR and each corresponding confidence interval was retrieved directly for use in this meta-analysis, as described by Kester and colleagues 20 . Either fixed or random effects models were applied on the basis of whether significant heterogeneity (I 2 > 50%) existed between studies included in any particular analysis. Symmetry of funnel plots were used to assess publication bias. Statistical heterogeneity was determined using I 2 statistics. Statistical significance was determined to be P < 0.050. Statistical analysis was performed using Review Manager (RevMan), version 5.4 (Nordic Cochrane Centre, Copenhagen, Denmark). The initial electronic search resulted in a total of 740 studies. Following removal of the 55 identified duplicate studies, the remaining 685 titles were screened for relevance, of which 40 had their abstracts or full texts assessed for eligibility. Overall, nine clinical studies were included in this analysis, as depicted in Fig. 1 [21] [22] [23] [24] [25] [26] [27] [28] [29] . Eight of these studies were from the US, while one study reported results from Korean patients 25 . Of the included studies, six used radiomic artificial intelligence, two used machine-learning techniques 23, 26 , and a single study used conventional neural networking to determine RS groups 22 . Individual studies included in this analysis are outlined in Table 1 . Clinical characteristics and OncotypeDX V R recurrence score Overall, 1216 patients were included in this study. The mean age at diagnosis was 52.9 (range 27-83) years. Four studies reported on menopausal status at the time of diagnosis; 50.8 per cent were postmenopausal (452 of 889 patients). Of 788 patients, the mean RS was 16 (range 0-75), with 401 patients (50.9 per cent) with RS less than 18, 287 (36.4 per cent) with RS 18-30, and 100 (12.7 per cent) with RS greater than 30 ( Table 1) . RS was associated with histopathological tumour subtype, tumour grade, ER, PgR and HER2 expression (all P < 0.001 (v 2 test)). In this analysis, 78.4 per cent of patients had node-negative disease (N0) and nodal status was not associated with RS (P ¼ 0.115 (v 2 test)), as illustrated in supplementary material online, Table S1 . Radiomic analysis of MRI images of resected tumour tissue was comparable for differentiating cancers with RS less than 18 and 18 or greater (RR 0.93 (95 per cent c.i. 0.85 to 1.01); P ¼ 0.010, I 2 ¼0%), although less comprehensive in providing disparity between RS less than 30 and 30 or greater (RR 0.76 (95 per cent c.i. 0.70 to 0.83); P < 0.001, I 2 ¼0%) (Fig. 2) . Fig. 3c . No single radiomic biomarker that correlated directly with RS could be ascertained from more than one study. Contemporary oncological practice recognizes that each tumour has an idiosyncratic gene expression profile, enabling differentiation of cancers into diverse biological and molecular groups and profiles, even within the same molecular subtypes 30 . This provides a unique challenge and opportunity to the clinician, as the correct classification facilitates personalized medicine, with the identification of patients who respond to particular therapeutic strategies based on careful elucidation of their underlying host and tumour characteristics. RS testing, which has been incorporated into international guidelines following validation in prospective analyses 4, [9] [10] [11] [12] , attempts to address this, and has facilitated the shift in the paradigm from pervasive conventional chemotherapy prescription to a more refined, personalized approach, within early-stage ERþ breast cancer. This systematic review and meta-analysis is the first appraising the feasibility of utilizing radiomic tumour analysis as a surrogate for RS, and incorporates data from 1216 breast cancers with corresponding MRI and genomic results. In the era of multiomic cancer care, radiomics offers the prospect of a novel, non-invasive modality for the personalization of treatment, as well as allowing for potential opportunities to augment or perhaps even replace current strategies in the future. The most significant finding in this review is the data suggesting radiomic tumour analysis may provide a potential alternative to RS testing in discriminating early ERþ breast cancer with low risk of disease recurrence (RS less than 18) from tumours with intermediate-high risk of recurrence (RS 18 or greater) based on preoperative MRI, with a sensitivity of almost 90 per cent. For the physician, these findings have a number of potential implications for clinical practice: currently, combined chemoendocrine prescription is indicated in select cases following RS testing, however, this pooled analysis suggests certain patients may bypass postoperative tumour specimen genomic testing, proceeding directly to adjuvant endocrine therapy following curative oncological resection, provided they have had a preoperative MRI and their imaging is congruent with RS less than 18 on radiomic analysis. Although current management guidelines for early breast cancer do not include routine MRI staging to determine the extent of locoregional disease burden 31 , cases such as invasive lobular carcinoma histological subtype, BRCA mutation carriers and equivocal mammographic imaging often rely on this modality to clarify diagnoses and to aid surgical planning 32, 33 . In such cases, radiomic analyses may be implemented to guide Ha et al. 22 Jacobs et al. 23 Saha et al. 26 -0. Ashaf et al. 21 Li et al. 24 Ha et al. 22 Jacobs et al. 23 Nam et al. 25 Sutton et al. 27 Sutton et al. 27 Li et al. 24 Ashaf et al. 21 Thakur et al . 28 67 18 64 31 31 10 10 25 12 11 10 1 3 therapeutic decision making, particularly as recent studies suggest as many as 60 per cent of cancers are RS less than 18 34, 35 . Consequently, if the discriminative ability of radiomic analysis is proven in well conducted randomized trials, preoperative radiomic evaluation of all early ERþ breast cancers requiring MRI could complement or even replace RS testing with several clinical and economic implications. Additionally, data from this pooled analysis supports radiomics in delineating cancers with RS greater than 30, with an implied sensitivity of almost 80 per cent, with similar specificity rates. Within the framework of early-stage ERþ disease, RS has emerged and successfully modulated cytotoxic chemotherapy prescription for the vast majority. However, differentiating those with RS greater than 30 is crucial, on account of the inherent indication for chemoendocrine therapy following substratification into this group, as proven through data published from the National Surgical Adjuvant Breast and Bowel (NSABP) projects 6 . However, it should be acknowledged that clinically relevant RS subgroups have been redefined by the work of Sparano and colleagues through the Trial Assigning Individualized Options for Treatment (TAILORx) 4 ; the initial evidence for RS testing indicated scores greater than 30 to represent accurately those likely to benefit most from systemic treatment 6 , adding validity to the results of this analysis. However, in modern oncological practice, RS greater than 25 is utilized as the cut-off in postmenopausal women, rendering the current analysis between the RS subgroups discussed somewhat at odds with current clinical practice. Albeit imperfect in the current form, non-invasive radiomic analysis remains an exciting prospect in oncology on a global scale. Buyin from large international investment companies may be imperative to finance technological enhancement, and validate current concepts and principles prospectively, while refining machinelearning strategies through high-throughput image-based screening. If given the opportunity to be validated, it is plausible that radiomics has potential to become embedded into multidisciplinary discussion within clinical oncological practice, and to further personalize oncological patient care. In recent times, there has been a move towards precision medicine in the field of oncology, with an emphasis placed on ensuring cost-effectiveness where possible. Moreover, in most European healthcare economies, the foundation for health and personal services for the majority of residents is funded through taxation of public funds . At present, the world's economy is subject to impending recession following the COVID-19 pandemic, and adaptations look imperative in ensuring cost-efficiency; RS testing is associated with a reported cost of approximately e3000 36 , with turnaround times of up to several weeks, while the cost of conventional MRI scanning is often substantially less in certain healthcare economies 37, 38 . Moreover, should radiomic neural networking become routine, its offers more cost-and time-effectiveness versus routine genomic analysis 39, 40 . While data procured from MRI pertaining to macroscopic features (for example, tumour site, size, shape, degree of disease burden, etc.) are of course invaluable, features occult to the radiologist, such as grey-level co-occurrence matrix textural features, MR-derived entropy and surface-to-volume ratio, may serve patients by personalizing their medical care 41 . Furthermore, some clinicians may argue failure to extract such radiomic tumour data as routine, once MRI has been performed, is a disservice to prospective patients. From this viewpoint, it seems appropriate and rational that radiomic analyses are incorporated into the oncology paradigm, with substitution or supplementation of current biomarkers, to continue developing and adapting strategies of patient management in a cost-effective manner. This systematic review has a number of inherent limitations. The field of radiomics involves a spectrum of artificial-intelligence techniques, including machine learning, conventional neural networks and deep-learning techniques. In this analysis, all of these methods are appraised under the umbrella term 'radiomics', despite variance in their reproducibility of data 42 . In this analysis, there was no single radiomic biomarker that correlated directly with RS that could be ascertained from more than one study, rendering sensitivity, specificity and rate ratios the most useful means of comparing radiomics and RS. Moreover, there is a limited number of patients in this analysis (1216 patients), and all included studies were of moderate levels of evidence, with none of the included studies being prospective in nature. Clinical cut-offs have been redefined following the publication of the results of the TAILORx trial 4 , yet all studies included this analysis use traditionally accepted cut-offs. This shortcoming limits the relevance of the conclusions which can be drawn from this analysis and fails to support its implementation into clinical practice using traditional cut-offs (for example, in the setting of a premenopausal woman with RS less than 18, based on the results from the current analysis, radiomics would spare such a patient combined chemoendocrine therapy (predictive rate ratio of 0.93 in delineating RS less than 18 from 18 or greater (Fig. 2a) ) indicating possible undertreatment). Difficulty ascertaining radiomic differences between RS less than 30 and greater than 30 is possibly due to a paucity of patients in the greater than 30 group being included in this analysis 21, 24 . Furthermore, current indications for preoperative MRI only include scenarios such as invasive lobular carcinoma and BRCA-mutation carriers, reducing the value of implementing results of this analysis into current clinical practice models. Furthermore, the timely access to MRI in the preoperative period has not been evaluated in this review and may pose a real-world challenge to the clinical implementation of radiomics. In spite of these acknowledged limitations, this analysis supports the mantra that radiomic tumour analysis is imminent in personalizing cancer care for prospective patients. Disclosure. The authors declare no conflicts of interest. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries The results of radical operations for the cure of carcinoma of the breast Personalizing the treatment of women with early breast cancer: highlights of the St Gallen International Expert Consensus on the Primary Therapy of Early Breast Cancer Adjuvant chemotherapy guided by a 21-gene expression assay in breast cancer Disease recurrence and oncological outcome of patients treated surgically with curative intent for estrogen receptor positive, lymph node negative breast cancer Treatment of lymph-node-negative, oestrogen-receptor-positive breast cancer: long-term findings from National Surgical Adjuvant Breast and Bowel Project randomised clinical trials Integrated multi-omics analyses in oncology: a review of machine learning methods and tools Clinicopathological correlates, oncological impact, and validation of Oncotype DX TM in a European Tertiary Referral Centre NCCN guidelines insights: breast cancer, version 1 ESMO Guidelines Committee. Primary breast cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up Use of biomarkers to guide decisions on adjuvant systemic therapy for women with early-stage invasive breast cancer Gene expression profiling and expanded immunohistochemistry tests to guide the use of adjuvant chemotherapy in breast cancer management: a systematic review and cost-effectiveness analysis Economic impact of gene expression profiling in patients with early-stage breast cancer in France Clinical use of the Oncotype DX genomic test to guide treatment decisions for patients with invasive breast cancer Radiomics in medical imaging-'how-to' guide and critical reflection Texture analysis and machine learning for detecting myocardial infarction in noncontrast low-dose computed tomography: unveiling the invisible Preferred reporting items for systematic reviews and metaanalyses: the PRISMA statement Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy Version 1.0. The Cochrane Collaboration Radiomics: the bridge between medical imaging and personalized medicine Meta-analysis of ROC curves Identification of intrinsic imaging phenotypes for breast cancer tumors: preliminary associations with gene expression profiles Convolutional neural network using a breast MRI tumor dataset can predict OncotypeDx recurrence score Integrated multiparametric radiomics and informatics system for characterizing breast tumor characteristics with the OncotypeDX gene assay MR imaging radiomics signatures for predicting the risk of breast cancer recurrence as given by research versions of MammaPrint, Oncotype DX, and PAM50 gene assays Radiomics signature on 3T dynamic contrast-enhanced magnetic resonance imaging for estrogen receptor-positive invasive breast cancers: preliminary results for correlation with Oncotype DX recurrence scores A study of association of Oncotype DX recurrence score with DCE-MRI characteristics using multivariate machine learning models Breast cancer subtype intertumor heterogeneity: MRI-based features predict results of a genomic assay Apparent diffusion coefficient in estrogen receptor-positive and lymph node-negative invasive breast cancers at 3.0T DW-MRI: a potential predictor for an oncotype Dx test recurrence score Qualitative radiogenomics: association between Oncotype DX test recurrence score and BI-RADS mammographic and breast MR imaging features Molecular portraits of breast cancer: tumour subtypes as distinct disease entities Role of preoperative MRI in the management of newly diagnosed breast cancer patients A systematic review: mindfulness intervention for cancer-related pain Magnetic resonance imaging in breast cancer: a literature review and future perspectives Community clinical practice patterns and mortality in patients with intermediate oncotype DX recurrence scores: who benefits from chemotherapy? Results of PONDx, a prospective multicenter study of the Oncotype DX V R breast cancer assay: real-life utilization and decision impact in French clinical practice Cost analysis of using Magee scores as a surrogate of Oncotype DX for adjuvant treatment decisions in women with early breast cancer Cost-effectiveness of MRI compared to mammography for breast cancer screening in a high risk population Diagnostic workup and costs of a single supplemental molecular breast imaging screen of mammographically dense breasts Radiomics for precision medicine: current challenges, future prospects, and the proposal of a new framework Applications and limitations of radiomics Exploration of PET and MRI radiomic features for decoding breast cancer phenotypes and prognosis Radiomics with artificial intelligence: a practical guide for beginners Supplementary material is available at BJS Open online.