key: cord-0887609-yz027nri authors: Ren, Jing; Li, Yuan; Yang, Jun-Jun; Zhao, Jia; Xiang, Yang; Xia, Chen; Cao, Ying; Chen, Bo; Guan, Hui; Qi, Ya-Fei; Tang, Wen; Chen, Kuan; He, Yong-Lan; Jin, Zheng-Yu; Xue, Hua-Dan title: MRI-based radiomics analysis improves preoperative diagnostic performance for the depth of stromal invasion in patients with early stage cervical cancer date: 2022-01-29 journal: Insights Imaging DOI: 10.1186/s13244-022-01156-0 sha: f34ba1a961c8aa12ea7bf0e5d1db4c84c41fc978 doc_id: 887609 cord_uid: yz027nri BACKGROUND: The depth of cervical stromal invasion is one of the important prognostic factors affecting decision-making for early stage cervical cancer (CC). This study aimed to develop and validate a T2-weighted imaging (T2WI)-based radiomics model and explore independent risk factors (factors with statistical significance in both univariate and multivariate analyses) of middle or deep stromal invasion in early stage CC. METHODS: Between March 2017 and March 2021, a total of 234 International Federation of Gynecology and Obstetrics IB1-IIA1 CC patients were enrolled and randomly divided into a training cohort (n = 188) and a validation cohort (n = 46). The radiomics features of each patient were extracted from preoperative sagittal T2WI, and key features were selected. After independent risk factors were identified, a combined model and nomogram incorporating radiomics signature and independent risk factors were developed. Diagnostic accuracy of radiologists was also evaluated. RESULTS: The maximal tumor diameter (MTD) on magnetic resonance imaging was identified as an independent risk factor. In the validation cohort, the radiomics model, MTD, and combined model showed areas under the curve of 0.879, 0.844, and 0.886. The radiomics model and combined model showed the same sensitivity and specificity of 87.9% and 84.6%, which were better than radiologists (sensitivity, senior = 75.7%, junior = 63.6%; specificity, senior = 69.2%, junior = 53.8%) and MTD (sensitivity = 69.7%, specificity = 76.9%). CONCLUSION: MRI-based radiomics analysis outperformed radiologists for the preoperative diagnosis of middle or deep stromal invasion in early stage CC, and the probability can be individually evaluated by a nomogram. Cervical cancer (CC) constitutes a heavy burden on women's health globally. It is the fourth most frequently occurring female malignancy and the fourth most common cause of cancer-related deaths. Approximately 604,127 new cases of CC are reported annually, with 341,831 deaths worldwide [1, 2] . The treatment for early stage (2018 International Federation of Gynecology and Obstetrics (FIGO) IB-IIA) CC includes surgery and primary chemoradiotherapy, and the determination of treatment strategies is largely dependent on tumor-related prognostic factors [3, 4] , including the depth of cervical stromal invasion (DOI) [5, 6] . According to the Querleu-Morrow classification of radical hysterectomy, early stage CC patients without middle or deep 1/3 stromal invasion and a tumor size less than 2 cm can opt to undergo limited radical hysterectomy to avoid complications [7, 8] . However, most patients with middle or deep 1/3 stromal invasion are usually treated with radical hysterectomy and adjuvant radiotherapy, especially in the presence of other risk factors, such as special pathological types (adenocarcinoma, adenosquamous carcinoma, and neuroendocrine carcinoma, etc.), lymphovascular space invasion, or a large tumor size [3, 9] . In addition, concurrent radiochemotherapy is recommended over surgery for patients with middle or deep 1/3 stromal invasion and risk factors mentioned above, as it can achieve equal treatment efficacy with the combination of radical hysterectomy and adjuvant radiotherapy, and can avoid surgery-related adverse effects [5, [10] [11] [12] . Though the most accurate diagnosis of DOI is currently obtained through postoperative pathological analysis, novel preoperative approaches to accurately diagnose DOI is imperative to provide more reliable evidence for decision-making and to improve confidence in the management of patients with early stage CC. Experience-related conventional imaging methods with limited accuracies, such as ultrasound, magnetic resonance imaging (MRI), and positron emission tomography-computed tomography (PET/CT), can be used to diagnose the DOI of early stage CC before treatment [13] [14] [15] . Transvaginal sonography has shown 63.6% accuracy, while MRI showed variable diagnostic sensitivities of 70% for radiologists with 7 years of experience in gynecologic cancer imaging, but this was only 50% for radiologists with 4 years of experience [13, 14] . PET/CT is not an ideal method for diagnosing DOI due to its high false-positivity rate of 40.8% [15] . Previous studies have found that some clinical characteristics, including tumor diameter on imaging, were associated with the DOI in patients with early stage CC, but their value in preoperative diagnosis of middle or deep 1/3 stromal invasion is yet to be investigated [16, 17] . Another preoperative method that may be applied for the diagnosis of DOI is radiomics. Radiomics analysis can convert medical images into high-dimensional, mineable data to evaluate tumor heterogeneity that cannot be identified by gross observation. The combination of radiomics-derived data and clinical data is a promising approach to enhance clinical management [18] . Since MRI is the best imaging modality to describe the extent of CC for treatment planning, several MRI-based radiomics analyses have been performed to predict prognostic factors (e.g., lymph node metastasis, parametrial invasion, and lymphovascular space invasion), treatment response, and patient survival [19] [20] [21] [22] [23] [24] [25] [26] . Similar to clinical characteristics, limited radiomics studies aiming for preoperative diagnosis of middle or deep 1/3 stromal invasion in early stage CC have been conducted. Therefore, in this study, we explored the value of MRIbased radiomics analysis and clinical characteristics in the preoperative diagnosis of middle or deep 1/3 stromal invasion. The study aimed to develop and validate diagnostic models to facilitate decision-making in the management of patients with early stage CC. Our institutional review board approved this retrospective study, so the requirement for informed consent was waived. This study was conducted and prepared by following the TRIPOD statement and CLAIM guideline for artificial intelligence [27, 28] . Patients with a clinical staging of FIGO IB1-IIA1 CC, as defined by the 2018 FIGO staging system, who underwent radical hysterectomy and pelvic lymph node dissection at our institution between March 2017 and March 2021 were enrolled in this study. The inclusion criteria were as follows: (1) pelvic MRI examination including sagittal T2-weighted imaging (T2WI) was performed within 14 days before surgery; (2) pathologic evaluation of the DOI was attainable; and (3) preoperative biopsy confirmed squamous cell carcinoma (SCC), adenocarcinoma (AC), or adenosquamous carcinoma (ASC). Patients were excluded for the following reasons: (1) patients who underwent treatment, such as neoadjuvant chemotherapy, radiotherapy, or cervical conization, prior to MRI examination or between MRI and surgery; (2) tumors were invisible on sagittal T2WI (considering the challenges to determine the ROIs of invisible tumors); and (3) insufficient image quality to extract radiomics features. A total of 234 patients were included in the study. A flowchart of this study is presented in Fig©1. Eligible patient data were randomly divided into a training cohort (188 patients, mean age 44.63 ± 9.61 years) and a validation cohort (46 patients, mean age 46.56 ± 10.36 years), at a ratio of 8:2. The clinical characteristics of all enrolled patients, including age, 2018 FIGO stage, menopausal status, preoperative biopsy histological type, and maximal tumor diameter (MTD) on MR images that the lesions appeared largest, were obtained from medical records. Clinical and imaging records of the patients before 2018 were reviewed by a gynecological oncologist with 10 years of experience and were restaged according to the 2018 FIGO staging criteria [29] . Postoperative pathological examination results were the gold standard in this study. DOI was measured as a percentage of the tumor depth to the cervical radius in millimeters (mm) and was recorded in the pathology report as "tumor depth/cervical radius (mm)" [6] . Superficial 1/3 stromal invasion was defined as "superficial stromal invasion" and middle or deep 1/3 stromal invasion as "middle or deep stromal invasion" [9] . Preoperative pelvic MRI examinations were performed using a Signa Excite 1. The InferScholar Center software (Infervision Medical Technology Co., Ltd., version 3.2) was used for threedimensional manual segmentation [30] . Firstly, the initial DICOM image dataset of each patient was anonymized and then uploaded to the software. Then a radiologist with 10 years of experience in gynecological MRI interpretation delineated the region of interest (ROI) along the tumor contour on each sagittal T2WI slice. In the process, the radiologist also referred to other sequences of images (T1W and diffusion-weighted imaging) to ensure the accuracy of image segmentation. Each segmentation was subsequently validated by a senior radiologist with 17 years of experience in gynecological MRI interpretation. Any disagreement over segmentation was resolved by a consultation to reach a consensus. Both radiologists were aware of the CC diagnosis, but they were blinded to the clinical and histopathological data. After manual segmentation, the original DICOM images and segmentation results were normalized according to pixel spacing and slice thickness to reduce the influence of various acquisition parameters of different MR image systems on the stability of radiomics features [31] . Subsequently, radiomics features, including first-order features, shape-based features, gray level co-occurrence matrix (GLCM) features, gray level dependence matrix (GLDM) features, gray level run length matrix (GLRLM) features, gray level size zone matrix (GLSZM) features, and neighboring gray-tone difference matrix (NGTDM) features were extracted from each ROI. Shape-based features were extracted from the original images, and the other six sets of features were extracted from both the original and processed images. Then, each feature was standardized using z-score normalization to obtain a standard image intensity normal distribution, thereby facilitating good feature robustness [32] . Pycharm (version 2019.1.3; https:// www. jetbr ains. com/) was used for the normalization of the original images and ROI and for extraction of radiomics features. Since not all extracted radiomics features correlated with middle or deep stromal invasion, a three-step feature selection was performed to verify important features with high predictive powers. First, a significance test for each feature was conducted, and features with statistical significance (p < 0.05) were retained. Second, all retained features were matched pairwise, and if the Pearson correlation coefficient between two features was > 0.85, then the feature with the higher p value in the significance test was eliminated, and the remaining features were processed as follows. Finally, a fivefold cross-validationbased least absolute shrinkage and selection operator (LASSO) was applied to select features with nonzero coefficients from the remaining features. LASSO regularization involves parameter λ to control the number of selected features. In this study, the optimal λ was selected as the lowest binomial deviance in the training cohort data, consequently retaining a relatively small number of features to fit further models. After radiomics feature selection, a radiomics model based on logistic regression was built using the training cohort data to predict middle or deep stromal invasion. Radiomics feature selection and logistic regression model building used Rstudio (V.3.5.0; https:// www.r-proje ct. org/). Independent risk factors for predicting middle or deep stromal invasion were identified by univariate and multivariate analysis of the patient data for five clinical characteristics (age, 2018 FIGO stage, menopausal status, preoperative biopsy histological type, and MTD on MRI). A combined model incorporating the independent risk factors and selected radiomics features was built using training cohort data. To clinically apply the diagnostic models, a multivariable logistic regression analysis was applied to build a nomogram based on the training cohort data that visually represented the combined model. The nomogram serves as an individual tool that integrates the radiomics signature with independent risk factors to predict the probability of middle or deep stromal invasion in early stage CC. The radiomics signature in the nomogram was the linear sum of the selected radiomics features and their corresponding coefficients. To compare the diagnostic performance of the radiomics models and radiologists, two radiologists with 5 and 10 years of experience in gynecological MRI interpretation independently determined the presence of middle or deep stromal invasion for each patient in validation cohort (n = 46). They were blind to the patient information but were aware of the CC diagnosis, and they made the decision by browsing through all the MR sequences. Statistical analysis was conducted using R version 3.5.0 (https:// www.r-proje ct. org/) and SPSS version 21.0.0.0 (https:// www. ibm. com). The independent sample t test was used to compare the differences in continuous variables (age and MTD on MRI) between the superficial stromal invasion and middle or deep stromal invasion groups, as well as the training and validation cohorts. The chi-square test was used to evaluate the significance of categorical variables, including 2018 FIGO stage, menopausal status, and preoperative biopsy histological type between the superficial and middle or deep stromal invasion groups. The difference in the prevalence of middle or deep stromal invasion between the training and validation cohorts was also compared using the chi-square test. Fisher's exact test was used for variables with a frequency of less than five. Independent risk factors for middle or deep stromal invasion were identified using multivariate logistic regression analysis with the forward Wald method by inputting significant variables found by univariate analysis. The area under the curve (AUC) of the receiver operating characteristic (ROC) curve was calculated to assess the diagnostic performance of the radiomics model, independent risk factors, and the combined model in the validation cohort. Their sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated to further evaluate their performance, and 95% confidence intervals (CIs) were estimated using 1000-replicate bootstrapping. Interobserver agreement of the two radiologists was evaluated by Cohen's κ coefficient test (< 0.20, poor agreement; 0.21-0.40, fair agreement; 0.41-0.60, moderate agreement; 0.61-0.80, substantial agreement; > 0.80, almost perfect agreement). Statistical significance was set at p < 0.05. Among the 234 enrolled patients, 70 (29.9%) had superficial stromal invasion and 164 (70.1%) had middle or deep stromal invasion. There was no significant difference in the middle or deep stromal invasion prevalence between the training and validation cohorts (p = 0.859). Premenopausal cases accounted for 70.5% of all patients. The percentages of patients with IB1, IB2, IB3, and IIA1 CC were 41.4%, 50.9%, 3.0%, and 4.7%, respectively. The majority of patients (69.2%) had preoperatively proven SCC. MTD on MRI was 23.10 ± 9.28 mm (range 5.8-63.0 mm). There were no significant differences in clinical characteristics between the training and validation cohorts (p > 0.05) ( Table 1) . In the training cohort, 1,454 radiomics features were extracted from each segmented tumor volume on sagittal T2 images of each patient after original image and ROI normalization. To reduce the risk of over-fitting, the three-step feature selection process based on the training cohort data led to the exclusion of non-relevant and redundant features. The first step retained 944 radiomics features with statistical significance (p < 0.05) between superficial stromal invasion group and middle or deep stromal invasion group. Then redundant features were discarded and 130 representative features were proceeded to the following process. Finally, the optimal λ was selected and consequently led to the selection of five significant radiomics features for the prediction of middle or deep stromal invasion (Fig. 2) The results are shown in Table 3 , and the ROC curves are shown in Fig. 3 . The radiomics signature and MTD on MRI were used to develop a nomogram. The nomogram is shown in Fig. 4 . Representative images of middle or deep cervical stromal invasion and superficial cervical stromal invasion are shown in Fig. 5 . The interobserver agreement of the two radiologists was fair (κ = 0.362). The diagnostic accuracy was 73.9% for the senior and 60.9% for the junior. Of the 33 patients with the presence of middle or deep stromal invasion, the junior radiologist identified 21 patients and the senior radiologist identified 25 patients. The sensitivity and PPV of the two radiologists were 63.6% and 77.8%, and 75.7% and 86.2%, respectively. The senior radiologist identified more patients without middle or deep stromal invasion (9/13) than the junior radiologist (7/13), with specificities and NPVs of 69.2% and 52.9% for the senior radiologist, and 53.8% and 36.8% for the junior radiologist (Table 3) . A radiomics model based on T2W images is valuable in the preoperative diagnosis of middle or deep stromal invasion of early stage CC. MTD on MRI was shown to be an independent risk factor of middle or deep stromal invasion. A nomogram was constructed to facilitates the clinical application for the individual prediction of the middle or deep stromal invasion probability in patients with early stage CC. Accurate preoperative diagnosis of middle or deep stromal invasion contributes to optimal treatment decision-making, and facilitate doctor-patient communication on treatment strategy selection. For early stage CC patients, surgery is recommended only when the patient is deemed to have no indication for adjuvant chemoradiotherapy [12] . Among 234 patients included Fig. 3 ROC curves of combined model, radiomics model, and tumor maximum diameter on MRI for predicting middle or deep stroma invasion in the validation cohort. The senior radiologist's performance is indicated by the black cross and the junior radiologist's performance is indicated by the red cross in this study, middle or deep stromal invasion was present in 164 (70.1%) patients, indicating that a significant portion of early stage CC patients are at risk for adjuvant chemoradiotherapy after surgery, especially if other risk factors are present. Based on the tradition MR images, the variability of DOI diagnosis was substantially large among radiologists with different seniority. Furthermore, the sensitivity and specificity of the radiologists were lower than that of the radiomics model based on sagittal T2WI. The results suggest that the radiomics model has the potential to assist in clinical settings by reducing the rate of misdiagnoses in middle or deep stromal invasion and improving confidence in decision-making in oncologic management. Additionally, the better negative predictive value and positive predictive value of the radiomics model verified its potential to assist radiologists and oncologists in doctor-patient communication, which means that if the patient is diagnosed as middle or deep stromal invasion by the radiomics model, primary chemoradiotherapy rather than surgery may be proposed more confidently by the clinicians. To identify the independent risk factors of middle or deep stromal invasion, univariate and multivariate analyses on clinical characteristics were performed. MTD on MRI was the only independent risk factor for middle or deep stromal invasion in early stage CC. The results were consistent with a previous study, where the authors found that when the cut-off value was 20.5 mm, twodimensional MTD on ultrasound showed good performance in predicting deep stromal invasion with an AUC of 0.83, a sensitivity of 90.5%, and a specificity of 61.1% [17] . However, the combined model incorporating a radiomics signature and MTD on MRI only achieved a slight improvement in AUC and the same sensitivity and specificity compared to that of our proposed radiomics model. This suggests that tumor size correlates with DOI but has limited predictive value. The FIGO staging system is considered the most powerful tool in treatment planning and counseling patients regarding the prognosis of CC, and the latest 2018 FIGO allowed imaging findings to stage the disease for the first time [33] . In previous study, radiomics model for survical prediction in CC patients showed better performance when FIGO stage was added [34] . In the current study, the 2018 FIGO stage was not incorporated in the combined model because it showed significant differences between the superficial and middle or deep stromal invasion groups in univariate analysis, but not in multivariate analysis. A possible reason is that the 2018 FIGO staging of early CC includes some information on tumor diameter, but the information is limited as tumor size is only divided into 2 cm or 4 cm. Radiomics features were extracted from the mainstay T2WI sequence, since it provides indications of disease extent and the relationship of the tumor with surrounding tissues in CC [35] . Tumor segmentation is a critical step in radiomics analysis flow. Multiple-segmentation by multiple clinicians is a method to provided reproducible and reliable radiomic features. As reported in previous studies, 95.2% of radiomics features extracted from T2WI showed high reproducibility (intraclass correlation coefficient ≥ 0.75) in different CC segmentations contoured by three observers [36] . Considering that, interobserver segmentation variability was not provided in the present study, but each segmented volumes were carefully validated by a senior radiologist to ensure that the radiomic features are reliable. In addition, a realworld patient cohort was included to reduce selection bias, but the large majority of patients with CC in the present study were excluded considering the potential impact of the long interval and preoperative treatments on the accuracy of pathological results. Three different MRI devices were implemented in the real-world patient cohort, which may had a potential impact on the stability of radiomics features. Thus, the normalized DICOM image and segmentation results allowed each feature to have the same mean of 0 and a standard deviation of 1, contributing to the normal distribution [31, 37] . The standardization process was considered a useful method to facilitate good feature robustness in CC, which was maintained for 94.4-100% of T2W radiomics features [32] . Moreover, the use of high-dimensional features contributed to the good performance of the model. Four of the five selected features obtained after wavelet transformation are high-dimensional features. These features were difficult to decipher through simple observation, but they provide considerably richer information about the intensity, shape, size, and volume of cervical tumors. There are several limitations to this study. First, all patients were recruited from a single center with a relatively small sample size. Although imaging data with different scanning parameters were included, the exclusion of large majority of CC patients in our institution may increase the risk of selection bias. The diagnostic performance of the radiomics model should be validated with a larger multicenter data in the future. Second, the radiomics features were only derived from sagittal T2W images. Previous studies showed the value of diffusion-weighted images and contrasted-enhanced images in predicting prognostic factors such as lymph node metastasis and lymphovascular space invasion in CC patients [20, 21] . Finally, the ROI segmentation was only conducted by one radiologist in this study. The extracted radiomics features robustness to segmentation variabilities was unknown, and the overfitting risk caused by irreproducible features existed. Therefore, radiomics studies on multiple segmentations are necessary. In conclusion, radiomics model based on T2WI outperformed radiologists in the preoperative diagnosis of middle or deep stromal invasion in patients with early stage CC. MTD on MRI is an independent risk factor for middle or deep stromal invasion but has limited sensitivity and specificity. In addition, the nomogram incorporating a radiomics signature and MTD on MRI can evaluate the probability of middle or deep stromal invasion for early stage CC. Global cancer statistics 2020: GLO-BOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries Cervical cancer prevalence, incidence and mortality in low and middle income countries: a systematic review The European Society of Gynaecological Oncology/European Society for Radiotherapy and Oncology/European Society of Pathology guidelines for the management of patients with cervical cancer Role of adjuvant therapy after radical hysterectomy in intermediate-risk, early-stage cervical cancer Depth of cervical stromal invasion as a prognostic factor after radical surgery for early stage cervical cancer Classification of radical hysterectomy Update on the Querleu-Morrow classification of radical hysterectomy National Comprehensive Cancer Network. NCCN Clinical Practice Guidelines in Oncology. Cervical Cancer (Version 1.2020) Primary surgery versus primary radiation therapy with or without chemotherapy for early adenocarcinoma of the uterine cervix Randomized study between radical surgery and radiotherapy for the treatment of stage IB-IIA cervical cancer: 20-year update Cervical cancer: ESMO clinical practice guidelines for diagnosis, treatment and follow-up Comparison of MRI and high-resolution transvaginal sonography for the local staging of cervical cancer Stage IB1 cervical cancer: role of preoperative MR imaging in selection of patients for fertility-sparing radical trachelectomy 18)F-FDG PET/CT can correct the clinical stages and predict pathological parameters before operation in cervical cancer The feasibility of (18)F-FDG PET/CT for predicting pathologic risk status in early-stage uterine cervical squamous cancer Preoperative prediction of lymph node metastasis and deep stromal invasion in women with invasive cervical cancer: prospective multicenter study using 2D and 3D ultrasound Radiomics: extracting more information from medical images using advanced feature analysis Radiomics analysis of magnetic resonance imaging improves diagnostic performance of lymph node metastasis in patients with cervical cancer Multiparametric MRI-based radiomics nomogram for predicting lymph node metastasis in early-stage cervical cancer MR-based radiomics nomogram of cervical cancer in prediction of the lymph-vascular space invasion preoperatively Radiomics analysis of multiparametric MRI evaluates the pathological features of cervical squamous cell carcinoma Preoperative prediction of parametrial invasion in early-stage cervical cancer with MRI-based radiomics nomogram Pretreatment MRI radiomics based response prediction model in locally advanced cervical cancer. Diagnostics (Basel) Multi-habitat based radiomics for the prediction of treatment response to concurrent chemotherapy and radiation therapy in locally advanced cervical cancer Radiomic features of cervical cancer on T2-and diffusion-weighted MRI: prognostic value in low-volume tumors suitable for trachelectomy Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement Checklist for artificial intelligence in medical imaging (CLAIM): a guide for authors and reviewers Revised FIGO staging for carcinoma of the cervix uteri Deep learning-based triage and analysis of lesion burden for COVID-19: a retrospective study with external validation A computed tomography-based radiomic prognostic marker of advanced high-grade serous ovarian cancer recurrence: a multicenter study Robustness of magnetic resonance radiomic features to pixel size resampling and interpolation in patients with cervical cancer Cancer of the cervix uteri Radiomic score as a potential imaging biomarker for predicting survival in patients with cervical cancer Staging, recurrence and follow-up of uterine cervical cancer using MRI: updated guidelines of the European Society of Urogenital Radiology after revised FIGO staging 2018 Repeatability and reproducibility of MRI-based radiomic features in cervical cancer Multicenter evaluation of MRIbased radiomic features: a phantom study Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations The authors extend gratitude to Jiang-Fen Wu and Wei-Xiong Tan for their guidance on the materials and methods of the study. The datasets analyzed during the current study are available from the corresponding author on reasonable request. Ethics approval and consent to participate Our institutional review board approved this retrospective study, so the requirement for informed consent was waived. Not applicable.