Development and validation of a CT-based radiomics nomogram for preoperative prediction of tumor histologic grade in gastric adenocarcinoma
2021-03-13JiaHuangHuashengYaoYexingLiMengyiDongChuHanLanHeXiaomeiHuangTingXiaZongjianYiHuihuiWangYuanZhangJianHeChanghongLiangZaiyiLiu
Jia Huang,Huasheng Yao,Yexing Li,Mengyi Dong,Chu Han,Lan He,Xiaomei Huang,Ting Xia,5,Zongjian Yi,6,Huihui Wang,Yuan Zhang,Jian He,Changhong Liang,Zaiyi Liu
1Graduate College,Shantou University Medical College,Shantou 515041,China;2 Department of Radiology,Guangdong Provincial People’s Hospital,Guangdong Academy of Medical Sciences,Guangzhou 510080,China;3School of Automation Science and Engineering,South China University of Technology,Guangzhou 510006,China;4Graduate College,Southern Medical University,Guangzhou 510515,China;5School of Medicine,South China University of Technology,Guangzhou 510006,China;6 School of Biomedical Sciences and Engineering,South China University of Technology,Guangzhou 510006,China;7Department of Interventional Radiology,Cancer Center,Guangdong Provincial People’s Hospital,Guangdong Academy of Medical Sciences,Guangzhou 510080,China
Abstract Objectives:To develop and validate a radiomics nomogram for preoperative prediction of tumor histologic grade in gastric adenocarcinoma (GA).Methods:This retrospective study enrolled 592 patients with clinicopathologically confirmed GA (low-grade:n=154;high-grade:n=438) from January 2008 to March 2018 who were divided into training (n=450) and validation(n=142) sets according to the time of computed tomography (CT) examination.Radiomic features were extracted from the portal venous phase CT images.The Mann-Whitney U test and the least absolute shrinkage and selection operator (LASSO) regression model were used for feature selection,data dimension reduction and radiomics signature construction.Multivariable logistic regression analysis was applied to develop the prediction model.The radiomics signature and independent clinicopathologic risk factors were incorporated and presented as a radiomics nomogram.The performance of the nomogram was assessed with respect to its calibration and discrimination.Results:A radiomics signature containing 12 selected features was significantly associated with the histologic grade of GA (P<0.001 for both training and validation sets).A nomogram including the radiomics signature and tumor location as predictors was developed.The model showed both good calibration and good discrimination,in which C-index in the training set,0.752 [95% confidence interval (95% CI):0.701-0.803];C-index in the validation set,0.793 (95% CI:0.711-0.874).Conclusions:This study developed a radiomics nomogram that incorporates tumor location and radiomics signatures,which can be useful in facilitating preoperative individualized prediction of histologic grade of GA.
Keywords:Adenocarcinoma;histologic grade;nomograms;stomach neoplasm;X-ray computed tomography
Introduction
Although the incidence of gastric cancer has decreased due to improved primary prevention,the mortality rate of gastric cancer still ranks third globally (1,2).In 2012,there were an estimated 952,000 new cases,of which nearly three-quarters occurred in Asia,and more than two-fifths occurred in China (3).Gastric adenocarcinoma (GA)accounts for 95% of gastric malignancies (4).Referring to the World Health Organization (WHO) classification (5th edition),well and moderately differentiated GAs are classified as low-grade,and poorly differentiated GAs are classified as high-grade for better prognostic significance and reproducibility (5,6).The histologic grade of tumor differentiation has been proven to be associated with clinical outcomes (7-13).Patients with high-grade disease face a higher risk of lymph node metastasis and recurrence than those with low-grade disease,which leads to a poorer prognosis after curative surgery (14,15).Recent studies have demonstrated that patients with limited metastatic disease exhibit favorable overall survival after receiving neoadjuvant chemotherapy (16,17).However,the risk of lymphopenia and hemoglobinopathy is also significantly increased (18).To alleviate the side effects brought by excessive preoperative treatments,it is worth predicting the histologic grade of GA.Furthermore,predicting the preoperative histologic grade can also help identify patients with a higher risk of recurrence and help select individualized and proper therapeutic strategies,which will ultimately improve the outcome.
Currently,preoperative endoscopic biopsy is used as the gold standard for the diagnosis of GA and its histologic grade (19).However,as an invasive procedure,endoscopic biopsy carries a risk of iatrogenic perforation and postendoscopic infection (20,21).Additionally,intratumor heterogeneity may inevitably produce sampling bias,and the histological results may be inconsistent with those obtained during surgery (22,23).For patients with“indefinite for neoplasm/dysplasia”lesions,the diagnostic accuracy for adenocarcinoma is even lower than 70% (24).Therefore,an auxiliary method in addition to biopsy for GA grading may be beneficial.
In clinical practice,contrast-enhanced computed tomography (CT) is used for preoperative staging of GA(19).Liuet al.leveraged the textures of CT images to predict gastric cancer differentiation degree,Lauren classification,and vascular invasion status (25).However,subjective selection of CT textures may cause intra-and inter-observer variability.Liet al.studied iodine concentration in spectral CT,which demonstrated potential in the diagnosis of gastric cancer and its histologic types (22).However,spectral CT is still not widely used.Recently,radiomics has been proposed to convert medical images into high-dimensional features,which allows highthroughput data mining and quantitative feature measurements (26,27).In addition,a number of studies have shown that radiomic feature extraction and comprehensive feature analysis are conductive to individualized management for patients (28-31).Although a previous study used radiomics signatures to predict adverse histopathological status (including WHO grade) of gastric cancer (32),an optimal prediction model combining radiomics signatures and clinical features has yet to be developed.
In this research,we aimed to associate the radiomics signature with clinical characteristics to construct and validate a radiomics nomogram to assist in predicting the histologic grade of GAs preoperatively.
Materials and methods
Patients
This study was approved by the institutional research board of Guangdong Provincial People’s Hospital. The requirement for informed consent was waived because this was a retrospective study.The records and images of 738 patients with GA in Guangdong Provincial People’s Hospital from January 2008 to May 2018 were collected and obtained.All these patients with GA satisfied the following inclusion criteria:1) contrast-enhanced CT examination ≤1 month preoperatively;2) visible tumor lesions on CT images;3) postoperative specimen with pathologic diagnosis of histologic differentiation grade of GA; and 4) received no preoperative neoadjuvant chemotherapy or radiotherapy.A total of 146 cases were excluded from the 738 cases collected.More details can be found inFigure 1.
Figure 1 Flowchart of patient enrollment.
The 592 enrolled patients with GA (mean age,58.83±12.31;range,26-90) years were separated into training and validation sets based on the CT examination time.The training set consisted of 450 patients,and the validation set consisted of 142 patients.
We collected baseline clinicopathologic characteristics,including age,sex,clinical stage,CT-reported tumor location,preoperative carcinoembryonic antigen level(CEA,0-5 ng/mL:normal;>5 ng/mL:abnormal),carbohydrate antigen (CA19-9,0-27 U/mL:normal;>27 U/mL:abnormal),time of preoperative CT scan and date of surgery,preoperative and postoperative histologic grade,clinical stage,CT-reported main tumor location and T stage from the institution archives.There were no missing data for any of the clinical messages described above except for 77 missing preoperative histologic grades.The relative location of the tumor inside the stomach was also taken into consideration.According to the third edition of the Japanese classification of gastric carcinoma (33),tumors of the stomach were anatomically designated into three parts according to the main tumor location on CT:upper,middle and lower portions.The gastric stump was listed as the fourth location since it does not belong to any of the aforementioned locations.The clinical stage and CTreported T stage were diagnosed based on the American Joint Committee on Cancer (AJCC) staging system (8th edition) (34).
Histologic grade assessment
Pathohistologic examination of each pre- and postoperative specimen was performed by two pathologists blinded to the CT findings.In concordance with the 5th edition of the WHO tumor classification of the digestive system (5) and with the consent of the two pathologists,the histologic results were divided into low-and high-grade.
CT image acquisition protocol
Before the CT scan,each patient fasted for more than 5 h and was asked to drink 600-1,000 mL of water.Then,routine non-enhanced CT was performed with multidetector row CT (MDCT) scanners,which covered the whole stomach region.Then,iodinated contrast(Ultravist 370,Bayer Schering Pharma,Berlin,Germany)was injected into the antecubital vein through an automatic power pump injector (Ulrich CT Plus 150,Ulrich Medical,Ulm,Germany) at a dose of 1.5 mL/kg and a rate of 3.5 mL/s.After 30-and 60-second delaying intervals,arterial phase and portal venous phase images were obtained,respectively.The CT scanning parameters are shown inSupplementary Table S1.
Radiomics feature extraction and feature analysis methodology
Since most gastric tumor lesions showed significant enhancement in the portal venous phase and could be well delineated from the adjacent tissues,CT images of this phase were obtained from the picture archiving and communication system (PACS,Carestream,Canada) for further image feature extraction.ITK-SNAP software(Version 3.6.0,http://www.itksnap.org,USA) was used for manual delineation of visible tumor contours and obtaining regions of interest (ROIs) for radiomics analysis by two radiologists with 2 and 5 years of experience in the diagnosis of gastrointestinal images (35).Intragastric air,necrosis area,enlarged lymph nodes,and perigastric adipose tissue were carefully excluded from the contours(Figure 2).
Then,a volume of interest (VOI) was constructed by integrating the sequences of ROIs.Before feature extraction,the CT image sequence was rescaled with one pixel size of 1 mm × 1 mm × 1.25 mm by linear interpolation.The pixels from -300 Housefield units (HU)to 700 HU were normalized to the range of [1,100]by applying min-max normalization.After that,the radiomic features of VOIs were extracted and analyzed using the embedded algorithms provided by MATLAB R2019b(MathWorks,Natick,MA,USA).Extracted features consisted of first-order intensities,texture features,wavelet features,and shape-and size-based features.
Selection of radiomics features and construction of radiomics signature
Figure 2 An example of manual segmentation in gastric adenocarcinoma. (A) A diffusely infiltrating mass with enhancement is shown on the portal venous phase CT image;(B)The manually segmented area is shown on the same axial slice.
To investigate intra-and inter-observer reproducibility,25 cases were chosen randomly from the training set,and VOI delineation and radiomics feature extractions were repeated by the two radiologists.To determine the more robust features,the reproducibility and stability of the extracted features were assessed by intra-and inter-class correlation coefficients (ICCs),which were calculated based on feature extractions by reader 1 twice and once by each of the two readers,respectively.Only stable features with ICC>0.90 were reserved (36).
For data standardization,features of the training and validation sets were first normalized by the mean () and standard deviation (SD) of the training set with z-score according to the following formula:
To reduce the feature dimension and to filter out redundant features,the Wilcoxon rank-sum test was introduced to further determine the features with a statistically significant difference (P<0.05) in two labels from the preselected features.The least absolute shrinkage and selection operator regression (LASSO) algorithm was utilized for final dimensionality reduction.Ten-fold crossvalidation was used to select the tuning parameter (λ) by using the minimum criteria and 1 standard error of the minimum criteria (the 1-SE criteria).To lower the risk of overfitting,the log (λ) corresponding to 1-SE was chosen,where there are 12 nonzero coefficients.The details are provided inFigure 3.The features with the most significance and the best reproducibility were ultimately preserved.After that,the coefficients of each feature were calculated by a generalized linear model.The radiomics score,i.e.,the Rad-score,of every single patient was calculated by a weighted linear combination of the features and their corresponding coefficients.
Assessment of radiomics signature predictive performance
The discriminative power of the radiomics signature was tested by the receiver operating characteristic (ROC) curve.The maximum positive likelihood ratio (true positive value/false positive value) was used to determine the optimal cutoff value.Note that the cutoff value of the validation set was directly drawn from the training set to guarantee independence.The specificity,sensitivity and accuracy were also computed using the same cutoff value.The area under the receiver operating characteristic curve(AUC) was also introduced to evaluate the effectiveness of the predictive model in both the training and validation sets.
Radiomics nomogram construction
Figure 3 Feature selection with LASSO binary logistic regression model.(A) Tuning parameter (λ) selection in the LASSO model using 10-fold cross-validation.AUC was plotted vs.log (λ).Using the minimum criteria and the 1 standard error of the minimum criteria (the 1-SE criteria),dotted vertical lines were drawn at the best value.The 1-SE criteria were chosen according to 10-fold cross-validation,whereas 12 radiomics features were chosen;(B) LASSO coefficient profiles of features.A coefficient profile plot was plotted vs.log (λ).As λ becomes larger,the coefficients of more features shrunk to 0.Each colored line represents the coefficient of each feature.The vertical grey line was drawn at the selected λ,where 12 features had nonzero coefficients.LASSO,least absolute shrinkage and selection operator;AUC,area under the receiver operating characteristic curve.
Multivariable logistic regression analysis was performed to establish a prediction model by combining the radiomics signature and independent clinicopathologic predictors with P values less than 0.05 in the univariable analysis.We visualized the model as a radiomics nomogram in the training set based on multivariable logistic analysis to promote the clinical application value of the prediction model.
Assessment of radiomics nomogram predictive performance
The ROC curve was applied in the assessment of the discriminative performance of the radiomics nomogram.DeLong’s test was executed to calculate the statistical significance in comparing differences in AUCs between the radiomics nomogram and the clinical model and between the radiomics nomogram and biopsy results.The radiomics nomogram was assessed by a calibration curve.
Statistical analysis
Statistical analysis was performed using IBM SPSS Statistics (Version 26.0;IBM Corp.,New York,USA) and R software (Version 3.5.1;R Foundation for Statistical Computing,Vienna,Austria).Student’sttest and Pearson Chi-square test were used to compare continuous and categorical data,respectively.All P values were two sided,and P<0.05 was considered statistically significant.
Results
Clinical characteristics
In the total 592 cases included,515 accepted preoperative biopsies,among which 97 with preoperative diagnosis of low-grade GA and 346 patients with preoperative diagnosis of high-grade GA had identical postoperative results as the preoperative ones. Forty of the patients who were preoperatively diagnosed with low-grade GA were postoperatively diagnosed with high-grade GA;among them,28 patients’ radiomic results were consistent with their postoperative results.Thirty two of the patients who were preoperatively diagnosed with high-grade GA had postoperative low-grade results;among them,23 patients’radiomic results were consistent with their postoperative low-grade results.The training set consisted of 300 males and 150 females,with average age of 58.54±12.29 (range:26-90) years,and the validation set consisted of 88 males and 54 females,with average age of 59.77±12.38 (range:30-87) years.No statistically significant difference was found in age or sex between the training and validation sets(P=0.296;P=0.305,respectively).
Table 1demonstrates the clinical characteristics ofpatients included in the study.Age,sex and tumor location exhibited significant differences between patients with lowand high-grade GAs in both the training and validation sets(P=0.001 and P=0.004;P=0.003 and P<0.001;P<0.001 and P<0.001).In contrast,CEA and CA19-9 levels showed no significant difference between them in either the training or validation sets (P>0.05).
Table 1 Characteristics of patients in training and validation sets
Radiomics features reproducibility
For each patient,the VOI was extended into a total of 16,384 three-dimensional radiomics features,of which 9,449 features were strongly reproducible (ICC>0.90).For further selection,the Wilcoxon rank-sum test was conducted to filter the features with statistically significant differences between GA of different grades and drew 427 filtered features.
Radiomics features selection and signature construction
The LASSO logistic regression model was used to select the final 12 features with relatively high predictive outcome from the 427 robust features in the training set (Figure3A,B).The radiomics score (Rad-score) of each case with GA was calculated by a weighted linear combination of these twelve features and their corresponding coefficients.
Radiomics signature predictive performance
For low-and high-grade GA patients,the median Radscores were 0.283 and 0.215 in the training set and 0.302 and 0.189 in the validation set,respectively.Regardless of the training set or the validation set,the median Rad-scores in the low-grade group were higher than those in the highgrade group (P<0.001 for both).The ROC curves shown inFigure 4yield AUCs of 0.705 [95% confidence interval(95% CI):0.650-0.760]for the training set and 0.720 (95%CI:0.630-0.810) for the validation set,demonstrating the discriminative power of the radiomics signature.The statistical results,including sensitivity,specificity,positive predictive value (PPV),negative predictive value (NPV)and accuracy,are demonstrated inTable 2.
Radiomics nomogram construction
Figure 4 Receiver operating characteristic (ROC) curves of radiomics nomogram,radiomics signature and tumor location alone in training set (A) and validation set (B) respectively.Calibration curves of radiomics nomogram in the training set (C) and the validation set (D).AUC,area under the receiver operating characteristic curve;95% CI,95% confidence interval.
Table 2 Predictive performance of radiomics nomogram,clinical characteristics and radiomics signature in training and validation sets
In the univariate analysis,sex,age,radiomics score,CEA level and tumor location were associated with the histologic grade of GA.Multivariable logistic regression analysis identified the radiomics score and tumor location as independent predictors (Table 3).The tumor location was integrated into the nomogram with the radiomics signature(Figure 5).
Radiomics nomogram predictive performance
Figure 4demonstrates the ROC curves of the radiomics nomogram,radiomics signature and tumor location alone.The radiomics nomogram demonstrated relatively good discriminative power for histologic grade prediction among the three indicators,with improvements in the AUC from 0.662 for the clinical model to 0.752 (P<0.05,DeLong’s test) in the training set and from 0.716 for the clinical model to 0.793 (P=0.03,DeLong’s test) in the validation set.In our datasets,biopsy outperformed the nomogram,with AUCs of 0.824 in the training set (P<0.05,DeLong’s test) and 0.825 in the validation set (P<0.05,DeLong’s test).The calibration curve of the radiomics nomogram shown inFigure 4depicted good agreement between the observed outcome and the prediction.
Discussion
In this study,a radiomics nomogram that preoperatively predicts the histologic grade of GA based on tumor location and radiomics signatures extracted from CT images was developed and validated.It demonstrated AUCs of 0.752 and 0.793 in the training and validation sets,respectively.
In practice,surgeons conventionally use endoscopic biopsy for the diagnosis of GA and its histologic grade.However,the risks of gastric perforation,bleeding,abdominal pain,postendoscopic infection,and an overall discordance rate of 21.5% between pathohistologic results of endoscopic biopsy and postoperative specimens in mucosal GA indicates a rather limited value of endoscopic biopsy (19-21,37).The comparison between the preoperative and postoperative pathohistologic results ofour study also exhibited a difference of 13.98%,which further verifies that the existence of intra-tumor heterogeneity could lead to a substantial discordance rate while affecting clinical decisions.Thus,researchers are now investigating auxiliary noninvasive approaches to precisely predict the histologic grade of GA preoperatively.A previous study analyzed CT texture to predict pathohistological characteristics,including the differentiation degree of gastric cancer,with a sensitivity of 78.7% and a specificity of 83.3% (25).However,CT texture is subjectively assessed and may lead to intra-and interobserver variability.Therefore,we controlled this variability in our study by constructing a radiomics signature with more reproducible radiomics features(ICC>0.90).Linet al.explored and discovered the potential diagnostic significance of preoperative CEA/CA19-9 levels(38),so we included these tumor markers in our study,in addition to other clinical and semantic imaging data,including age,sex and CT-reported tumor location,which were analyzed by univariable analysis.Then,a radiomics nomogram was modeled by combining the radiomics features with independent predictive factors drawn by multivariable analysis.In 2018,a published study showed that normalized arterial phase iodine concentration in spectral CT demonstrated a high efficiency in diagnosing poorly differentiated GA (22).Currently,spectral CT is not widely used in daily clinical practice.Liet al.applied machine learning-based computational models to predict adverse histopathological status,including WHO grade,of gastric cancer,with AUCs of 0.65 and 0.63 in the training and validation sets,respectively (32),while the AUCs of the radiomics nomogram developed in our study were higher(0.752 in the training set and 0.793 in the validation set).Although biopsy outperformed the radiomics model in our study,as a noninvasive method,radiomics still has the potential to aid in diagnosis when the patient cannot tolerate an endoscopic examination due to coagulation disorders or critically ill status or refuses to undergo endoscopy.Since contrast-enhanced CT is routinely performed for the staging and management of GA,CTbased radiomics nomograms may be more applicable and more popular in clinical practice.The nomogram may facilitate preoperative identification of patients with a higher risk of recurrence,worse clinicopathological features and poorer prognosis.
Table 3 Univariable and multivariable analysis of risk factors
Figure 5 Developed radiomics nomogram.The radiomics nomogram was developed in the training set with radiomics signature and tumor location incorporated.Location,tumor location of the stomach (1,upper-third;2,middle-third;3,lower-third;4,gastric stump).
There are some limitations in this study.The radiomics features used in the study were extracted only from the portal venous phase CT images because the differentiation between the tumor and the adjacent normal gastric tissues was maximal in this phase.Nonenhanced scans,arterial phase,and venous phase images with extracted radiomics features should be investigated in further studies.Because of our retrospective collection of cases,the quality of CT images acquired by different CT machines may be unstable.Prospective studies are expected to avoid confounding factors and to assure discriminative power in GA histologic grades of radiomics signatures.
Conclusions
A radiomics nomogram in discriminating low-grade GA from high-grade GA before treatment was developed and validated.As a quantitative and noninvasive approach,the radiomics nomogram may play a part in facilitating personalized management and improving outcomes of patients with GA.
Acknowledgements
This study was supported by the National Key Research and Development Program of China (No.2017YFC 1309100),the National Science Fund for Distinguished Young Scholars (No.81925023),and the National Natural Science Foundation of China (No.82071892,81771912,81901910).
Footnote
Conflicts of Interest:The authors have no conflicts of interest to declare.
Table S1 CT scanning parameters for patients
杂志排行
Chinese Journal of Cancer Research的其它文章
- Colorectal cancer burden and trends:Comparison between China and major burden countries in the world
- Changing trends of disease burden of gastric cancer in China from 1990 to 2019 and its predictions:Findings from Global Burden of Disease Study
- Clinical characteristics and clinicopathological correlations of bilateral breast cancer in China:A multicenter study from Chinese Society of Breast Surgery (CSBrS-006)
- Multi-center investigation of breast reconstruction after mastectomy from Chinese Society of Breast Surgery:A survey based on 31 tertiary hospitals (CSBrS-004)
- Laparoscopic vs. open surgery for gastrointestinal stromal tumors of esophagogastric junction:A multicenter,retrospective cohort analysis with propensity score weighting
- Radiotherapy combined with nimotuzumab for elderly esophageal cancer patients:A phase II clinical trial