APP下载

Quantitative multiparametric magnetic resonance imaging can aid non-alcoholic steatohepatitis diagnosis in a Japanese cohort

2021-04-13KentoImajoLouiseTetlowAndreaDennisElizabethShumbayawondaSofiaMouchtiTimothyKendallEveFryerShogiYamanakaYasushiHondaTakaomiKessokuYujiOgawaMasatoYonedaSatoruSaitoCatherineKellyMattKellyRajarshiBanerjeeAtsushiNakajima

World Journal of Gastroenterology 2021年7期

Kento Imajo, Louise Tetlow, Andrea Dennis, Elizabeth Shumbayawonda, Sofia Mouchti, Timothy J Kendall,Eve Fryer, Shogi Yamanaka, Yasushi Honda, Takaomi Kessoku, Yuji Ogawa, Masato Yoneda, Satoru Saito,Catherine Kelly, Matt D Kelly, Rajarshi Banerjee, Atsushi Nakajima

Abstract

Key Words: Corrected T1; Fibro-inflammation; Non-invasive imaging; Non-alcoholic steatohepatitis; Multiparametric magnetic resonance imaging; Non-alcoholic fatty liver disease

INTRODUCTION

Non-alcoholic fatty liver disease (NAFLD) is the most common cause of chronic liver disease, affecting approximately 25% of the general population worldwide[1]and up to 30% of the general population in Japan[2]. Additionally, up to 19% of individuals in south east Asia with body mass index (BMI) ≤ 25 kg/m2have “lean” or “non-obese”NAFLD[3,4]. The scope of disease aetiologies within NAFLD differ in both clinical significance and prognosis[1,5-7]on a continuum that encompasses “simple steatosis” or NAFL (hepatic steatosis without inflammation), to non-alcoholic steatohepatitis(NASH) and NASH cirrhosis. NASH results when fat accumulation in the liver triggers inflammatory signals and reactive oxygen species that can amplify liver injury and stimulate fibrosis[8]. NASH is predicted to become the leading cause of liver transplant over the coming decade[1]as NASH patients have a greater tendency to develop advanced liver fibrosis, cirrhosis, and hepatocellular carcinoma[9-11].

Liver biopsy is the current gold standard for differentiating simple steatosis from NASH as well as staging the severity of fibrosis in patients with NAFLD[12]. However,due to the limitations associated with biopsy[13], high procedure costs, high levels of discordance between readers, and poor acceptability by patients, there has been an increase in the use of non-invasive imaging biomarkers to diagnose and monitor the disease.

Vendor-neutral and scalable multiparametric magnetic resonance imaging (mpMRI)measurements of liver fat proton density fat fraction (PDFF) with IDEAL and iron corrected T1-mapping (cT1) are emerging as promising quantitative imaging biomarkers for NASH. MRI liver fat correlates strongly with histologically graded steatosis across the clinical range seen in NASH[14]and has high diagnostic accuracy in stratifying all grades of liver steatosis[15-17]. cT1 correlates with ballooning[18], and has been shown to predict clinical outcomes in patients with chronic liver disease[19,20]. Both metrics have good technical validity with high repeatability and reproducibility across MRI manufacturers and field strengths[21]. Additionally, due to their sensitivity to subtle changes in hepatic fat and fibro-inflammation, mpMRI techniques are increasingly used as inclusion criteria endpoints in NASH clinical trials and are included in the FDA Biomarker Qualification Program.

Magnetic resonance elastography (MRE) is an alternative MR based approach that can be used to stage fibrosis. MRE has shown utility in identifying patients with NASH from those with simple steatosis[22]whilst also being able to detect the presence of advanced fibrosis in patients with chronic liver disease[23]. However, MRE has not demonstrated sufficient utility for the longitudinal monitoring of fibrosis progression or regression[14,24].

Ultrasound based methods such as vibration-controlled transient elastography(VCTE) liver stiffness measure (LSM) have shown good utility in identifying patients with advanced fibrosis[25,26], however they may be less reliable in patients who are morbidly obese; a high-risk group for NASH[25]. VCTE controlled-attenuation parameter (CAP) has been shown to be sensitive to early changes in liver fat albeit with a low ability to differentiate steatosis levels.

The aim of this study was to evaluate the diagnostic performance of quantitative mpMRI, MRE, and transient elastography (VCTE and 2 D shear-wave elastography) to identify patients with suspected NASH and to report on the correlations between these non-invasive technologies and histology.

MATERIALS AND METHODS

This was an observational trial conducted and sponsored by the Yokohoma City University Hospital between January 2019 and February 2020. N = 151 adult participants who underwent a liver biopsy for suspected NASH were included in this interim analysis. Participants were invited to undergo core liver biopsy if there was evidence of steatosis (Proton density fat fraction ≥ 5.2% or CAP ≥ 236)[16]. Patients were excluded if there was contraindication to MRI, history of alcoholism, or evidence of other chronic liver disease (Figure 1). The study was conducted in accordance with the ethical principles of the Declaration of Helsinki 2013, was approved by the Ethics Committee of Yokohama City University Hospital and was registered as a clinical trial(UMIN Clinical Trials Registry: UMIN000026145).

Histopathological evaluations

Figure 1 CONSORT diagram of patient pathway and inclusion into the study. mpMRI: Quantitative multiparametric magnetic resonance imaging; CAP:Controlled-attenuation parameter; MRE: Magnetic resonance elastography; NASH: Non-alcoholic steatohepatitis; NAFLD: Non-alcoholic fatty liver disease; VCTE:Vibration-controlled transient elastography.

Liver biopsy samples were obtained using a 16-gauge needle biopsy kit with an adequate liver biopsy defined as being ≥ 20 mm in length and/or with ≥ 10 portal tracts. Biopsy samples were assessed independently by three histopathologists, one at YCUH at the time of collection and then later by a further two pathologists using digitalised biopsy slides. Histological scoring of fibrosis, steatosis, lobular inflammation, and hepatocyte ballooning was performed by all pathologists, and overall disease activity was graded according to the NAFLD activity score [NASH Clinical Research Network (NAS)]. NASH was defined as NAS ≥ 4, ballooning ≥ 1 and inflammation ≥ 1 as described in AASLD guidelines[12]. All scores were taken from the median score for each component from the three pathologists.

Non-invasive biomarkers

MR liver fat and cT1 measurements were made using the non-contrast LiverMultiScan®mpMRI protocol (Perspectum, Oxford, United Kingdom), performed with the patient in the supine position using a 3 Tesla GE Discovery 750 W scanner system (GE Healthcare, Milwaukee, WI, United States). The average scan time for this protocol was 10 min. Four single transverse slices were captured through the liver centered on the porta hepatis. Anonymised mpMRI data were analysed off-site by image analysts trained in abdominal anatomy and artefact detection, who were blinded to the clinical data and risk grouping. For MR liver fat, three 15 mm diameter circular regions of interest (ROIs) were selected on the transverse maps to cover a representative sample of the liver parenchyma. For cT1 (ms), three ROIs were placed on the central slice within the typical percutaneous biopsy region. Median values from all pixels within the ROIs were calculated and used as the representative score.

MRE examinations were performed using the same MRI scanner and following a 2-dimensional MRE protocol[27]. Interpretation of MRE images to obtain stiffness values was performed by abdominal radiologists[22]. VCTE-LSMs were obtained by one operator using either 3.5 MHz M-probe and/or 2.5 MHz XL-probe dependent upon suitability (waste-hip circumference and BMI) and through use of the automatic probe selection tool embedded within the Fibroscan operating software[28]. VCTE-LSM measures with at least 10 valid shots and a success rate of ≥ 60% were considered reliable and used for statistical analysis. Hepatic steatosis was assessed using the CAP value provided by the device. 2 D shear-wave elastography (2 D-SWE) measurements were obtained using a Logiq S8 system (GE Healthcare)[29]. Example images are shown in Figure 2 for all methods.

Statistical analysis

Descriptive statistics were used to summarise baseline participant characteristics.Continuous variables were reported as mean and standard deviation, categorical variables were reported as frequency and percentage, and confidence intervals (CI)were reported at the 95% level. Mean difference in biomarker values between those with NAFL and those with NASH were compared using independent t-tests or Wilcoxon signed-rank tests, depending on distribution. Diagnostic performance of non-invasive biomarkers was assessed using area under receiver operator characteristic curve (AUC) with multivariate logistic regressions utilised to assess performance of combined biomarkers. Correlations between median scores from the 3 pathology reads and image-derived markers (MR liver fat, cT1, MRE, VCTE-LSM and CAP) were assessed using Spearman’s rank correlation coefficient.

Inter-rater variability analysis between the pathology scores given by the 3 pathologists were performed using tri-variate weighted kappa statistic, with the overall variability seen for each of the histological metrics assessed by Krippendorff’s alpha[30].

Statistical analysis was performed using R software version 3.6.1 with P < 0.05 considered to be statistically significant. Case-wise deletion was employed to include only complete cases for MR metrics, VCTE-LSM and CAP for each analysis as appropriate rather than imputing any missing values.

RESULTS

Of the initial 151 patients who underwent liver biopsy, 145 were eligible for the study.The average age was 60 (± 13) years, 61% patients were male, 40% BMI ≥ 30 kg.m-2and 60% of patients had NASH (NAS ≥ 4 with ballooning ≥ 1 and inflammation ≥ 1;Table 1). There was a broad range of histology scores across all aspects of the key histopathological hallmarks of NASH (Table 2). From the entire cohort, one had missing MRE data, two had missing LiverMultiScan®data, 28 missing or uninterpretable Fibroscan data, and 61 had missing or uninterpretable 2 D-SWE data.Investigation into potential causes of failure for the ultrasound based methods revealed BMI to be significantly higher in the cases in which VCTE-LSM was unreported (31.7 kg/m2vs 28.1 kg/m2, P < 0.05), and similarly was elevated in those with missing 2 D-SWE (29.5 kg/m2vs 27.9 kg/m2).

Diagnostic accuracy of non-invasive imaging markers for differentiation of disease activity

Steatosis (NAFL): MR Liver fat [AUC: 0.92 (CI: 0.87-0.98)], and CAP [AUC: 0.75 (CI:0.58-0.92)] both discriminated between steatosis ≥ 1, while the other imaging markers were unable to [cT1 AUC: 0.64 (CI: 0.46-0.82), MRE AUC: 0.53 (CI: 0.33-0.72), VCTELSM AUC: 0.60 (CI: 0.37-0.82), 2 D-SWE AUC: 0.53 (CI: 0.22-0.84)]. For steatosis ≥ 2, MR liver fat again showed the best performance [MR liver fat: AUC: 0.86 (CI: 0.80-0.93); vs CAP AUC: 0.68 (CI: 0.59-0.78)]. 7 patients had CAP technical failure and 10 patients were not able to gain reliable results.

NASH:To diagnose steatohepatitis (NAS ≥ 4 with at least one in both ballooning and inflammation), MR liver fat [AUC: 0.80 (CI: 0.73-0.87)], cT1 [AUC: 0.75 (CI: 0.67-0.84)],and CAP [AUC: 0.71 (CI: 0.62-0.80)] all had good discriminatory performance, while VCTE-LSM [AUC: 0.56 (CI: 0.45-0.66)], MRE [AUC: 0.57 (CI: 0.47-0.67)] and 2 D-SWE[AUC: 0.58 (CI: 0.45-0.71)] were not as effective (Figure 3). Multivariate analysis to explore the potential for increased diagnostic performance of biomarkers used in combination for identifying those with NASH, revealed the combination of cT1 and MR liver fat [AUC: 0.83 (CI: 0.76-0.90)] was superior to the individual markers and tothe combination of VCTE-LSM and CAP in combination [AUC: 0.71 (CI: 0.66-0.8)](Figure 4).

Table 1 Participant demographics

Fibrosis:To measure fibrosis alone, MRE [AUC: 0.97 (CI: 0.94-1.0)], VCTE-LSM [AUC:0.94 (CI: 0.9-0.99)] and 2 D-SWE [AUC: 0.94 (CI: 0.86-1.0)] were all excellent at discriminating for any fibrosis (≥ 1) whilst MR liver fat [AUC: 0.68 (CI: 0.44-0.92)], cT1[AUC: 0.63 (CI: 0.3-0.97)] and CAP [AUC: 0.6 (CI: 0.39-0.81)] were not as effective.These performances were maintained for discriminating those with more advanced fibrosis (≥ 2): MRE [AUC: 0.92 (CI: 0.87-0.97)]; VCTE-LSM [AUC: 0.88 (CI: 0.81-0.95)];2 D-SWE [AUC: 0.87 (CI: 0.76-0.99)] compared to cT1 [AUC: 0.62 (CI: 0.49-0.74)] MR liver fat [AUC: 0.60 (CI: 0.48-0.72)] and CAP [AUC: 0.57 (CI: 0.45-0.70)]. VCTE-LSM,and to an even greater extent 2 D-SWE, however both had high levels of missing data(20% and 42% respectively), likely to be related to elevated obesity.

Table 2 Range of histology scores reported across the entire cohort

Advanced NASH:To identify those with NASH and fibrosis (NAS ≥ 4 with F ≥ 2),both cT1 [AUC: 0.74 (CI: 0.66-0.82)] and MR liver fat [AUC: 0.71 (CI: 0.63-0.80)]outperformed the other measures; MRE [AUC 0.66 (CI: 0.57-0.75)], VCTE-LSM [AUC:0.64 (CI: 0.54-0.74)], CAP [AUC: 0.68 (CI: 0.59-0.78)], and 2 D-SWE [AUC: 0.62 (CI: 0.49-0.75)]. Combining cT1 and MR liver fat improved the performance [AUC: 0.76 (CI:0.69-0.84)] and was superior to the combination of VCTE-LSM and CAP [AUC: 0.70(CI: 0.61-0.79)], (Figure 4).

Correlations between image derived biomarkers and histological markers of disease

Figure 2 Example images from each technology. A: Corrected T1 map; B: Magnetic resonance liver fat map; C: Fibroscan controlled-attenuation parameter;D: Fibroscan Liver Stiffness; E: magnetic resonance elastography liver stiffness map; and F: 2 D Shear-wave elastography.

MRI cT1 correlated significantly with all key aspects of histology [fibrosis, steatosis,ballooning and lobular inflammation (rs= 0.24, rs= 0.29, rs= 0.39, rs= 0.31, respectively)]and with overall NAS (rs= 0.58). MR liver fat was positively correlated with steatosis(rs= 0.70), and with inflammation (rs= 0.28), ballooning (rs= 0.29) and overall NAS (rs=0.64) but was negatively correlated with fibrosis (rs= -0.25). MRE was significantly correlated with fibrosis (rs= 0.75) as well as ballooning and lobular inflammation (rs=0.32, rs= 0.16 respectively) and negatively correlated with steatosis (rs= -0.23). VCTELSM correlated significantly with fibrosis (rs= 0.69) and ballooning (rs= 0.29), and CAP with steatosis (rs= 0.39), inflammation (rs= 0.26), ballooning (rs= 0.33) and NAS (rs=0.49). 2 D shear-wave elastography was positively correlated with fibrosis (rs= 0.72)and ballooning (rs= 0.34; Table 3).

Concordance between pathologists

Assessment of the overall agreement (Krippendorff’s alpha) of the pathologists for each histological marker showed that there was moderate agreement for indicators of steatosis (a = 0.58) and NAS (a = 0.42), fair agreement for ballooning and fibrosis (a =0.40 for both), and none to slight agreement on lobular inflammation (a = 0.11).Assessment of the trivariate weighted kappa scores from individual pathologists for each metric showed no reoccurring pattern of agreement between any two pathologists (Table 4).

DISCUSSION

The diagnosis of NASH is important as it provides prognostic information indicating an increased risk of fibrosis progression and liver-related mortality but has hitherto been limited because of the need for histological verification. In clinical practice,distinguishing between NAFL, NASH, and NASH with fibrosis is highly desirable for risk stratification. Patients with steatosis can be educated about future cardiovascular risk, and lifestyle measures to prevent them, while those with NASH, or NASH andfibrosis, can be monitored and treated (pioglitazone, cevoglitazar) to reduce the risk of cardiovascular and liver clinical complications. In this study, we were able to identify and differentiate patients with steatosis, NASH, and NASH with fibrosis with good diagnostic performance using non-invasive technologies which characterise liver tissue accurately.

Table 3 Spearman's correlation coefficients for all variables

Table 4 Inter-rater variability between the three pathologists for each histology metric

The diagnosis of NASH is based, at present, on the histological presence of steatosis and either lobular inflammation or ballooning, with the presence of fibrosis highlighting disease progression. Currently, the only biomarkers shown to predict outcomes in patients are histological fibrosis and cT1[19,20,31]. However, cT1 is sensitive to ballooning, inflammation, steatosis as well as fibrosis, and so cannot be considered a pure fibrosis biomarker. When measuring just fibrosis, MRE had greater correlations to histology and greater sensitivity than other methods but could not by itself distinguish NAFL from NASH or NASH with fibrosis. Whilst ultrasound methods are also effective for staging advanced fibrosis, in line with reported literature[32-35], they can be less reliable in obese patients with higher BMI as observed in the patients in this cohort with missing VCTE and 2 D-SWE data. In contrast BMI has not been shown to systematically affect the failure rate for MRE[23]or of mpMRI[36,37], except for the very unusual circumstances when a participant is simply too large to fit into the scanner.All elastography measures however have been shown to be confounded by iron,steatosis and inflammation[38]in the measure of fibrosis.

Figure 3 Receiver operator characteristic curves for discriminating those with non-alcoholic steatohepatitis (NAS ≥ 4 and ballooning > 1 or inflammation > 1) from those without. Area under the curve and confidence intervals displayed in the legend for each non-invasive measure. MRE:Magnetic resonance elastography; CAP: Controlled-attenuation parameter; cT1: Corrected T1; LSM: Liver stiffness measure; PDFF: Proton density fat fraction.

Figure 4 Receiver operator characteristic curves for discriminating those with non-alcoholic steatohepatitis (left) and those with nonalcoholic steatohepatitis with fibrosis (right). Area under the curve and confidence intervals displayed in the legend for each non-invasive measure. CAP:Controlled-attenuation parameter; VCTE: Vibration-controlled transient elastography; cT1: Corrected T1; NASH: Non-alcoholic steatohepatitis.

The assessment of steatosis is important as the accumulation of liver fat is linked with the progression of hepatocyte injury that can ultimately result in fibrosis[39]; the downstream consequence of NASH which is linked to poor clinical outcomes[31]. In this study, MR liver fat demonstrated better performance than CAP for the detection and differentiation of steatosis grades. As a result, this makes MR liver fat a good marker of NAFL and NASH. However, where cT1 appears to have an advantage over MR liver fat as a non-invasive biomarker for NASH is in the detection and diagnosis of patients with both NASH and fibrosis. Both cT1, and uncorrected T1 have been reported to be elevated in those with advanced liver fibrosis[40-43]. A key limitation of MR liver fat as a solo biomarker of NASH is that patients with advanced fibrosis and cirrhosis have lower acclimations of liver fat than those at earlier stages of disease[44,45].Our results, in line with those reported in the literature, demonstrate a negative association between MR liver fat and advanced fibrosis[44,45]. This is an important consideration and is reflected in the superior diagnostic accuracy of cT1 when compared to MR liver fat for identifying those with NASH and fibrosis ≥ 2, with the composite marker of cT1 and MR liver fat showing the greatest diagnostic accuracy.

New diagnostic tools are evaluated by comparison to histological measures to evaluate their utility. Nevertheless, recent studies on the reproducibility of histology[46]have established that biopsy, as the de facto benchmark, is not perfect. We also observed discordance between different pathologists across the four cardinal pathological features (steatosis, ballooning, lobular inflammation and fibrosis), with no common pattern of concordance seen between any two pathologists. Previous literature discusses the subjective nature of histological grading systems[47]which is evident in these findings, demonstrating that subjectivity as well as potential sampling and human error can affect results[48-50]. This analysis highlights the need for more robust endpoints with which to evaluate the performance of non-invasive diagnostics.The gold standard would be a tool that predicts clinical outcomes and can thus provide prognosis. Whilst such studies are both time-consuming and costly in order to generate the necessary evidence, encouragingly, initial work in this space has demonstrated potential for cT1 to predict clinical outcomes in patients with chronic liver disease[19,20]. It is hoped that further validation of these observations in patients with NASH specifically will aid in the development of a clinical pathway that does not rely on invasive liver biopsy.

This study was not without its limitations, with the pre-screening step prior to liver biopsy likely truncating the correlations that would be observed across the full disease spectrum. While this may also impact the diagnostic accuracy evaluated, this prescreening approach is representative of clinical practice. Observed failure rates for the ultrasound-based methods also have the potential to skew the results given its dependence on BMI, with patients with high BMI less likely to be included in the study. In practice this has a bearing on the value proposition of such technologies for screening and monitoring as failed measurements will result in necessary further clinical tests for patients.

CONCLUSION

In summary, this study demonstrates the clinical utility of mpMRI for the stratification of NAFLD, and encourages mpMRI use as a non-invasive alternative to biopsy in the clinical care pathway. Quantitative mpMRI metrics showed the strongest overall correlations to the histological components of NASH with fewer technical failures.mpMRI also out-performed MRE and ultrasound-based elastography methods in the identification of patients with NASH and fibrosis. The ability to risk stratify patients in a single non-invasive test is a particular strength of mpMRI, offering a safe and costeffective alternative to liver biopsy.

ARTICLE HIGHLIGHTS

Research background

Non-alcoholic fatty liver disease affects 25% of the population worldwide and up to 30% in the Japanese population, and in some can progress to non-alcoholic steatohepatitis (NASH), a leading cause of liver transplant due to its strong propensity to develop into cirrhosis and hepatocellular carcinoma.

Research motivation

Liver biopsy is the current reference standard for a clinical diagnosis of NASH, a method that is expensive, invasive and suffers from great observer variability. Noninvasive and scalable alternatives are required in order to meet the burgeoning demands of the disease on clinical caseloads across the globe.

Research objectives

The main objectives of the study were to evaluate the diagnostic performance of noninvasive, image derived metrics to identify patients with suspected NASH. The metrics under investigation included two quantitative multi-parametric magnetic resonance imaging (MRI) measures, iron corrected T1 mapping [(cT1), a marker of fibro-inflammation] and proton density liver fat fraction (a marker of liver fat),magnetic resonance elastography and ultrasound based transient elastography(vibration-controlled transient elastography and 2 D shear-wave elastography), both markers of liver stiffness.

Research methods

In an observational study of patients who were being screened clinically on suspicion of NASH, n = 145 individuals underwent liver biopsy and concomitant imaging measures of liver health. Diagnosis of NASH was based on histology, graded using the NAS- Clinical Research Network scoring system and diagnostic accuracy of the imagederived metrics assessed using area under receiver operator characteristic curve. In addition, the biopsy slides were read by 2 further pathologists and comparisons made to explore the level of agreement on diagnosis between individual doctors.

Research results

In this study assessing the ability of different non-invasive biomarkers to detect NASH, MR liver fat and cT1 were superior to the other metrics investigated. Crucially however, the composite marker of cT1 and MR liver fat showed the greatest diagnostic accuracy for identifying those with NASH and also those with NASH with fibrosis.These measures also had very few technical failures. This is the first assessment and direct comparison of these technologies in a Japanese cohort. We also observed discordance between different pathologists across the four cardinal pathological features (steatosis, ballooning, lobular inflammation and fibrosis), with no common pattern of agreement seen between any two pathologists.

Research conclusions

These results demonstrate the clinical utility of quantitative multiparametric magnetic resonance imaging (mpMRI) for the identification and stratification of patients with NASH from those with evidence of NAFLD and encourages mpMRI use as a noninvasive alternative to biopsy in the clinical care pathway. Quantitative mpMRI metrics showed the strongest correlation to the histological components of NASH with fewer technical failures. mpMRI also out-performed magnetic resonance elastography and ultrasound-based elastography methods in the identification of patients with NASH and fibrosis. Liver biopsy suffered from high levels of inter-reading disagreement, highlighting the pressing need for alternative diagnostic tests for NASH.

Research perspectives

The ability to risk stratify patients in a single non-invasive test is a particular strength of mpMRI, offering a safe and cost-effective alternative to liver biopsy. Future work should be focused on validating these findings further and on longer term outcomes studies to investigate the prognostic natures of these measurements in a Japanese population.