Abstract
Objective
To evaluate whether computed tomography (CT) reconstruction algorithms affect the CT texture features of the liver parenchyma.
Materials and Methods
This retrospective study comprised 58 patients (normal liver, n = 34; chronic liver disease [CLD], n = 24) who underwent liver CT scans using a single CT scanner. All CT images were reconstructed using filtered back projection (FBP), hybrid iterative reconstruction (IR) (iDOSE4), and model-based IR (IMR). On arterial phase (AP) and portal venous phase (PVP) CT imaging, quantitative texture analysis of the liver parenchyma using a single-slice region of interest was performed at the level of the hepatic hilum using a filtration-histogram statistic-based method with different filter values. Texture features were compared among the three reconstruction methods and between normal livers and those from CLD patients. Additionally, we evaluated the inter- and intra-observer reliability of the CT texture analysis by calculating intraclass correlation coefficients (ICCs).
Results
IR techniques affect various CT texture features of the liver parenchyma. In particular, model-based IR frequently showed significant differences compared to FBP or hybrid IR on both AP and PVP CT imaging. Significant variation in entropy was observed between the three reconstruction algorithms on PVP imaging (p < 0.05). Comparison between normal livers and those from CLD patients revealed that AP images depend more strongly on the reconstruction method used than PVP images. For both inter- and intra-observer reliability, ICCs were acceptable (> 0.75) for CT imaging without filtration.
Interest in the quantification of spatial and temporal heterogeneity within solid malignancies on medical imaging has steadily grown over the past several years (12). In particular, computed tomography (CT) texture analysis (CTTA), initially developed in an attempt to provide an objective, quantitative assessment of tumor heterogeneity by analyzing the distribution of pixel values on CT images, has increasingly garnered attention as a potential quantitative biomarker of tissue heterogeneity (345). Previous studies have demonstrated that CTTA parameters are significantly associated with histopathologic features and clinical outcomes in a variety of primary and metastatic tumors (46) and that it has utility in various non-oncologic applications, including hepatic fibrosis (478). However, the majority of CTTA studies performed thus far were retrospective in manner, using CT images collected over years (9101112); therefore, various CT scanners using various scanning techniques and reconstruction algorithms were jointly analyzed. A potential problem related to the inconsistent use of CT scanners or reconstruction algorithms is that it may introduce statistical artifacts into the images, thus failing to identify true tissue heterogeneity (131415). Therefore, to establish a clinical level of credibility for CTTA, it is crucial to investigate the impact of different reconstruction methods and scanning techniques on CT images.
Over the past several years, as the number of CT examinations has increased globally, concern over radiation exposure has risen; therefore, various new techniques have been developed to reduce the radiation dose (16). One of such dose reduction approach is the use of iterative reconstruction (IR) algorithms. Model-based, the full IR method has specifically demonstrated a higher degree of noise reduction, enabling significant radiation dose reduction, as it combines many more iterations using complex mathematics than hybrid IR methods (17). However, as model-based IR is based upon a different statistical metric, it is well known to radiologists that the resultant reconstructed images have a different morphology, presenting a mildly blurred and pixelated texture, compared to those reconstructed with either filtered back projection (FBP) or the hybrid IR (181920). Several previous studies have further suggested that such image blurring may affect their quantitative measurement on CT images (122122). Other studies have also shown that these reconstruction algorithms could affect both the extraction and analysis of quantitative imaging features of focal liver lesions, lung nodules, and renal stones (1221). Nonetheless, the impact of reconstruction methods on imaging heterogeneity and texture analysis, especially in the liver parenchyma, has not yet been elucidated.
Therefore, in the present study, we aimed to evaluate the impact of reconstruction methods on the CTTA features of the liver parenchyma by comparing FBP, hybrid IR (iDOSE4), and model-based IR (IMR) methods, as well as the intra- and inter-observer reliability of CTTA, to assess its clinical usefulness.
This retrospective study was approved by the Institutional Review Board of our hospital. Because all clinical data and CT images were obtained retrospectively from medical records, the requirement for informed consent was waived.
A search of our picture archiving and communication system database revealed that 649 patients had undergone liver CT using a single CT scanner at our hospital from Oct. 2016 to Mar. 2017. The inclusion criteria of this study were: (a) patients who had available CT images reconstructed with FBP, hybrid, and model-based IR techniques; and (b) those who had evaluable hepatic parenchyma. Those who met any of the following criteria were then excluded from the study population: (a) history of any non-surgical treatment for hepatocellular carcinoma (HCC) (n = 398), (b) history of surgical treatment of the liver within 6 months prior to the scanning date (n = 9), and (c) diffuse/infiltrative HCC or multiple (> 10) focal liver lesions (n = 15). Among the 227 remaining patients, 191 were clinically diagnosed with chronic liver disease (CLD), 24 of whom had a pathologic diagnosis of CLD, including hepatitis B-related liver cirrhosis (LC) (n = 18), cryptogenic LC (n = 2), hepatitis C-related LC (n = 1), alcoholic LC (n = 1), primary biliary cholangitis (n = 1), and severe fatty liver disease (n = 1). Of the others, 34 patients with no medical record of CLD and with a radiologic interpretation compatible with a normal liver were included and classified as the normal liver group; 76% of them had focal hepatic lesions such as hemangioma or cyst (n = 26). The other patients had a previous history of early stage cancer and had undergone follow-up CT imaging (n = 8).
Finally, 58 patients (men:women = 25:33; mean age, 56.88 ± 11.84 years) were included in our study population. There were 16 men and eight women in the CLD group (mean age, 57.33 ± 8.82 years), and nine men and 25 women in the normal liver group (mean age, 56.56 ± 13.69 years).
All CT examinations were performed on a 256-slice MDCT machine (Brilliance-iCT; Philips Healthcare, Cleveland, OH, USA) at our hospital. The liver CT protocol was composed of four phases: pre-contrast, arterial phase (AP), portal venous phase (PVP), and delayed phase. CT scanning was performed using the following parameters; tube voltage, 100 kVp; tube current-time products, 280 mAs; detector collimation, 128 × 0.625 mm; rotation time, 0.5 seconds; pitch, 1.0; slice thickness, 3 mm. A nonionic contrast material, iobitridol (Xenetix 350; Guerbet Korea, Seoul, Korea) was injected at a dose of 520 mg/kg of body weight using a power injector (Stellant D; Medrad, Warrendale, PA, USA) for 30 seconds at a rate of 2.1–4.3 mL/s according to body weight. Timing for the PVP scan was determined using the bolus tracking technique; that is, AP imaging was automatically performed 17 seconds after the attenuation coefficient of the abdominal aortic blood reached 100 Hounsfield units. PVP images were acquired 70 seconds after the start of contrast media administration.
The obtained CT images were automatically reconstructed using three different methods including FBP, hybrid IR (iDOSE4), and model-based IR (IMR). For hybrid IR, the standard level 4 setting (50%/50% blend of IR/FBP) of iDOSE4 was applied, and for model-based IR, IMR at level 1, the low level of noise reduction setting, was used. The choice of IMR level 1 was based on a previous investigation (19).
The single slice axial image at which portal vein bifurcation was best visualized was selected from AP and PVP images by one radiologist (with two years of clinical experience), and reconstructed using three different reconstruction methods. Thereafter, the first reviewer (with one year of clinical experience) performed CTTA using the selected images by delineating a region of interest (ROI) in the right lobe of the liver and by running commercially available software, applying the filtration-histogram statistic-based method offered by TexRAD. Specifically, a circular ROI was drawn with the diameter as long as the horizontal axis of the vertebral body shown on the sliced image with an effort to avoid large vessels or ductal structures. The ROIs were re-drawn for each reconstructed image. An example of the filtration-histogram texture analysis process performed by TexRAD is depicted in Figure 1.
To assess inter- and intra-observer reliability, the CTTA procedure was repeated using the selected images reconstructed with the FBP method. For the inter-observer reliability test, another attending radiologist (with four years of clinical experience) performed CTTA independently in a blinded manner. For the intra-observer reliability test, the first reviewer repeated the CTTA procedure 1 month after the initial analysis.
For CTTA, spatial filters, which evaluate the ROI at a different scale with object radii of different size, denoted by spatial scaling factors (SSF) of 0 to 6, were applied. SSF 0 indicates no filtration, SSF 2, 4, and 6 indicate a 2-mm, 4-mm, and 6-mm radius which represent fine, medium, and coarse filters, respectively. All filtered images were obtained using the Laplacian of the Gaussian spatial band-pass filter. Histograms of the pixel values were quantified using standard descriptors such as the mean attenuation, standard deviation (SD), entropy, skewness, and kurtosis. The mean indicates the average of pixels within the ROI. SD was the measure of how much variation or dispersion exists from the mean value (6). Entropy is a parameter that reflects the irregularity or complexity of the pixel intensities (45), skewness is a measure of the asymmetry of the histogram, reflecting the average brightness of highlighted objects, and kurtosis is a measure of the peakedness of the histogram (6).
For comparisons between the three reconstruction methods, repeated-measures analysis of variance with Bonferroni's correction was used, and the independent t test was performed to evaluate the difference between the normal liver group and CLD group. To assess the inter-observer and intra-observer reliability of CTTA, the intraclass correlation coefficient (ICC), defined as the proportion of the total error not associated with measurement error, was calculated. ICCs of < 0.40, 0.40–0.75, and 0.76–1.00 signified poor, moderate, and excellent agreement, respectively (23). For all studies, a p value < 0.05 was considered to indicate a statistically significant difference. All statistical analyses were performed using commercially available software (MedCalc version 17.4; MedCalc Software, Ostend, Belgium; and IBM SPSS, version 21; IBM Corp., Armonk, NY, USA).
The CT texture parameters of the liver parenchyma according to reconstruction methods and filter values are summarized in Tables 1 (AP) and 2 (PVP). IR techniques were shown to affect various CT texture features of the liver parenchyma in the same individuals across different filters. SD and entropy were found to be highest for FBP, followed by hybrid IR and model-based IR, while skewness and kurtosis were observed to be highest for model-based IR, followed by hybrid IR and FBP. This tendency was found for both AP and PVP images regardless of the filter values used and at statistically significant levels in most cases. Model-based IR showed significant differences from FBP and hybrid IR for all measured features without filtration (p < 0.0001) for both phases. Additionally on PVP, model-based IR was significantly different from FBP and hybrid IR in terms of SD, entropy, skewness, and kurtosis when fine and medium filters (SSF = 2, 4) were used.
For SD, statistically significant differences between the three reconstruction methods were more frequently observed on AP images. Entropy was demonstrated to be significantly different between the three reconstruction methods on PVP images regardless of the filter values. The difference in skewness and kurtosis between the three reconstruction methods was significant at filter values of 0 and 2 for both phases. Overall, the differences between the reconstruction techniques showed a tendency to become less frequently significant as filter values increased. As an example, the difference in skewness between FBP and hybrid IR was statistically significant when the filter was 0 and 2, while it was non-significant with medium and coarse filters (SSF = 4, 6) on AP (p < 0.0001 vs. 0.1919, 0.6659) and PVP images (p < 0.0001 vs. 0.2381, 1).
Differences in CT texture parameters between normal livers and CLD are summarized in Tables 3 (AP) and 4 (PVP). Between the two groups, there were more frequent significant differences for AP than PVP images. Among the reconstruction methods, model-based IR showed the least amount of significant differences for both phases. In the PVP images reconstructed with model-based IR, there was no statistically significant difference between the normal liver and CLD groups, except for entropy without filtration (3.72 vs. 3.83, p = 0.014). In addition, the values of texture features showed a tendency to be greater in the CLD group than in the normal liver group for both phases. When the difference between the two groups was statistically significant, the mean value of the texture feature was always greater in the CLD group. As an example, the SD and entropy of AP images reconstructed with FBP were significantly different between the normal liver and CLD groups, with a greater value in the CLD group (p < 0.05) (Fig. 2).
Regarding both inter- and intra-observer reliability, ICCs were excellent without filtration (SSF = 0) for both phases. The ICCs of entropy were also always excellent (> 0.75) regardless of the filter values used. Inter-observer ICCs were also excellent for mean and SD, regardless of the filter used. Furthermore, ICCs showed a tendency to become smaller as the filter became coarser, both for inter- and intra-observer reliability (Table 5).
Our study demonstrated that the CTTA features of the liver parenchyma were significantly affected by the reconstruction algorithms used, with model-based IR more frequently showing significant differences in CT texture features compared to the results of FBP or hybrid IR for both phases. This result is in good agreement with previous studies that have investigated the impact of reconstruction methods on CTTA features, focusing on specific lesions (1221). Our results may be explained by the potential alteration of CT features that can occur during the process of noise reduction using hybrid IR and model-based IR techniques compared with FBP. The hybrid IR algorithm first analyzes the projection data as FBP, then applies a model, including photon statistics, to detect very noisy measurements. Thereafter, through an iterative process, the noisy data are penalized while edges are preserved, thereby spatial resolution is preserved with substantial noise reduction (24). However, these extra processes may have also led to differences in texture features between FBP and hybrid IR. Moreover, model-based IR, which does not use FBP data to create the final results (20) but is rather composed of its own unique noise spectrum and tissue depiction pattern (24), showed an even greater significant difference from the other two reconstruction algorithms.
More specifically, the three reconstruction methods examined in our study showed a significant difference in entropy in decreasing order of FBP, hybrid IR, and model-based IR, while for skewness and kurtosis, it was found to be in the opposite order. As IR algorithms directly reduce noise and artifacts resulting from irregularities such as photon starvation, beam hardening, and nonlinearity of individual detector elements (25), it is not surprising that entropy would decrease when an IR algorithm is applied (21). It is also notable that model-based IR exhibited lower entropy than hybrid IR. To the contrary, skewness and kurtosis demonstrated the highest values in the model-based IR image. While reducing noise, model-based IR also acts to smoothen edges; this smudging feature may lead to lesser pixel variability that is reflected in the higher skewness and kurtosis values. In addition to the impact of the reconstruction algorithm itself, the spatial scaling factor was also shown to be a factor influencing CT texture parameters. Our results showed that as the filter became coarser, the significant differences between the three reconstruction methods were less frequent. Considering that the fine filter, which highlights smaller objects, will pick up subtle changes derived from reconstruction algorithms more sensitively than a coarse filter, this result is as expected (26).
We also found that the normal liver group and CLD group demonstrated significant differences in CT texture features more frequently on AP than PVP images. When the difference between the two groups was statistically significant, the mean value of the texture parameter was always higher in the CLD group. These results are as expected considering that CLD has a histologically more heterogeneous parenchyma. As the parenchyma of the liver in CLD has a larger fibrotic component with increased arterial fractions than normal liver parenchyma, the difference between the two groups would be more emphasized on AP images (272829). Furthermore, model-based IR found statistically significant differences between the two groups less frequently than the other methods. We can interpret this result as stemming from the lower capability of model-based IR to reflect subtle heterogeneity in the liver parenchyma in comparison with FBP or hybrid IR, owing to loss of minor image details during its powerful noise reduction (30).
Another notable finding was that the inter- and intra-observer reliability of CTTA was acceptable across all of the various texture parameters. However, ICC tended to become less reliable as the filter became coarser for both inter- and intra-observer reliability. We surmise that this result may be derived from the statistical reason that a higher filter value has a lower number of highlighted units to be used during analysis than a smaller filter value.
Our study has several limitations. First, this is a retrospective study with a limited sample size of 58 patients; therefore, the potential for selection bias cannot be excluded. Second, the CLD group consisted of only 24 patients, and the range of their disease progression status was not evaluated. However, considering that the main focus of our study was on the change and variability of CT texture features according to the use of different reconstruction algorithms, the heterogeneity of our study population should not have affected our main conclusions. Third, our study evaluated the influence of reconstruction algorithms only on first order histogram parameters. As a previous study on PET/CT demonstrated that the standardization of feature extraction methodology is important for second and higher order texture features (31), further analysis of the impact of reconstruction methods on higher order parameters is warranted. Fourth, our study was designed using a single CT machine at a single institution. Thus, investigation of various IR algorithms offered by diverse CT manufacturers with various CT machines for CTTA is warranted in the future.
In conclusion, CT texture features of the liver parenchyma using the filtration-histogram method were significantly affected by the CT reconstruction algorithm used across various filters. Therefore, it is important for CTTA researchers to evaluate the dependency of potential imaging biomarkers on imaging acquisition and reconstruction parameters, as the quality and repeatability of CTTA studies may depend strongly on the consistency of image acquisition and reconstruction method (14). Using unified imaging parameters for CCTA would be an easy and simple solution, but the standardization of feature extraction methodology for CTTA via feature normalization methods, such as voxel size and gray level normalization, could be an alternative approach (32) for more reliable inter-subject comparison and intra-subject longitudinal monitoring.
References
1. Grootjans W, Tixier F, van der, Vriens D, Le Rest CC, Bussink J, et al. The impact of optimal respiratory gating and image noise on evaluation of intratumor heterogeneity on 18F-FDG PET imaging of lung cancer. J Nucl Med. 2016; 57:1692–1698. PMID: 27283931.
2. Yun BL, Cho N, Li M, Jang MH, Park SY, Kang HC, et al. Intratumoral heterogeneity of breast cancer xenograft models: texture analysis of diffusion-weighted MR imaging. Korean J Radiol. 2014; 15:591–604. PMID: 25246820.
3. Bashir U, Siddique MM, Mclean E, Goh V, Cook GJ. Imaging heterogeneity in lung cancer: techniques, applications, and challenges. AJR Am J Roentgenol. 2016; 207:534–543. PMID: 27305342.
4. Lubner MG, Smith AD, Sandrasegaran K, Sahani DV, Pickhardt PJ. CT texture analysis: definitions, applications, biologic correlates, and challenges. Radiographics. 2017; 37:1483–1503. PMID: 28898189.
5. Ganeshan B, Miles KA. Quantifying tumour heterogeneity with CT. Cancer Imaging. 2013; 13:140–149. PMID: 23545171.
6. Miles KA, Ganeshan B, Hayball MP. CT texture analysis using the filtration-histogram method: what do the measurements mean? Cancer Imaging. 2013; 13:400–406. PMID: 24061266.
7. Daginawala N, Li B, Buch K, Yu H, Tischler B, Qureshi MM, et al. Using texture analyses of contrast enhanced CT to assess hepatic fibrosis. Eur J Radiol. 2016; 85:511–517. PMID: 26860661.
8. Lubner MG, Malecki K, Kloke J, Ganeshan B, Pickhardt PJ. Texture analysis of the liver at MDCT for assessing hepatic fibrosis. Abdom Radiol (NY). 2017; 42:2069–2207. PMID: 28314916.
9. Ganeshan B, Abaleke S, Young RC, Chatwin CR, Miles KA. Texture analysis of non-small cell lung cancer on unenhanced computed tomography: initial evidence for a relationship with tumour glucose metabolism and stage. Cancer Imaging. 2010; 10:137–143. PMID: 20605762.
10. Chae HD, Park CM, Park SJ, Lee SM, Kim KG, Goo JM. Computerized texture analysis of persistent part-solid ground-glass nodules: differentiation of preinvasive lesions from invasive pulmonary adenocarcinomas. Radiology. 2014; 273:285–293. PMID: 25102296.
11. Miles KA, Ganeshan B, Griffiths MR, Young RC, Chatwin CR. Colorectal cancer: texture analysis of portal phase hepatic CT images as a potential marker of survival. Radiology. 2009; 250:444–452. PMID: 19164695.
12. Solomon J, Mileto A, Nelson RC, Roy Choudhury K, Samei E. Quantitative features of liver lesions, lung nodules, and renal stones at multi-detector row CT examinations: dependency on radiation dose and reconstruction algorithm. Radiology. 2016; 279:185–194. PMID: 26624973.
13. Yasaka K, Akai H, Mackin D, Court L, Moros E, Ohtomo K, et al. Precision of quantitative computed tomography texture analysis using image filtering: a phantom study for scanner variability. Medicine (Baltimore). 2017; 96:e6993. PMID: 28538408.
14. Mackin D, Fave X, Zhang L, Fried D, Yang J, Taylor B, et al. Measuring computed tomography scanner variability of radiomics features. Invest Radiol. 2015; 50:757–765. PMID: 26115366.
15. Zhao B, Tan Y, Tsai WY, Qi J, Xie C, Lu L, et al. Reproducibility of radiomics for deciphering tumor phenotype with imaging. Sci Rep. 2016; 6:23428. PMID: 27009765.
16. Berrington de González A, Mahesh M, Kim KP, Bhargavan M, Lewis R, Mettler F, et al. Projected cancer risks from computed tomographic scans performed in the United States in 2007. Arch Intern Med. 2009; 169:2071–2077. PMID: 20008689.
17. Liu L. Model-based iterative reconstruction: a promising algorithm for today’s computed tomography imaging. J Med Imaging Radiat Sci. 2014; 45:131–136.
18. Yu MH, Lee JM, Yoon JH, Baek JH, Han JK, Choi BI, et al. Low tube voltage intermediate tube current liver MDCT: sinogram-affirmed iterative reconstruction algorithm for detection of hypervascular hepatocellular carcinoma. AJR Am J Roentgenol. 2013; 201:23–32. PMID: 23789655.
19. Yoon JH, Lee JM, Yu MH, Baek JH, Jeon JH, Hur BY, et al. Comparison of iterative model-based reconstruction versus conventional filtered back projection and hybrid iterative reconstruction techniques: lesion conspicuity and influence of body size in anthropomorphic liver phantoms. J Comput Assist Tomogr. 2014; 38:859–868. PMID: 25321625.
20. Chang W, Lee JM, Lee K, Yoon JH, Yu MH, Han JK, et al. Assessment of a model-based, iterative reconstruction algorithm (MBIR) regarding image quality and dose reduction in liver computed tomography. Invest Radiol. 2013; 48:598–606. PMID: 23511193.
21. Kim H, Park CM, Lee M, Park SJ, Song YS, Lee JH, et al. Impact of reconstruction algorithms on CT radiomic features of pulmonary tumors: analysis of intra- and inter-reader variability and inter-reconstruction algorithm variability. PLoS One. 2016; 11:e0164924. PMID: 27741289.
22. Barrett HH, Myers KJ, Hoeschen C, Kupinski MA, Little MP. Task-based measures of image quality and their relation to radiation dose and patient risk. Phys Med Biol. 2015; 60:R1–R75. PMID: 25564960.
23. Shin JM, Kim TH, Haam S, Han K, Byun MK, Chang YS, et al. The repeatability of computed tomography lung volume measurements: comparisons in healthy subjects, patients with obstructive lung disease, and patients with restrictive lung disease. PLoS One. 2017; 12:e0182849. PMID: 28796825.
24. Geyer LL, Schoepf UJ, Meinel FG, Nance JW Jr, Bastarrika G, Leipsic JA, et al. State of the Art: Iterative CT Reconstruction Techniques. Radiology. 2015; 276:339–357. PMID: 26203706.
25. Willemink MJ, de Jong PA, Leiner T, de Heer LM, Nievelstein RA, Budde RP, et al. Iterative reconstruction techniques for computed tomography Part 1: technical principles. Eur Radiol. 2013; 23:1623–1631. PMID: 23314600.
26. Ng F, Ganeshan B, Kozarski R, Miles KA, Goh V. Assessment of primary colorectal cancer heterogeneity by using whole-tumor texture analysis: contrast-enhanced CT texture as a biomarker of 5-year survival. Radiology. 2013; 266:177–184. PMID: 23151829.
28. Lautt WW. Mechanism and role of intrinsic regulation of hepatic arterial blood flow: hepatic arterial buffer response. Am J Physiol. 1985; 249(5 Pt 1):G549–G556. PMID: 3904482.
29. Yoon JH, Lee JM, Klotz E, Jeon JH, Lee KB, Han JK, et al. Estimation of hepatic extracellular volume fraction using multiphasic liver computed tomography for hepatic fibrosis grading. Invest Radiol. 2015; 50:290–296. PMID: 25493416.
30. Park SB, Kim YS, Lee JB, Park HJ. Knowledge-based iterative model reconstruction (IMR) algorithm in ultralow-dose CT for evaluation of urolithiasis: evaluation of radiation dose reduction, image quality, and diagnostic performance. Abdom Imaging. 2015; 40:3137–3146. PMID: 26197735.
31. Leijenaar RT, Nalbantov G, Carvalho S, van Elmpt WJ, Troost EG, Boellaard R, et al. The effect of SUV discretization in quantitative FDG-PET Radiomics: the need for standardized methodology in tumor texture analysis. Sci Rep. 2015; 5:11075. PMID: 26242464.
32. Shafiq-Ul-Hassan M, Latifi K, Zhang G, Ullah G, Gillies R, Moros E. Voxel size and gray level normalization of CT radiomic features in lung cancer. Sci Rep. 2018; 8:10545. PMID: 30002441.