Year: 2020 | Volume: 24 | Issue: 6 | Page: 537-542
Comparison of Multimodal Ultrasound Imaging with Conventional Ultrasound Risk Stratification Systems in Presurgical Risk Stratification of Thyroid Nodules
M Shreyamsa1, Anand Mishra1, Pooja Ramakant1, Anit Parihar2, Kul R Singh1, Chanchal Rana3, Sasi Mouli1
1 Department of Endocrine Surgery, King George's Medical University, Lucknow, Uttar Pradesh, India
2 Department of Radiodiagnosis, King George's Medical University, Lucknow, Uttar Pradesh, India
3 Department of Pathology, King George's Medical University, Lucknow, Uttar Pradesh, India
Date of Submission: 08-Oct-2020
Date of Acceptance: 10-Dec-2020
Date of Web Publication: 12-Jan-2021
Kul R Singh
Department of Endocrine Surgery, Shatabdi Phase 2 Building, 7th Floor, King George's Medical University, Shah Mina Road, Lucknow, Uttar Pradesh
Source of Support: None, Conflict of Interest: None
Abstract
Background: Ultrasonography (US) is an indispensable tool in the management of thyroid nodules, not only for assessing tumor characteristics but also for assigning risk of malignancy and guiding management. Various guidelines and US-based risk stratification systems have been proposed for this purpose. This study aims to compare the diagnostic performance of multimodal US-based risk scores (French TIRADS, TMC-RSS) with that of conventional US-based scoring systems (Korean TIRADS, ACR-TIRADS, ATA risk stratification).
Material and Methods: A total of 168 nodules from 139 patients were studied and categorized in each of the risk stratification systems. Sensitivity, specificity, positive and negative predictive values, and accuracy of each system were computed. ROC curves were plotted and the area under the curve (AUC) for each scoring system was noted.
Results: Thirty-five (21%) of the 168 nodules were malignant on final histopathological examination. TMC-RSS fared the best in predicting malignant nodules with a sensitivity of 96.2% and specificity of 88.6%, while the PPV and NPV were 97% and 86.1%, respectively. The AUC for TMC-RSS was 0.924 (95% CI, 0.860–0.988; P < 0.001).
Conclusion: Multimodal US-based risk stratification systems like the TMC-RSS, which incorporate non-grayscale characteristics in addition to conventional features, improve the diagnostic performance of ultrasound imaging of thyroid nodules.
Keywords: Multimodal imaging, thyroid, ultrasound, ultrasound risk-stratification
How to cite this article:
Shreyamsa M, Mishra A, Ramakant P, Parihar A, Singh KR, Rana C, Mouli S. Comparison of Multimodal Ultrasound Imaging with Conventional Ultrasound Risk Stratification Systems in Presurgical Risk Stratification of Thyroid Nodules. Indian J Endocr Metab 2020;24:537-42
How to cite this URL:
Shreyamsa M, Mishra A, Ramakant P, Parihar A, Singh KR, Rana C, Mouli S. Comparison of Multimodal Ultrasound Imaging with Conventional Ultrasound Risk Stratification Systems in Presurgical Risk Stratification of Thyroid Nodules. Indian J Endocr Metab [serial online] 2020 [cited 2021 Apr 10];24:537-42. Available from: https://www.ijem.in/text.asp?2020/24/6/537/306756
Introduction
The prevalence of thyroid nodules has increased over the past few decades, mostly due to advances in imaging techniques. The reported prevalence ranges from 2% in iodine-sufficient areas to up to 45% in iodine-deficient areas. High-resolution ultrasound (US) can detect thyroid nodules in 19–68% of random individuals, with a higher frequency in women and the elderly. It is important to exclude malignancy, which is seen in 7–15% of these nodules. In India, thyroid malignancies account for 1.8% of all cancers, with about 18,600 cases diagnosed every year. Mortality, however, is very low, with thyroid cancer responsible for 0.4–0.5% of all cancer-related deaths. This mortality rate has remained rather stable in spite of the increase in the incidence of thyroid cancers, attributable to improvements in diagnostics and a possible change in risk factors.
US has become an indispensable tool in the management of thyroid nodules, not only for assessing the tumor characteristics but also for assigning risk of malignancy and formulating management strategies. Various guidelines and US-based risk stratification systems have been proposed to guide surgeons toward optimal management of thyroid nodules. The first such classification system, the Thyroid Imaging Reporting and Data System (TIRADS), was proposed in 2009 by Horvath et al. and was based on an already established risk classification system for breast lumps. Many other versions of the TIRADS have been proposed since then, like the KWAK-TIRADS, Korean TIRADS, ACR-TIRADS, and so on. Professional academic bodies like the American Thyroid Association (ATA) and the British Thyroid Association have also devised risk stratification systems based on US findings. These systems utilize different US parameters and hence differ in their diagnostic performance. Many US characteristics (like vascularity, tissue elasticity, etc.) are not featured in the TIRADS but are known to improve the diagnostic capabilities of US when incorporated. Multimodal scoring systems including these additional features, like the French TIRADS and TMC-RSS, were proposed to improve diagnostic accuracy. Many studies comparing conventional US-based risk scores have been performed, but there is a dearth of studies comparing multimodal risk scores with the conventional scoring systems. The purpose of this study was to compare the diagnostic performance of conventional versions of TIRADS, the ATA risk stratification system, and multimodality scoring systems in identifying malignant thyroid nodules.
Material and Methods
This cross-sectional, observational study was performed in the Departments of Endocrine Surgery and Radiodiagnosis at the King George's Medical University, Lucknow, India. Patients with thyroid nodules who satisfied the inclusion criteria were recruited in the study after obtaining written consent. Clearance from the institutional ethical committee was obtained on 20/02/2019. A total of 161 patients were approached and, after exclusion, 168 nodules from 139 patients were studied from March 2018 to October 2019. A dedicated radiologist performed the US evaluation and findings were recorded on a predesigned proforma. Before starting the study, training sessions were held to establish a baseline consensus on the performance, evaluation, and interpretation of US. An optimal US image was defined as one acquired while the patient held his or her breath, without any motion artifacts.
Ultrasonography was performed using an LA533 apple probe linear array transducer (Esaote) of 12 MHz frequency. Adequacy of external compression was assessed via the quality indicator, and a compression of more than 50% on the scale was considered optimum. Elastograms were obtained from the transverse plane by manually setting the region of interest within the lesion. Both colorimetric elastograms (Asteria classification) and strain ratio were obtained. Color Doppler was used to assess vascularity, and the flow pattern was noted. The histopathological examination (HPE) reports were obtained postoperatively.
Ultrasound findings were analyzed for baseline parameters. Nodule characteristics were studied and categorized in each of the following scoring systems:
- Korean TIRADS: proposed by the Korean Society of Thyroid Radiology.
- ACR-TIRADS: proposed by the American College of Radiology.
- ATA risk stratification.
- French TIRADS: proposed by the French Society of Thyroidology.
- Thyroid Multimodal Imaging Comprehensive Risk Stratification System (TMC-RSS): proposed by Tata Memorial Hospital, Mumbai.
The US features considered in each scoring system are shown in [Table 1].
Inclusion and exclusion criteria
All nodules measuring 4 cm or less were included in the study. Patients with diffuse thyroid enlargement, autoimmune and inflammatory disorders, and those patients not willing to participate in the study were excluded.
Analysis of data
Data were analyzed and reported as the mean ± SD for continuous variables and frequency (percentage) for categorical variables. The P values were calculated by the t-test or Mann–Whitney U test for continuous variables and the Chi-square test or Fisher exact test for categorical variables. Multivariate logistic regression was performed to test the association of different parameters. Significance was set at P value equal to or less than 0.05. All statistical tests were performed using SPSS software (version 23).
Results
Among the 139 patients, 115 (82.8%) were females and 24 (17.2%) were males. Mean age of the patients was 35.3 ± 13.2 years (range 9–70 years). Thirty-five (21%) of the 168 nodules were malignant on final HPE. Mean tumor size was 2.93 ± 0.67 cm for benign nodules, while for malignant nodules it was 3.1 ± 0.78 cm. Cytological and histological characteristics of nodules are shown in [Table 2].
For analysis of data and risk assignment, three groups were formed. The low-risk group comprised TIRADS 1–3 of the Korean and ACR systems, 4A of the French system, the benign through low-suspicion subcategories of ATA, and category 1 of TMC-RSS. TIRADS 4 of the Korean and ACR systems, TIRADS 4B of the French system, the intermediate-suspicion category of ATA, and category 2 of TMC-RSS were grouped as intermediate risk for malignancy, while the remainder were classified as high risk for malignancy. The risk stratifications of the nodules, according to the different scoring systems, are presented in [Table 3].
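The three-group assignment described above can be sketched as a simple lookup. This is only an illustration: the category labels, the system names used as dictionary keys, and the helper `risk_group` are hypothetical and not taken from the study tables, and treating French categories below 4A as low risk is an assumption that the text does not state explicitly.

```python
LOW, INTERMEDIATE, HIGH = "low", "intermediate", "high"

# Category labels are illustrative spellings, not copied from the study tables.
_GROUPS = {
    "K-TIRADS": {LOW: {"1", "2", "3"}, INTERMEDIATE: {"4"}},
    "ACR-TIRADS": {LOW: {"1", "2", "3"}, INTERMEDIATE: {"4"}},
    # Assumption: French categories below 4A are also treated as low risk.
    "F-TIRADS": {LOW: {"1", "2", "3", "4A"}, INTERMEDIATE: {"4B"}},
    "ATA": {LOW: {"benign", "very low", "low"}, INTERMEDIATE: {"intermediate"}},
    "TMC-RSS": {LOW: {"1"}, INTERMEDIATE: {"2"}},
}


def risk_group(system: str, category: str) -> str:
    """Map a system-specific category to one of the three analysis groups."""
    groups = _GROUPS[system]
    if category in groups[LOW]:
        return LOW
    if category in groups[INTERMEDIATE]:
        return INTERMEDIATE
    # Everything else falls into the high-risk group ("the remainder").
    return HIGH


if __name__ == "__main__":
    for system, category in [("ACR-TIRADS", "5"), ("F-TIRADS", "4B"), ("ATA", "low")]:
        print(system, category, "->", risk_group(system, category))
```

Keeping the mapping in one table makes it easy to apply the same downstream analysis to every scoring system.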
Risk reassignment from conventional TIRADS and ATA to multimodal systems is shown in [Table 4]. Notable risk reassignment was observed from conventional TIRADS to TMC-RSS. The most significant reassignment was seen from the intermediate-risk category, where 12 nodules were downgraded to low risk, while another 12 nodules were upgraded to high risk, reducing the number of nodules in the intermediate category.
The diagnostic performances of all the US-based scoring systems in differentiating benign and malignant nodules were analyzed. Sensitivity, specificity, positive and negative predictive values, and accuracy of each system were computed. For this purpose, a low-risk score was considered a predictor of benignity, while a high-risk score was considered a predictor of malignancy. As the intermediate-risk score is an area of uncertainty, two separate analyses were performed, one considering the intermediate score as benign and the other as an indicator of malignancy. [Table 5] shows the overall performance for both considerations. Sensitivity increased but specificity decreased when intermediate risk was considered as an indicator of malignancy. For further analyses, the intermediate-risk group was considered as an indicator of malignancy. [Figure 1] shows the positive predictive value (PPV), negative predictive value (NPV), and accuracy of all scoring systems. TMC-RSS performed better than the other scoring systems.
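The two analyses described above can be sketched with the standard 2×2 definitions of the five measures. The function name `diagnostic_metrics`, the group labels, and the toy data are hypothetical and are not study data; the study itself reports using SPSS, so this is a sketch of the calculation rather than the authors' procedure.

```python
from collections import Counter


def diagnostic_metrics(groups, malignant, intermediate_positive=True):
    """Compute sensitivity, specificity, PPV, NPV and accuracy from risk groups.

    groups: iterable of "low" / "intermediate" / "high" labels per nodule.
    malignant: iterable of truthy values (final histopathology) per nodule.
    intermediate_positive: whether "intermediate" counts as a positive test.
    """
    positive = {"high", "intermediate"} if intermediate_positive else {"high"}
    counts = Counter((g in positive, bool(m)) for g, m in zip(groups, malignant))
    tp, fp = counts[(True, True)], counts[(True, False)]
    fn, tn = counts[(False, True)], counts[(False, False)]

    def ratio(num, den):
        return num / den if den else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
        "accuracy": ratio(tp + tn, tp + fp + fn + tn),
    }


# Toy data for illustration only (not from the study).
toy_groups = ["high", "high", "intermediate", "intermediate",
              "low", "low", "low", "high"]
toy_malignant = [1, 1, 1, 0, 0, 0, 1, 0]

if __name__ == "__main__":
    for flag in (True, False):
        print("intermediate counted as malignant:", flag)
        print(diagnostic_metrics(toy_groups, toy_malignant, flag))
```

Running both settings on the same nodules shows how the handling of the intermediate group alone shifts every measure.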
Receiver operating characteristic (ROC) curves were plotted for all the systems. The area under the curve (AUC) improved with the addition of auxiliary parameters [Figure 2]. All scoring systems showed an AUC of more than 0.8, indicating excellent performance, except the K-TIRADS, which had an AUC of 0.78. TMC-RSS showed the maximum AUC of 0.92, reiterating its superior performance in identifying malignant nodules.
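For an ordinal risk score, the AUC equals the probability that a randomly chosen malignant nodule receives a higher score than a randomly chosen benign one, with ties counted as one half (the Mann–Whitney formulation). A minimal sketch follows; the function name `auc_from_scores` and the numeric coding of risk groups (0 = low, 1 = intermediate, 2 = high) are assumptions, and the toy data are not study data.

```python
def auc_from_scores(scores, malignant):
    """Area under the ROC curve via pairwise comparison of ordinal scores."""
    pos = [s for s, m in zip(scores, malignant) if m]
    neg = [s for s, m in zip(scores, malignant) if not m]
    if not pos or not neg:
        raise ValueError("need at least one malignant and one benign nodule")
    # Each malignant/benign pair scores 1 if ranked correctly, 0.5 if tied.
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data for illustration only: 0 = low, 1 = intermediate, 2 = high risk.
toy_scores = [2, 2, 1, 1, 0, 0, 0, 2]
toy_malignant = [1, 1, 1, 0, 0, 0, 1, 0]

if __name__ == "__main__":
    print(auc_from_scores(toy_scores, toy_malignant))
```

The pairwise form is quadratic in the number of nodules, which is acceptable for a sample of a few hundred; a rank-based implementation would be preferable for larger data.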
Discussion
US is considered an extension of clinical examination in the evaluation of a thyroid nodule. Due to the high prevalence of thyroid nodules, it is very important to identify those at high risk for malignancy. Although many US features are proven to be robust indicators of malignancy, no single feature is reliably predictive. Hence, many risk-stratification models have been developed that combine several suspicious US features in order to improve the diagnostic ability of US. Each system ascribes a different degree of risk to the individual US features in order to determine a nodule's risk of malignancy, and the risk assigned to a particular feature varies substantially between systems. As a result, no system is universally accepted.
The 4-tier K-TIRADS is simple to use and analyze but has been criticized for laying emphasis on US patterns rather than the high-risk findings themselves. This makes it difficult to classify nodules which lack a typical pattern but carry high-risk findings. The ACR-TIRADS integrates all US features, which are assigned a numerical score based on their malignant potential. It is technically more complex than other systems. The reported drawback of this system is that nodules with mixed echogenic patterns may be placed in a lower grade, resulting in false-negative diagnoses. In our study, we observed similar difficulties with categorization and risk assignment in the conventional US-based scoring systems due to overlapping findings. False negativity was 7.2% in K-TIRADS and 7% in ACR-TIRADS, while false positivity was 12.5% and 7.1%, respectively. The proportion of nodules classified into the intermediate-risk category was high (16.5% in K-TIRADS and 16% in ACR-TIRADS). The ATA guidelines, first proposed in 2009 and revised in 2015, give a 5-tier risk stratification system based on US features. The limitation of this system is that it gives equal importance to all suspicious features, while laying little emphasis on independent risk factors like the composition of a nodule. In our study, the ATA risk stratification showed a false negativity of 7.93% and the highest false positivity (22.5%) among all scoring systems. About 6.5% of all nodules were of intermediate risk.
Attempts to standardize the US terminology for diagnosis of thyroid nodules are still ongoing, and this has led to further advancements in US techniques. Addition of multiple non-grayscale parameters to the conventional US findings has shown a lot of promise in this regard. The F-TIRADS is a 5-tier system which, along with the conventional US high-risk features, includes stiffness of the nodule on elastography (ES) and suspicious lymph nodes as indicators of malignancy. Initially criticized for poor reproducibility, it was subsequently shown to have better interobserver agreement. The fair amount of experience required to perform and interpret elastographic findings may limit the extensive use of this system. The false negativity in F-TIRADS was 4.5% while false positivity was 11.1%, and 11% of nodules were categorized as intermediate risk. F-TIRADS showed better sensitivity and specificity compared to conventional US-based scoring systems (94.7% and 80%, respectively). This improvement in diagnostic performance has been shown in other studies evaluating the utility of ES with conventional TIRADS.
Along with elasticity, the role of nodule vascularity in the diagnosis of malignancy in imaging studies has always been debated. Previous studies have shown that vascularity alone is not a reliable indicator of malignancy, but recent reports using advanced techniques have reemphasized the role of vascularity in the diagnosis of malignancy on imaging. The TMC-RSS is a quantitative algorithm for characterizing nodules and consists of conventional US features in combination with Color Doppler, ES, and cervical nodal status. It assigns a positive score for suspicious features and a negative score for benign features. As it is a completely quantitative scoring system, it reduces interobserver reporting variability. TMC-RSS had a false-negative rate of 2.27% while the false positivity was zero. The number of nodules classified as intermediate risk was the least among all scoring systems (5.35%). The initial study on TMC-RSS showed a sensitivity of 90%, specificity of 89%, and accuracy of 91%. Our study showed 96.2% sensitivity, 91.4% specificity, and an accuracy of 94.7%. The most significant aspect of TMC-RSS was the recategorization of intermediate nodules and the reduction in the number of both false positives and false negatives. A few nodules were also downgraded to intermediate risk from high risk, and this reassignment may not have had any impact on the final results, as the intermediate-risk category is also considered an indicator of malignancy for the final analysis. Many other studies have demonstrated improvement in diagnostic performance using additional multimodal parameters. The ROC curves for each system were plotted. There was an incremental trend in the AUC from conventional TIRADS systems through multimodality systems. The TMC-RSS showed the maximum AUC, confirming its better performance and improved diagnostic accuracy.
Similar attempts at utilizing multimodal US features in evaluation of thyroid nodules have shown promise. Another study incorporating minor features and negative score for benign characters showed comparable outcomes, with an AUC of 0.921, overall sensitivity of 82%, and specificity of 87.6%. This study also concluded that it is possible to categorize all the nodules into one of the risk categories, unlike conventional US-based scoring systems where overlapping suspicious features may prevent appropriate categorization. Our study reflects a similar pattern, where TMC-RSS was successful in assigning a risk category to all the nodules.
There are some limitations to our study. First, we have considered only nodules which are of 4 cm or less. Although initially restricted to small nodules, recent studies have shown that with advances in techniques, ES can be useful in large nodules as well. Inclusion of nodules of all sizes will yield a better picture of applicability of the multimodality scoring systems. Second, an increase in the sample size will render the data more robust. Third, the study is performed in patients of a single specialty center by a single trained ultrasonologist. Hence, the results may not be replicable when applied in the community.
In conclusion, multimodal ultrasound imaging risk stratification systems like the TMC-RSS improve the diagnostic performance of ultrasound imaging of thyroid nodules compared to scoring systems incorporating only conventional grayscale features.
Declaration of patient consent
The authors certify that they have obtained all appropriate patient consent forms. In the form, the patient(s) has/have given his/her/their consent for his/her/their images and other clinical information to be reported in the journal. The patients understand that their names and initials will not be published and due efforts will be made to conceal their identity, but anonymity cannot be guaranteed.
Financial support and sponsorship
None.
Conflicts of interest
There are no conflicts of interest.
References
Parsa AA, Gharib H. Epidemiology of thyroid nodules. In: Gharib H, editor. Thyroid Nodules: Diagnosis and Management. Cham: Springer International Publishing; p. 1-11.
Haugen BR, Alexander EK, Bible KC, Doherty GM, Mandel SJ, et al. 2015 American Thyroid Association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: The American Thyroid Association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid 2016;26:1-133.
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2018;68:394-424.
Vaccarella S, Franceschi S, Bray F, Wild CP, Plummer M, Dal Maso L. Worldwide thyroid-cancer epidemic? The increasing impact of overdiagnosis. N Engl J Med 2016;375:614-7.
Horvath E, Majlis S, Rossi R, Franco C, Niedmann JP, Castro A, et al. An ultrasonogram reporting system for thyroid nodules stratifying cancer risk for clinical management. J Clin Endocrinol Metab 2009;94:1748-51.
Asteria C, Giovanardi A, Pizzocaro A, Cozzaglio L, Morabito A, Somalvico F, et al. US-elastography in the differential diagnosis of benign and malignant thyroid nodules. Thyroid 2008;18:523-31.
Shin JH, Baek JH, Chung J, Ha EJ, Kim JH, Lee YH, et al. Ultrasonography diagnosis and imaging-based management of thyroid nodules: Revised Korean Society of Thyroid Radiology consensus statement and recommendations. Korean J Radiol 2016;17:370-95.
Tessler FN, Middleton WD, Grant EG, Hoang JK. ACR thyroid imaging, reporting and data system (TI-RADS): White paper of the ACR TI-RADS committee. J Am Coll Radiol 2017;14:587-95.
Russ G. Risk stratification of thyroid nodules on ultrasonography with the French TI-RADS: Description and reflections. Ultrasonography 2016;35:25-38.
Mahajan A, Vaish R, Arya S, Sable N, Pande S, Paul P, et al. Diagnostic performance of thyroid multimodal-imaging comprehensive risk stratification scoring (TMC-RSS) system in characterising thyroid nodules. J Clin Oncol 2017;35(15_suppl):e17588.
Shen Y, Liu M, He J, Wu S, Chen M, Wan Y, et al. Comparison of different risk-stratification systems for the diagnosis of benign and malignant thyroid nodules. Front Oncol 2019;9:378.
Middleton WD, Teefey SA, Reading CC, Langer JE, Beland MD, Szabunio MM, et al. Comparison of performance characteristics of American College of Radiology TI-RADS, Korean Society of Thyroid Radiology TIRADS, and American Thyroid Association guidelines. AJR Am J Roentgenol 2018;210:1148-54.
Stoian D, Ivan V, Sporea I, Florian V, Mozos I, Navolan D, et al. Advanced ultrasound application - impact on presurgical risk stratification of the thyroid nodules. Ther Clin Risk Manag 2020;16:21-30.
Russ G, Bigorgne C, Royer B, Rouxel A, Bienvenu-Perrard M. [The thyroid imaging reporting and data system (TIRADS) for ultrasound of the thyroid]. J Radiol 2011;92:701-13.
Russ G, Royer B, Bigorgne C, Rouxel A, Bienvenu-Perrard M, Leenhardt L. Prospective evaluation of thyroid imaging reporting and data system on 4550 nodules with and without elastography. Eur J Endocrinol 2013;168:649-55.
Lippolis PV, Tognini S, Materazzi G, Polini A, Mancini R, Ambrosini CE, et al. Is elastography actually useful in the presurgical selection of thyroid nodules with indeterminate cytology? J Clin Endocrinol Metab 2011;96:E1826-30.
Xue J, Cao X-L, Shi L, Lin CH, Wang J, Wang L. The diagnostic value of combination of TI-RADS and ultrasound elastography in the differentiation of benign and malignant thyroid nodules. Clin Imaging 2016;40:913-6.
Khadra H, Bakeer M, Hauch A, Hu T, Kandil E. Is vascular flow a predictor of malignant thyroid nodules? A meta-analysis. Gland Surg 2016;5:576-82.
Toomatari SBM, Mohammadi A, Sepehrvand N, Toomatari SEM, Ghasemi-Rad M, Shamspour SZ, et al. A novel computerised quantification of thyroid vascularity in the differentiation of malignant and benign thyroid nodules. Pol J Radiol 2019;84:e517-21.
Baig FN, van Lunenburg JTJ, Liu SYW, Yip SP, Law HKW, Ying M. Computer-aided assessment of regional vascularity of thyroid nodules for prediction of malignancy. Sci Rep 2017;7:14350.
Mahajan A, Vaidya T, Vaish R, Sable N. The journey of ultrasound-based thyroid nodule risk stratification scoring systems: Do all roads lead to thyroid imaging, reporting and data system (TIRADS)? J Head Neck Physicians Surg 2017;5:57-65.
Pei S, Cong S, Zhang B, Liang C, Zhang L, Liu J, et al. Diagnostic value of multimodal ultrasound imaging in differentiating benign and malignant TI-RADS category 4 nodules. Int J Clin Oncol 2019;24:632-9.
Zhao R-N, Zhang B, Jiang Y-X, Yang X, Lai XJ, Zhu SL, et al. Ultrasonographic multimodality diagnostic model of thyroid nodules. Ultrason Imaging 2019;41:63-77.
Delfim RLC, da Veiga LCG, Vidal APA, Lopes FPPL, Vaisman M, Teixeira PFDS. Likelihood of malignancy in thyroid nodules according to a proposed thyroid imaging reporting and data system (TI-RADS) classification merging suspicious and benign ultrasound features. Arch Endocrinol Metab 2017;61:211-21.
Zhao C-K, Xu H-X. Ultrasound elastography of the thyroid: Principles and current status. Ultrasonography 2019;38:106-24.
[Figure 1], [Figure 2]
[Table 1], [Table 2], [Table 3], [Table 4], [Table 5]