Journal List > Ann Lab Med > v.44(6) > 1516088439

Park: Why Terminology Standards Matter for Data-driven Artificial Intelligence in Healthcare

INTRODUCTION

Artificial intelligence (AI) has become invaluable in healthcare for disease diagnosis, treatment planning, and clinical decision-making. AI uses algorithms that emulate the human brain to learn, synthesize, analyze, generalize, and solve problems using natural language processing (NLP), machine learning, deep learning, and large language models [1]. The accuracy and performance of AI algorithms depend on the quality and quantity of data they process. Maximizing the potential of AI algorithms requires feeding them substantial data from various healthcare systems. Unfortunately, most healthcare systems operate with different data formats and document architectures, compounding issues with unstructured, non-standardized data, which require increased storage space and processing time. Moreover, processing such data can introduce errors that may skew analysis results, highlighting the need for data interoperability in AI algorithms.
Herein, we examined the importance of interoperability, particularly semantic interoperability based on terminology standards, the global adoption of terminology standards, collaborative activities in terminology standard development, and proposed strategies to improve their usability in laboratory medicine.

INTEROPERABILITY AND STANDARDS

Interoperability refers to the ability of two or more systems to exchange and utilize information, including foundational, structural, semantic, and organizational levels [2]. Foundational interoperability involves the most basic level of data receipt without requiring interpretation by the receiving system. Structural interoperability specifies the format and structure of the data. Fast Healthcare Interoperability Resources (FHIR) is an emerging standard for structural interoperability. Semantic interoperability ensures unambiguous data exchange, with key standards including the International Statistical Classification of Diseases (ICD), Systematized Nomenclature of Medicine - Clinical Terms (SNOMED CT), and Logical Observation Identifiers Names and Codes (LOINC). Organizational interoperability secures data exchange among organizations beyond the technical context. Standards include governance, policy, social, and legal rules and regulations to facilitate the secure exchange of data within and among organizations.

TERMINOLOGY STANDARDS

Terminology standards are crucial for ensuring structured and meaningful data exchange. ICD, SNOMED CT, and LOINC are the most widely used terminology standards.
The ICD is a global statistical classification for coding the causes and consequences of diseases and deaths. WHO designed the ICD to enhance the comparability of morbidity and mortality data internationally [3]. ICD-10 codes vary from three to seven characters. The first three characters designate the disease category. The next three characters indicate the related etiology, anatomic site, severity, or other clinical details. The seventh character is used to document care episodes for injuries and other conditions with external causes. ICD-10 has a mono-hierarchical structure with residual categories for diseases not classified elsewhere.
SNOMED CT is the most comprehensive multilingual clinical terminology globally, including over 366,000 concepts with multiple descriptions and logical definitions organized hierarchically [4]. The core components of SNOMED CT are concepts, descriptions, and relationships. Each concept represents a unique clinical meaning and is organized hierarchically to reflect different levels of granularity. Descriptions, which are human-readable terms associated with concepts, are categorized as fully specified names (FSNs) that fully capture the meaning of a concept and synonyms that offer alternative expressions of the concept. A relationship represents an association between concepts and is classified as either a subtype (IS-A) or an attribute. Relationships facilitate the logical definition of concepts for computational processing. SNOMED CT encompasses a broad spectrum of clinical concepts such as diseases, procedures, medications, and outcomes for documentation in electronic health records (EHRs).
LOINC is a standardized terminology for describing laboratory and clinical observations [5]. Laboratory LOINC includes tests, measurements, or observations about a specimen, covering domains such as chemistry, hematology, cell counts, and antibiotic susceptibilities. Clinical LOINC includes tests, measurements, and observations about patients, including vital signs, hemodynamics, and radiological findings. Currently, there are over 100,000 LOINC codes, each comprising six parts that represent the component, property measured, timing, type of sample, type of scale, and method used. LOINC assigns three names to each concept: the six-part fully specified name, the clinician-friendly long common name, and a short name suitable for column headers in reports.

ADOPTION OF TERMINOLOGY STANDARDS

According to a survey of 27 member countries by the Organisation for Economic Co-operation and Development (OECD) in 2021, the adoption of EHRs has increased since 2016 [6]. In EHRs, patient data are coded in a structured manner using terminology standards. The survey revealed that multiple terminology standards are used for the same data category in many countries. For example, ICD is used for capturing diagnoses in 21 countries. Conversely, Australia and Switzerland use SNOMED CT and Greece, Hungary, and the United States (US) use both terminologies to record diagnoses. For medication, the Anatomical Therapeutic Chemical (ATC) classification is used in 16 of the 27 countries. Australia and Switzerland use SNOMED CT, and other countries use locally developed terminology standards. For surgical procedures, eight of the 27 countries use ICD codes, such as ICD-9-CM and ICD-10-PCS; five use SNOMED CT; and the remainder use national codes. Laboratory tests are captured using LOINC in 16 countries, whereas SNOMED CT is used in Iceland, Slovenia, and Sweden. Table 1 presents the adoption of health terminology by domain in six OECD member countries.

COLLABORATIVE ACTIVITIES IN TERMINOLOGY STANDARD DEVELOPMENT

Collaboration among developers of terminology standards has increased over the past few years. In 2022, the Regenstrief Institute and SNOMED International (SI) agreed to develop a LOINC extension in which each LOINC concept has an equivalent SNOMED CT concept [7]. The SNOMED CT concept equivalent of a LOINC is fully defined, implying that each LOINC part has an equivalent SNOMED CT concept (Fig. 1). LOINC extensions in the SNOMED CT format are available at https://loincsnomed.org/browser. A LOINC concept can be searched using a text string, LOINC name, or SNOMED CT concept ID. The LOINC extension links LOINC, which provides laboratory content in an understandable format, and SNOMED CT, which provides the computable framework together in a complementary way. The LOINC extension aims to reduce duplication of efforts by providers who use different combinations of LOINC and SNOMED CT and leverages the rich semantic/ontological representation of SNOMED CT for the analysis of laboratory reports, irrespective of whether they use LOINC or SNOMED CT. Another collaborative effort is to map SNOMED CT to classifications such as ICD [8]. The WHO and SI mapped SNOMED CT concepts to ICD-10 and published a reference set via the SNOMED CT browser. Other efforts include mapping SNOMED CT to Current Procedural Terminology (CPT) in the US and mapping SNOMED CT to Office of Population Censuses and Surveys Classification of Interventions and Procedures-4 (OPCS-4) and ICD-10 in the United Kingdom (UK).

BiNDing TerMINOLOGY STANDARDS TO STRUCTURAL STANDARDS

A global initiative was undetaken to combine structural and semantic standards, thereby enhancing the interoperability of data from diverse sources; for instanceHL7 FHIR was integrated with SNOMED CT and LOINC. For example, terminology binding of US Core Data for Interoperability (USCDI) using US Core Profiles in the US Core Implementation Guide [9, 10]. The USCDI represents the basic building blocks of healthcare interoperability, including a standardized set of data classes, data elements, and associated terminology standards such as SNOMED CT and LOINC. The US Core Implementation Guide provides detailed technical specifications and implementation guidance to support USCDI adoption. Australia, Canada, the UK, and Korea are adopting similar approaches.

STRATEGIES FOR ADOPTING TERMINOLOGY STANDARDS IN LABORATORY MEDICINE

While terminology standards are essential for ensuring the semantic interoperability of healthcare data, their adoption presents challenges [11]. The availability of multiple terminology standards for representing laboratory medicine complicates achieving a consensus on a singular standard. This dilemma may be addressed by allowing a range of complementary terminologies and accepting the inherent differences among them. LOINC codes are suitable for coding laboratory tests, whereas SNOMED CT is appropriate for coding non-numerical results. Data interoperability may be facilitated by cross-mapping LOINC and SNOMED CT or by adopting SNOMED CT as a reference terminology to act as an interlingua among various standards.
Terminology standards inevitably change in content and structure owing to the evolution of medical knowledge, the need for concept refinement or disambiguation, content redundancy, and changes in code or meaning. As most terminology standards are designed to accommodate these evolutions, clinicians may propose the addition of new concepts. Often, the descriptions used in terminology standards are not commonly used by clinicians (e.g., “Urea nitrogen:MCnc:Pt:Ser/Plas:Qn”), rendering their use cumbersome. Clinicians may also suggest incorporating custom-made terms (e.g., “BUN measurement”) as synonyms to enhance the practicality of the standards.
The application of terminology standards at the point of care may burden healthcare providers. This burden can be alleviated by introducing a better EHR interface with personalized templates for commonly used terms and a terminology search engine that retrieves appropriate terms and recognizes common synonyms. Additionally, developing NLP algorithms to match local terms with standardized terminology and mapping SNOMED CT to other terminologies or classifications in healthcare information systems, such as ICD-10 and LOINC, can streamline processes by automatically correlating the clinical data recorded by clinicians in SNOMED CT with other classifications or terminologies.
In summary, the quantity and quality of data are crucial for data-driven AI in healthcare. Ensuring data quality with semantic interoperability requires the unambiguous collection, sharing, and use of data facilitated by adherence to terminology standards. The usability of terminology standards in laboratory medicine can be improved by accepting a range of complementary terminology standards, active participation in the evolution of the content and structure of these standards, and alleviating the burden on healthcare providers by developing better interfaces or incorporating natural language processing.

ACKNOWLEDGEMENTS

None.

Notes

AUTHOR CONTRIBUTIONS

The author confirms sole responsibility for the following: manuscript conception and preparation.

CONFLICTS OF INTEREST

None declared.

RESEARCH FUNDING

None declared.

References

1. Alowais SA, Alghamdi SS, Alsuhebany N, Alqahtani T, Alshaya AI, Almohareb SN, et al. 2023; Revolutionizing healthcare: the role of artificial intelligence in clinical practice. BMC Med Educ. 23:689. DOI: 10.1186/s12909-023-04698-z. PMID: 37740191. PMCID: PMC10517477. PMID: 6781510864c94223aa4e2eece3d06ab1.
2. HIMSS. Interoperability in healthcare. https://www.himss.org/resources/interoperability-healthcare. Updated on Sep 2023.
3. World Health Organization. The WHO Family of International Classifications (FIG). https://www.who.int/standards/classifications. Updated on Feb 2024.
4. SNOMED International. What is SNOMED CT? https://www.snomed.org/five-step-briefing. Updated on Jul 2023.
5. Regenstrief Institute. What LOINC is. https://loinc.org/get-started/what-loinc-is/. Updated on Sep 2023.
6. Slawomirski L, Lindner L, de Bienassis K, Haywood P, Hashiguchi TCO, Steentjes M, et al. Progress on implementing and using electronic health record systems: Developments in OECD countries as of 2021. OECD Health Working Papers, No. 160. OECD Publishing;Paris: DOI: 10.1787/18152015.
7. SNOMED International and Regenstrief Institute. New collaboration agreement between SNOMED International and LOINC® from Regenstrief. https://www.snomed.org/news/new-collaboration-agreement-between-snomed-international-and-loinc%C2%AE-from-regenstrief. Updated on Oct 2022.
8. Bodenreider O, Cornet R, Vreeman DJ. 2018; Recent developments in clinical terminologies-SNOMED CT, LOINC, and RxNorm. Yearb Med Inform. 27:129–39. DOI: 10.1055/s-0038-1667077. PMID: 30157516. PMCID: PMC6115234.
9. HealthIT.gov. United States Core Data for Interoperability (USCDI). https://www.healthit.gov/isa/united-states-core-data-interoperability-uscdi. Updated on October 2023.
10. HL7. US Core Implementation Guide. https://build.fhir.org/ig/HL7/US-Core/. Updated on December 2023.
11. Kim S, Cho EJ, Jeong TD, Park HD, Yun YM, Lee K, et al. 2023; Proposed model for evaluating real-world laboratory results for big data research. Ann Lab Med. 43:104–7. DOI: 10.3343/alm.2023.43.1.104. PMID: 36045065. PMCID: PMC9467825.

Fig. 1

Example of the LOINC terminology in SNOMED CT.

Abbreviations: LOINC, Logical Observation Identifiers, Names, and Codes; MCnc, mass concentration; Pt, point in time; Qn, quantitative; SCTID, SNOMED CT identifier; Ser/plas, serum/plasma; SNOMED CT, Systematized Nomenclature of Medicine Clinical Term.
alm-44-6-467-f1.tif
Table 1

Health terminology standards adopted by six OECD member countries by domain

Domain Australia Canada Japan Korea UK US
Diagnosis SNOMED CT - AU ICD-10-CA, DSM-IV, SNOMED CT Japanese standard disease code master based on ICD-10 KCD-7 (Korean version based on ICD-10) SNOMED CT, ICD-10 SNOMED CT, ICD-10
Medication SNOMED CT - AU Drug Identification Number, ATC, CCD Standard Master for Pharmaceutical Products KD Code SNOMED CT, UK Drug Extension, dm+d RxNorm, National Drug Code
Laboratory LOINC LOINC Laboratory Test Code Master EDI SNOMED CT LOINC
Radiology SNOMED CT – AU, DICOM SNOMED CT, DICOM DICOM EDI, DICOM SNOMED CT, NICIP Code Set LOINC
Surgery and procedures ICD-10-AM, SNOMED CT - AU Canadian Classification of Health Intervention/SNOMED CT Standard surgical and treatment procedures code master ICD-9-CM SNOMED CT, OPCS-4 SNOMED CT, ICD-10-PCS, HCPCS, CPT-4

Abbreviations: ATC, Anatomical Therapeutic Chemical; CCD, Canadian Clinical Drug; CPT, Current Procedural Terminology; DICOM, Digital Imaging and Communications in Medicine; dm+d, dictionary of medicines and devices; DSM-IV, Diagnostic and Statistical Manual of Mental Disorders; EDI, electronic data interchange; HCPCS, Healthcare Common Procedure Coding System; ICD-10-AM, ICD-10-Australian Modification; ICD-10-PCS, ICD-10-Procedure Coding System; KD, Korea Drug; LOINC, Logical Observation Identifiers, Names, and Codes; NICIP, National Interim Clinical Imaging Procedure; OPCS-4, Office of Population Censuses and Surveys Classification of Interventions and Procedures-4; SNOMED CT - AU, Systematized Nomenclature of Medicine Clinical Term - Australia.

TOOLS
Similar articles