Journal List > J Korean Med Sci > v.34(Suppl 1) > 1126812

Kim, Chang, Kang, Kim, Park, and Khang: Use of the National Health Information Database for Estimating Town-Level Mortality in Korea: Comparison with the National Administrative Data, 2014–2017

Abstract

Background

This study addressed town-level mortality rates using the National Health Information Database (NHID) of the National Health Insurance Service in Korea in comparison with those derived from the National Administrative Data (NAD) of the Ministry of Interior and Safety.

Methods

We employed the NHID and NAD between 2014 and 2017. We compared the numbers of population and deaths at the national level between these two data sets. We also compared the distribution of the town-level numbers of population and deaths of the two data sets. Correlation analyses were performed to investigate the relation between the NHID and NAD in the town-level numbers of population and deaths, crude mortality rate, and standardized mortality ratio (SMR).

Results

The numbers of population and deaths in the NHID were almost identical to those in the NAD, regardless of gender. The distribution of the town-level numbers of population and deaths was also similar between the two data sets during the entire study period. Throughout the study period, the Pearson correlation coefficients between the two databases for the town-level numbers of population and deaths and the crude mortality rate were 0.996 or over. The correlation coefficients for the SMR ranged from 0.937 to 0.972.

Conclusion

Town-level mortality showed significant correlation and concordance between the NHID and NAD. This result highlights the possibility of producing future analyses of town-level health-related indicators in Korea, including the mortality rate, using the NHID.

Graphical Abstract

jkms-34-e168-abf001

INTRODUCTION

Information on health indicators in small geographical areas may provide better opportunities to address specific local health problems among the most vulnerable populations. More accurate methods for measuring diseases, exposures, behaviors, and other health measures could facilitate better assessments of population health and the development of policies and targeted programs for preventing disease.1 In Korea, the Seventh District Healthcare Plan that every local district government must set up by law requires information on health indicators stratified by income and geography.2 However, there is a paucity of information on health indicators at the town (eup/myeon/dong) level in Korea.
In some previous studies, eup/myeon/dong areas, the smallest administrative units in Korea, were used as the unit of analysis for calculating mortality in Korea by geographical area.3456 However, those studies only provided town-level mortality within a city3 or for both men and women combined45, mainly due to limitations of the data used in those studies.
The National Health Information Database (NHID) is considered to be a data source suitable for monitoring mortality at the town level. The NHID utilizes data from the National Health Insurance Service (NHIS), which covers the entire Korean population.7 The eligibility database of the NHID includes information on the gender, age, residential area, and death status of all people enrolled in the national health insurance program.7 Also, various health and medical information can be obtained through linkage with other databases of the NHIS.7 A previous study showed that the NHID could be used to estimate mortality and life expectancy at the national, provincial, and district levels.8 However, it has not yet been explored whether the NHID can be used for calculating mortality or life expectancy at the town level.
The purpose of this study was to examine the feasibility of estimating town-level mortality using the NHID. We compared town-level mortality rates based on the NHID with those derived from the National Administrative Data (NAD) of the Ministry of Interior and Safety (MOIS), which constitutes the official national database on vital and internal migration statistics. In this study, we first compared the overall numbers of population and deaths between the NHID and NAD during the study period. Second, we compared the distribution of town-level numbers of population and deaths between the two data sets. Third, correlation analyses were conducted of the numbers of population and deaths, the crude mortality rate, and the standardized mortality ratio (SMR) between the NHID and NAD.

METHODS

Data

In this study, we used the NHID and the NAD provided by the MOIS in 2014–2017. All subjects present on January 1 of each year in the NHID eligibility data were followed up for one year. The numbers of population and deaths by year, gender, 5-year age group (0, 1–4, 5–9, 10–14, …, 85+), and eup/myeon/dong were obtained. The data set includes both the beneficiaries of the national health insurance and medical aid programs. We excluded foreigners and cases where a subject's gender, age, or address was not known. The NAD is based on the resident registration statistics provided on the MOIS website, which provides figures since 2008.9 The number of registered residents in the current year was obtained in aggregated form according to gender, 5-year age group (0, 1–4, 5–9, 10–14, …, 85+), and eup/myeon/dong at the end of December of the previous year. The number of deaths was calculated annually by gender and eup/myeon/dong. The study period was limited to 2014–2017 because the numbers of deaths in the NAD were not provided separately for men and women between 2008 and 2014.

Unit of analysis

The unit of analysis in this study was eup/myeon/dong, categorized according to the administrative unit classification as of December 31, 2017 of the Korean Administrative Unit Classification issued by the Statistical Policy Bureau of Statistics Korea.10 However, eight towns (Gunnae-myeon, Jangdan-myeon, Jinseo-myeon in Paju, Gyeonggi and Geundong-myeon, Wondong-myeon, Wonnam-myeon, Imnam-myeon in Cheorwon-gun, Gangwon and Sudong-myeon in Goseong-gun, Gangwon), which are civilian access control areas (for military purposes), were excluded from the analysis. Besides, 26 towns with an average annual population of fewer than 500 persons were integrated with neighboring areas to ensure stable mortality calculations.11 Towns that were divided or merged during the study period were consolidated into a single unit. Ultimately, a total of 3,431 towns across the nation were analyzed.

Statistical analysis

First, we compared the national-level numbers of population and deaths according to the calendar year and gender between the NHID and NAD. Second, the distribution of the town-level numbers of population and deaths between the two data sets was compared according to year and gender during the whole study period. Third, we analyzed the correlations between the NHID and NAD for the town-level numbers of population and deaths, crude mortality rate, and SMR throughout the entire study period. The crude mortality was defined as the number of deaths per 100,000 population. SMR is a mortality measure derived from an indirect standardization method that can be used when the number of deaths by age group is unknown, or the number of deaths is tiny.12 This method is considered suitable for measuring differences in health status between small areas.13 The standard population in the SMR calculation was the total population of the data set.

Ethics statement

This study was approved by the Institutional Review Board (IRB) of Seoul National University Hospital (IRB No. E-1810-008-975) and the National Health Insurance Service of Korea (REQ0000022282). Informed consent was waived for this study because secondary data were used.

RESULTS

Table 1 shows the national-level numbers of population and deaths in the two data sets according to the calendar year and gender for the entire study period. When the data for the entire study period were combined, the resulting total number of population was 203,741,630 in the NHID, which was nearly identical to the corresponding figure of 203,748,512 in the NAD. The NHID recorded 1,101,739 men and women deaths in 2014–2017, 0.7% less than the 1,109,705 deaths recorded in the NAD for the same period. When comparing the two data sets by year, the population numbers were almost the same, and the number of deaths was 0.1%–1.3% less in the NHID than in the NAD. This tendency was similar when men and women were separately examined.
Table 1

Comparison of numbers of population and deaths at the national level between the NHID of the NHIS and NAD of the Ministry of Interior and Safety, 2014–2017

jkms-34-e168-i001
Gender Year No. of population No. of deaths
NHID NAD NHID/NAD NHID NAD NHID/NAD
Total 2014–2017 203,741,630 203,748,512 1.000 1,101,739 1,109,705 0.993
2014 50,655,308 50,662,752 1.000 265,928 267,683 0.993
2015 50,853,193 50,861,629 1.000 273,965 277,472 0.987
2016 51,039,935 51,039,939 1.000 278,773 279,017 0.999
2017 51,193,194 51,184,192 1.000 283,073 285,533 0.991
Men 2014–2017 101,799,286 101,802,715 1.000 598,103 602,469 0.993
2014 25,326,070 25,329,566 1.000 145,812 146,857 0.993
2015 25,414,403 25,418,435 1.000 148,812 150,651 0.988
2016 25,496,699 25,496,737 1.000 150,901 151,091 0.999
2017 25,562,114 25,557,977 1.000 152,578 153,870 0.992
Women 2014–2017 101,942,344 101,945,797 1.000 503,636 507,236 0.993
2014 25,329,238 25,333,186 1.000 120,116 120,826 0.994
2015 25,438,790 25,443,194 1.000 125,153 126,821 0.987
2016 25,543,236 25,543,202 1.000 127,872 127,926 1.000
2017 25,631,080 25,626,215 1.000 130,495 131,663 0.991
NHID = National Health Information Database, NHIS = National Health Insurance Service, NAD =National Administrative Data.
Table 2 shows the distribution of the town-level numbers of population and deaths in the two data sets for the entire study period and each year. During the entire study period, for both men and women, the median population number from the NHID was 44,680 (interquartile range [IQR], 73,004), which was similar to the median population number of 44,661 (IQR, 72,964) from the NAD. The minimum and maximum values were 4,092 and 476,523, respectively, in the NHID, which were also similar to the corresponding values of 4,093 and 476,375 in the NAD. The median number of deaths in the NHID was 280 (IQR, 229), which was also nearly identical to the value obtained using the NAD, and the minimum and maximum values were not significantly different. This was also true when analyzed by year, gender, or both year and gender. Supplementary Figs. 1 and 2 depict graphical representations of the very similar distribution of the town-level numbers of population and deaths for the whole study period between the two data sets.
Table 2

Distribution of numbers of population and deaths among 3,431 towns in Korea: findings from the NHID of the NHIS and the (NAD of the Ministry of Interior and Safety, 2014–2017

jkms-34-e168-i002
Gender Year NHID NAD
Population Death Population Death
Median (IQR) Minimum Maximum Median (IQR) Minimum Maximum Median (IQR) Minimum Maximum Median (IQR) Minimum Maximum
Total 2014–2017 44,680 (73,004) 4,092 476,523 280 (229) 18 1,965 44,661 (72,964) 4,093 476,375 284 (238) 22 1,957
2014 11,117 (18,242) 487 96,566 68 (57) 0 444 11,110 (18,259) 486 96,566 68 (57) 4 434
2015 11,095 (18,360) 1,018 119,778 69 (57) 3 493 11,099 (18,354) 1,017 119,801 71 (61) 5 480
2016 11,053 (18,270) 1,019 127,480 71 (60) 2 490 11,039 (18,260) 1,019 127,412 72 (62) 3 497
2017 10,975 (18,181) 960 148,556 72 (62) 5 538 10,962 (18,163) 961 148,548 72 (62) 8 546
Men 2014–2017 22,571 (36,193) 2,040 233,607 152 (127) 9 1,102 22,563 (36,193) 2,041 233,600 152 (130) 8 1,108
2014 5,638 (9,082) 229 48,586 37 (31) 0 253 5,643 (9,072) 228 48,596 38 (32) 0 249
2015 5,610 (9,065) 507 58,729 38 (33) 1 277 5,617 (9,080) 508 58,747 39 (33) 3 272
2016 5,579 (9,005) 508 62,476 38 (33) 1 278 5,575 (8,994) 510 62,455 38 (34) 1 285
2017 5,517 (9,024) 489 72,467 39 (33) 2 294 5,519 (9,013) 490 72,469 39 (34) 2 302
Women 2014–2017 22,038 (36,767) 2,005 242,916 128 (105) 9 863 22,023 (36,758) 2,006 242,775 130 (107) 11 849
2014 5,554 (9,195) 258 49,140 31 (27) 0 191 5,553 (9,187) 258 49,120 31 (28) 0 185
2015 5,468 (9,227) 501 61,049 32 (26) 0 216 5,469 (9,224) 500 61,054 32 (27) 1 208
2016 5,485 (9,182) 499 65,004 33 (27) 1 212 5,481 (9,175) 499 64,957 33 (28) 1 212
2017 5,447 (9,159) 471 76,089 33 (29) 1 244 5,444 (9,148) 471 76,079 33 (29) 2 244
NHID = National Health Information Database, NHIS = National Health Insurance Service, NAD = National Administrative Data, IQR = Interquartile range.
Table 3 shows the results of the correlation analysis between the NHID and NAD for the numbers of population and deaths, crude mortality, and SMR throughout the study period. In both gender, the Pearson correlation coefficient was 1.000 for the number of population, and 0.998 for the number of deaths and crude mortality, respectively. The same findings were obtained in a sub-analysis by gender. The Pearson correlation coefficient of the SMR was 0.960 for men and women combined, 0.972 for men, and 0.937 for women.
Table 3

Results of the correlation analysis between the number of population and deaths, crude mortality rate, and SMR according to gender: findings from the NHID of the NHIS and the NAD of the Ministry of Interior and Safety, 2014–2017

jkms-34-e168-i003
Period Gender No. of towns Correlation coefficient
No. of population No. of deaths No. of deaths per 100,000 SMR
2014–2017 Total 3,431 1.000 0.998 0.998 0.960
Men 1.000 0.998 0.997 0.972
Women 1.000 0.996 0.996 0.937
SMR = standardized mortality ratio, NHID = National Health Information Database, NHIS = National Health Insurance Service, NAD = National Administrative Data.
Fig. 1 presents the scatter plots and correlation coefficients of the town-level crude mortality of the two data sets according to gender throughout the study period. Regardless of gender, the town-level crude mortality of the two data sets showed an almost perfect correlation (a 45° diagonal line) for almost all towns, implying that the crude mortality rates from the NHID and NAD were nearly identical. Fig. 2 shows the scatter plots and correlation coefficients of the town-level SMR from the NHID and NAD according to gender throughout the study period. Overall, the town-level SMRs of the two data sets were relatively more different than the crude mortality rates, but most showed a close relationship. Supplementary Figs. 3 and 4 show the scatter plots and correlation coefficients of the town-level numbers of population and deaths according to gender throughout the study period. As for crude mortality and SMR, the town-level numbers of population and deaths in both data sets were almost on the 45° diagonal line, regardless of gender.
Fig. 1

Scatter plots and Pearson correlation coefficients [r] of town-level crude mortality from the NHID of the National Health Insurance Service and the NAD of the Ministry of the Interior and Safety among 3,431 towns in Korea, 2014–2017. (A) Correlation of crude mortality in both men and women combined. (B) Correlation of crude mortality in men. (C) Correlation of crude mortality in women.

NAD = National Administrative Data, NHID = National Health Information Database.
jkms-34-e168-g001
Fig. 2

Scatter plots and Pearson correlation coefficients [r] of the town-level SMR from the NHID of the National Health Insurance Service and the NAD of the Ministry of the Interior and Safety among 3,431 towns in Korea, 2014–2017. (A) Correlation of the SMR in both men and women combined. (B) Correlation of the SMR in men. (C) Correlation of the SMR in women.

SMR = standardized mortality ratio, NAD = National Administrative Data, NHID = National Health Information Database.
jkms-34-e168-g002

DISCUSSION

In this study, the NHID from the NHIS and the NAD from the MOIS showed similar numbers of population and deaths at the national level. In particular, the number of population in the two data sets was approximately the same. The distribution of the number of population and the number of deaths at the town level was also similar between the two data sets. The Pearson correlation coefficients of the town-level numbers of population, numbers of deaths, and crude mortality from the NHID and NAD pooled over the entire study period were 0.996 or over. The Pearson correlation coefficient of the SMR ranged from 0.937 to 0.972.
Bahk et al.8 reported that crude mortality and life expectancy at the national, provincial, and municipal levels estimated using the NHID were highly correlated and concordant with data from the Korean Statistical Information Service (KOSIS). Regardless of gender, the national-level crude mortality of the NHID was 0.99 fold the KOSIS data. The concordance correlation coefficient of crude mortality at the municipal level of the two data sets ranged from 0.997 to 0.999, and that of life expectancy was 0.914 to 0.990 depending on the period and gender. This study showed that the NHID presented very similar results to the NAD of the MOIS, in addition to its high level of similarity with the KOSIS data. It has also been found that the NHID is useful for estimating regional mortality rates not only at the national, provincial, and municipal levels but also at the level of towns.8
The NHID has several strengths over the NAD. First of all, the NAD only provides total numbers of death while the NHID provides age-specific numbers of deaths. Using the NAD data, mortality rates in each town can be derived only by the indirect standardization method while both direct and indirect standardization methods can be applied to the NHID data. Town-level life expectancy can be also calculated using age-specific mortality rates from the NHID. Second, the NHID provides information on the numbers of death according to individual linkage to mortality registration data from Statistics Korea, using unique personal identification numbers. However, the numbers of death in the NAD are aggregate data which are not based on individual linkage. Numerator-denominator bias in the calculation of town-level mortality rate would be an unresolved problem in the NAD. Finally, the NHID contains various health indicators. If the NHID is accepted as a reliable source for town-level health indicators, many health indicators could be produced and used in the future.
The correlation coefficients of the town-level crude mortality rates and SMRs between the NHID and NAD were very high, but the magnitudes of the correlation coefficients for the SMRs were smaller than those of the crude mortality rates. The tendency was particularly evident in women. The SMR is calculated by dividing the actual number of deaths by the number of expected deaths in each region. The number of expected deaths is estimated by multiplying the nationwide average age-specific mortality rate by the age-specific population in each region. Therefore, even if a small absolute difference in the number of deaths exists between the two data sets, the difference in the SMR can be relatively large compared to the difference in the crude mortality rate.
This study has limitations. First, the MOIS only provides the combined number of deaths for both gender in the data from 2008 to 2013, while the number of deaths for each gender is only available starting in the 2014 data. Therefore, the period of this study was limited to 2014 and later. The results of correlation analysis for both gender combined for the four years from 2010 to 2013 are shown in Supplementary Table 1. The results for the years 2010–2013 were very similar to the results for the years 2014–2017. Second, there is uncertainty about the validity of using the SMR to measure mortality, as was explored in this study. Some studies have shown that a significant difference in SMR could occur if the structures of two study populations are significantly different.1415 However, the town-level population by 5-year age groups in the two data sets was entirely consistent. When we compared the number of population, and the population proportion by 5-year age group between the two data sets used in this study, the concordance correlation coefficient in all age groups was above 0.999.
In this study, the town-level mortality calculated using the NHID of the NHIS was found to be nearly identical to and highly correlated with the town-level mortality obtained using the NAD of the MOIS. This result shows that the NHID can be used to estimate town-level mortality in Korea in the future. It is necessary to monitor the health status of each eup/myeon/dong in Korea through the SMR and life expectancy calculated using the NHID. In addition, if the NHID is considered as a reliable source for town-level health indicators other than mortality rates, many health indicators can be produced and used in the future.

Notes

Funding This research was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (grant No. HI18C0446).

Disclosure The authors have no potential conflicts of interest to disclose.

Author Contributions

  • Conceptualization: Khang YH.

  • Data curation: Kim YY, Park JH, Kang HY, Chang Y.

  • Formal analysis: Kim I.

  • Methodology: Khang YH, Kim I.

  • Writing - original draft: Kim I, Khang YH.

  • Writing - review & editing: Kim I, Chang Y, Khang YH, Kim YY, Park JH, Kang HY.

References

1. Khoury MJ, Iademarco MF, Riley WT. Precision public health for the era of precision medicine. Am J Prev Med. 2016; 50(3):398–401.
crossref
2. Korea Health Promotion Institute. Guideline for Establishment of the 7th Local Healthcare Plan. Updated 2018. Accessed December 8, 2018. http://www.khealth.or.kr/kps/publish/view?menuId=MENU00890&page_no=B2017003&pageNum=1&siteId=&srch_text=&srch_cate=28&srch_type=ALL&str_clft_cd_list=&str_clft_cd_type_list=&board_idx=10127.
3. Choi MH, Cheong KS, Cho BM, Hwang IK, Kim CH, Kim MH, et al. Deprivation and mortality at the town level in Busan, Korea: an ecological study. J Prev Med Public Health. 2011; 44(6):242–248.
crossref
4. Kim JH, Yoon TH. Comparisons of health inequalities in small areas with using the standardized mortality ratios in Korea. J Prev Med Public Health. 2008; 41(5):300–306.
crossref
5. Shin H, Lee S, Chu JM. Development of composite deprivation index for Korea: the correlation with standardized mortality ratio. J Prev Med Public Health. 2009; 42(6):392–402.
crossref
6. Yoon TH, Noh M, Han J, Jung-Choi K, Khang YH. Deprivation and suicide mortality across 424 neighborhoods in Seoul, South Korea: a Bayesian spatial analysis. Int J Public Health. 2015; 60(8):969–976.
crossref
7. Cheol Seong S, Kim YY, Khang YH, Heon Park J, Kang HJ, Lee H, et al. Data Resource Profile: The National Health Information Database of the National Health Insurance Service in South Korea. Int J Epidemiol. 2017; 46(3):799–800.
crossref
8. Bahk J, Kim YY, Kang HY, Lee J, Kim I, Lee J, et al. Using the National Health Information Database of the National Health Insurance Service in Korea for monitoring mortality and life expectancy at national and local levels. J Korean Med Sci. 2017; 32(11):1764–1770.
crossref
9. Ministry of the Interior and Safety. Resident Registration Statistics. Updated 2018. Accessed December 3, 2018. http://27.101.213.4/index.jsp.
10. Statistics Korea. Korean Administrative Unit Classification as of December 31, 2017. Updated 2018. Accessed December 4, 2018. http://kssc.kostat.go.kr/ksscNew_web/kssc/common/CommonBoardList.do?gubun=1&strCategoryNameCode=019&strBbsId=kascrr&categoryMenu=014.
11. Jonker MF, van Lenthe FJ, Congdon PD, Donkers B, Burdorf A, Mackenbach JP. Comparison of Bayesian random-effects and traditional life expectancy estimations in small-area applications. Am J Epidemiol. 2012; 176(10):929–937.
crossref
12. Gordis N. Epidemiology. 5th ed. Philadelphia, PA: Elsevier Saunders;2014.
13. Goldman DA, Brender JD. Are standardized mortality ratios valid for public health data analysis? Stat Med. 2000; 19(8):1081–1088.
crossref
14. Julious SA, Nicholl J, George S. Why do we continue to use standardized mortality ratios for small area comparisons? J Public Health Med. 2001; 23(1):40–46.
crossref
15. Silcocks PB, Jenner DA, Reza R. Life expectancy as a summary of mortality in a population: statistical considerations and suitability for use by health authorities. J Epidemiol Community Health. 2001; 55(1):38–43.
crossref

SUPPLEMENTARY MATERIALS

Supplementary Table 1

Results of the correlation analysis between the numbers of population and deaths, crude mortality rate, and SMR for men and women combined: Findings from the National Health Information Database of the National Health Insurance Service and the National Administrative Data of the Ministry of Interior and Safety, 2010–2013

Supplementary Fig. 1

Histograms and probability density functions of the town-level numbers of population from the NHID of the National Health Insurance Service and the NAD of the Ministry of the Interior and Safety among 3,431 towns in Korea, 2014–2017. (A) Histogram and probability density function of the numbers of the population in both men and women combined. (B) Histogram and probability density function of the numbers of the population in men. (C) Histogram and probability density function of the numbers of the population in women.

Supplementary Fig. 2

Histograms and probability density functions of the town-level numbers of deaths from the NHID of the National Health Insurance Service and the NAD of the Ministry of the Interior and Safety among 3,431 towns in Korea, 2014–2017. (A) Histogram and probability density function of the numbers of deaths in both men and women combined. (B) Histogram and probability density function of the numbers of deaths in men. (C) Histogram and probability density function of the numbers of deaths in women.

Supplementary Fig. 3

Scatter plots and Pearson correlation coefficients [r] of the town-level numbers of population from the NHID of the National Health Insurance Service and the NAD of the Ministry of the Interior and Safety among 3,431 towns in Korea, 2014–2017. (A) Correlation of the numbers of the population in both men and women combined. (B) Correlation of the numbers of the population in men. (C) Correlation of the numbers of the population in women.

Supplementary Fig. 4

Scatter plots and Pearson correlation coefficients [r] of the town-level number of deaths from the NHID of the National Health Insurance Service and the NAD of the Ministry of the Interior and Safety among 3,431 towns in Korea, 2014–2017. (A) Correlation of the numbers of deaths in both men and women combined. (B) Correlation of the numbers of deaths in men. (C) Correlation of the numbers of deaths in women.
TOOLS
ORCID iDs

Ikhan Kim
https://orcid.org/0000-0002-6428-4159

Youngs Chang
https://orcid.org/0000-0003-4722-6894

Hee-Yeon Kang
https://orcid.org/0000-0001-8530-8087

Yeon-Yong Kim
https://orcid.org/0000-0003-2179-8931

Jong Heon Park
https://orcid.org/0000-0002-4749-5878

Young-Ho Khang
https://orcid.org/0000-0002-9585-8266

Similar articles