Abstract
Purpose
We validate the 7th American Joint Committee on Cancer/Union for International Cancer Control (AJCC/UICC) staging system for gastric cancer and propose a new staging system that reflects the prognostic significances of each of T and N category.
Methods
Data from 5,957 patients who underwent curative gastrectomies from 2000 to 2007 at 4 university hospitals in Daegu Metropolitan city in Korea were analyzed for the validation of the 7th AJCC/UICC staging system for gastric cancer. The hazard ratios of the respective T and N categories were estimated and converted to weightings and summated to make prognostic score (P-score). Homogeneity and stage grouping were determined according to the P-scores.
Results
In the 7th AJCC/UICC staging system for gastric cancer, poor discrimination was noted between stages IIB and IIIA (P = 0.152). In addition, heterogeneity in stage IIB (P = 0.021) and a small gap in 5-year survival rates (1.7%) between stages IA and IB were noted. A new proposed staging system was generated on the basis of P-scores and demonstrated more discrimination between stages and more homogeneity within stages. The new staging system reflects the different prognostic impacts of N3a and N3b.
Gastric cancer is one of the leading causes of cancer-related deaths in the world [1]. Similar to other malignancies, we can predict the survival of gastric cancer patients and make plans for multidisciplinary treatments on the basis of the stage of disease; thus, the staging system is crucial in the management of patients with gastric cancer. We adopted the 7th American Joint Committee on Cancer/Union for International Cancer Control (AJCC/UICC) staging system starting in 2010, and it contained major changes compared with the 6th AJCC/UICC TNM staging system for gastric cancer. These changes included reclassification of T2a and T2b to T2 and T3, respectively; division of the previous N1 stage (up to 6 metastatic regional lymph nodes) into N1 (1 or 2 metastatic regional lymph nodes) and N2 (3 to 6 metastatic regional lymph nodes); and reclassification of the previous N2 (7 to 15 metastatic regional lymph nodes) and N3 stages (more than 15 metastatic regional lymph nodes) to N3a (7 to 15 metastatic regional lymph nodes) and N3b (more than 15 metastatic regional lymph nodes) [234]. Although the 7th AJCC/UICC staging system for gastric cancer has been reported to be more accurate or valid by some authors [3567], the system is not without controversy. One of the most common controversies is classification of the N category, such that the discriminative power of the N category in the 7th UICC/AJCC staging system has been decreased relative to that of the 6th AJCC/UICC staging system [89]. In addition, different prognostic significances of the subgroups of N3 (N3a and N3b) was not reflected in the final staging [1011]. The metastatic lymph node ratio was suggested to be a complementary measure by many authors [12131415], and others noted that there was heterogeneity in the survival of subgroups in a certain stage of the 7th AJCC/UICC staging system [16]. These controversies have highlighted the need for a new staging system [17]. In this study, we validated the 7th AJCC/UICC staging system with data from a large number of patients at 4 university hospitals and propose a new staging system based on basic concepts of the TNM staging system that reflects the different prognostic significance of each T and N category using a prognostic score (P-score).
Included patients were those who had undergone surgery for gastric cancer from January 2000 to December 2007 at 4 university hospitals in Daegu Metropolitan city in Korea. Patients who met the following criteria were excluded from the analysis: (1) nonresective surgery, (2) gastric cancer located in the cardia, (3) less than 16 total retrieved lymph nodes, (4) recurrent gastric cancer, (5) coexisting malignancy in other organs, (6) history of preoperative chemotherapy, and (7) death within 90 days of surgery that might have been caused by complications of the surgery. After excluding 274 out of 6,241 patients, data from 5,967 patients were analyzed retrospectively. The mean follow up time was 66.2 months including early deaths. Roux-en-Y esophagojejunostomy was performed after total gastrectomy except for one case of jejunal interposition. Gastroduodenostomy (Billroth I), gastrojejunostomy (Billroth II) or Roux-en-Y gastrojejunostomy was performed after distal gastrectomy. Gastrogastrostomy was performed after pylorus preserving gastrectomy. The standard extent of lymph node dissection was D2 and D1+ in patients with early gastric cancer. Follow-up data were collected on the basis of medical records and telephone interviews. Survival duration was calculated from the day of surgery to the day of the last follow-up at outpatient department or by telephone, or the day of the patient's death.
The process for generating the new staging system was as follows: (1) estimation of the hazard ratio (HR) of each T and N category; (2) transformation of HR into a weighting; (3) summation of the weightings of the T and N categories (P-score); and (4) homogeneity testing and grouping as well as testing for the appropriateness of the 5-year survival gap between adjacent stages.
The estimated survival probability of the 7th edition of the TNM staging system and respective combinations of T and N categories were estimated using the Kaplan-Meier method and were compared by the log-rank test. The HR within the Cox's proportional hazards model was used to assess the weighting of prognostic significance of the T and N categories. At first, the variables of age and sex were included to evaluate the interaction effect with the T and N categories, but the interactions were not significant. Thus, only data from the T and N categories were included in the model, and the HR was estimated. These HRs were simplified to the nearest number ending in 0.5 to meet both convenience and accuracy. We named this number the weighting. All the weightings of the T and N categories were confirmed to be within the 95% confidential intervals of their respective HRs and summated to make P-scores. The Cox's proportional hazards model was applied using P-scores to estimate the regression coefficient. This regression coefficient was applied to assess the homogeneity and grouping of P-scores using an estimated covariance matrix and Wald test. With sufficient numbers of patients in each stage, the Z-test was used to examine the discrimination of the 5-year survival rates between adjacent stages. The log-rank score within the Cox's proportional hazards model was used to demonstrate the discriminative ability of the new staging system in comparison with the 7th AJCC/UICC TNM classification. Higher log-rank scores were regarded to have better discriminative power in terms of survival. A P-value of less than 0.05 was considered statistically significant. No adjustments were made.
In total, 3,922 (65.7%) male patients and 2,045 (34.3%) female patients were included in this study. The mean age was 58.6 ± 11.5 years. Distal gastrectomy was performed in 4,547 patients (76.2%). Billroth I anastomosis was performed as the reconstruction method in 3,071 patients (67.5%) after distal gastrectomy. Mucosal cancer was noted in 1,723 patients (28.9%), and submucosal cancer was noted in 1,298 patients (21.8%). T4b cancer was noted in 84 patients (1.4%). The mean retrieved lymph node count was 39.9 ± 16.8. The number of patients with regional lymph node metastases was 2,344 (39.3%) (Table 1).
In survival analysis, 5-year survival rates according to the T categories of the 7th AJCC/UICC staging system were 92.4% for T1a, 90.6% for T1b, 86.4% for T2, 66.2% for T3, 39.4% for T4a, and 29.7% for T4b. Although a statistically significant difference in survival was noted between T1a and T1b (P = 0.025), the difference in 5-year survival rates was 1.8%, and the difference in survival between T4a and T4b did not reach statistical significance (P = 0.052). In the N categories, the 5-year survival rates were 90.2% for N0, 77.4% for N1, 62.4% for N2, 40.0% for N3a, and 21.4% for N3b (Fig. 1). Acceptable discrimination and distribution of survival were noted with the N categories of the 7th AJCC/UICC staging system. The survival curves for both the T and N categories did not intersect. Table 2 presents the 5-year survival rates of all the combinations of the T and N categories according to the 7th AJCC/UICC TNM staging system. T and N combinations of stage IIB exhibited a statistically significant difference in survival (P = 0.021) within the stage, and the difference in 5-year survival rates between T2N2M0 and T1N3M0 was 25.3% (80.0% vs. 54.7%, P = 0.012). Furthermore, a significant difference in survival was noted between T3N1M0 and T1N3M0 (74.5% vs. 54.7%, P = 0.009). A similar tendency was identified in stage IIIA patients with marginal significance (P = 0.067), exhibiting a lack of homogeneity. Table 2 also presents the 5-year survival difference between subgroups of the N3 category. A remarkable difference in the 5-year survival rates was noted between N3a and N3b, except in the T4bN3M0 group. Fig. 2 presents cumulative survival according to the 7th AJCC/UICC staging system. No statistically significant difference in survival was noted between stages IIB and IIIA (P = 0.152), and the intersection of the survival curves showed that these findings alluded to poor discriminative ability.
Our proposed staging system was designed to reflect the different prognostic significance of the T and N categories while preserving basic concepts of the TNM staging system. Thus, the presence of metastatic disease (M1) was regarded as stage IV. In addition, all of the T and N categories were preserved and included in the proposed staging system, in particular, the prognostic significance of both N3a and N3b was included in the final stage. Table 3 presents HRs of the T and N categories and the transformation of HRs into weightings. The weightings of the T categories were 1 for T1a, 1 for T1b, 1.5 for T2, 2.5 for T3, 4 for T4a, and 5 for T4b. The weightings for the N categories were 1 for N0, 1.5 for N1, 2 for N2, 3.5 for N3a, and 6 for N3b. The gaps in the weightings between adjacent T or N categories differed according to the different HRs of the T and N categories. Grouping of T and N categories according to the P-score for the new staging system is presented in Table 4. Twenty-six combinations of T, N, and M categories were grouped to generate an 8-stage system using the Wald test. Thus, the P-scores was 2 for stage IA, 2.5 for IIB, 3 and 3.5 for IIA, 4 and 4.5 for IIB, 5 and 5.5 for IIIA, 6 and 6.5 for IIIB, and 7 or greater for IIIC. Through new groupings, 6 of the 25 T and N combinations in the 7th AJCC/UICC staging system were moved to different stages in the new staging system. T2N2M0 of IIB moved to IIA, T3N2M0 of IIIA moved to IIB and T4aN0M0 of IIB moved to IIIA. T1N3bM0 of IIB, T2N3bM0 of IIIA, and T3N3bM0 of IIIB moved to IIIC and these shifts were caused by poor prognosis of the component N3b.
In the survival curves of the new staging system composed of 8 stages, except for a small gap in the 5-year survival rates between stages IA and IB (92.7% and 91.0%, P = 0.041), there was good discrimination of survival between adjacent stages and an appropriate gap in the 5-year survival rates in the new staging system. Although a statistically significant difference in survival was noted by the log-rank test between stages IA and IB, the 5-year survival rate difference was only 1.7%. No statistically significant difference in the 5-year survival rate was noted between stages IA and IB (P = 0.173) by the Z-test (Table 5). Thus, stages IA and IB were unified into stage I.
Fig. 3 depicts the survival distributions of the final version of the new 7-staged staging system on the basis of P-score. This system demonstrated homogeneity with respect to survival in each stage group and distinct differences in the 5-year survival rates of these groups. The log-rank score of the proposed staging system was increased compared with the 7th AJCC/UICC staging system (2,497.1 vs. 2,413.6), indicating that the proposed staging system exhibited better discriminative power.
The purpose of a cancer staging system is stated in the AJCC cancer staging manual. The extent or stage of cancer at the time of diagnosis is a key factor that defines prognosis and is a critical element in determining an appropriate treatment based on the experience and outcomes of groups of prior patients at a similar stage. In addition, accurate staging is necessary to evaluate the results of treatments and clinical trials, to facilitate the exchange and comparison of information among treatment centers, and to serve as a basis for clinical and translational cancer research [2]. Until recently, many appraisals of the 7th AJCC/UICC staging system for gastric cancer and proposals for new staging systems have been reported [181920]. In our proposed staging system, the increments in weightings varied from 0.5 to 2.5, and this feature might disclose the different prognostic significance of each component of the T and N categories. More weighting was placed on categories with more prognostic significance, depending on their HR. Some authors have reported that the subgroups of N3 (N3a and N3b) should be included in the final stage separately because the prognosis of the patients with N3a and N3b differs significantly [1021]. In our proposed staging system, the weightings of N3a and N3b were 3.5 and 6, respectively. This difference in the prognosis was reflected in the proposed staging system without modifications. Some authors insisted that heterogeneity existed within certain groups in the 7th AJCC/UICC staging system [16]. In the proposed staging system, the 5-year survival rate of T1N3aM0 was 66.0% and 22% for T1N3bM0. T1N3a remained in stage IIB, and T1N3bM0 moved to stage IIIC. According to prognostic significance, T2N2M0 moved from IIB to stage IIA, T3N2M0 moved from stage IIIA to stage IIB and T3N3bM0 moved from stage IIIB to stage IIIC. Some authors reported poor discrimination of survival between stage IB and IIA and between IIB and IIIA in the 7th AJCC/UICC staging system [48192022]. In the present study, poor discrimination of survival was noted between stage IIB and IIIA with intersection of the survival curves by the 7th AJCC/UICC staging system. However, in the staging system proposed here, the survival curves did not intersect, and a significant survival difference was observed. Dikken et al. [23] reported increased complexity without prognostic significance in the 7th AJCC/UICC staging system. Our proposed staging system is composed of seven stages that fall in the middle of the 6th and 7th AJCC/UICC staging system with regard to complexity. Warneke et al. [19] stressed that there was no consideration for different biologic behavior between the primary tumor and lymph node metastasis. The 7th AJCC/UICC staging system reflects a mathematical model where the addition of the values of the T and N categories equals the same sum total in each subgroup. This feature is not an adequate categorization of the tumor's biologic potential or prognosis. In the proposed system, we convert the HRs of each T and N category to weightings, which reflect the prognosis by providing more weight to categories with worse biological behavior. Then, the weightings of the T and N categories are added to make P-scores. Thus, the different biological behavior between the status of the primary tumor and lymph node metastasis is simultaneously considered. Several benchmarks for comparing the performance of two staging systems have been suggested [23]. First, there should be homogeneity within stage groups. Second, there should be discrimination between stage groups; patients in different stage groups should have larger differences in survival. Third, the staging system should have good predictive accuracy. Fourth, the staging system should be as simple and intuitive as possible in clinical practice. Our proposed system, for the most part, appears to satisfy these four benchmarks. As for discrimination, in this study, the survival difference between stage IA and IB in the 7th AJCC/UICC system was small and unified to achieve relatively good discrimination of survival rates between neighboring stages. This unification of stage IA and IB was statistically verified to be rational by the result of Z-test presented in Table 5. Conversion of the T and N categories to weightings is simple and easy to perform. The staging process is easier and more scientific compared with the 7th AJCC/UICC staging system because the stage is determined by the P-scores.
Furthermore, this study is the first to propose a staging system generated on the basis of prognostic outcome of HR of respective T and N categories to reflect different prognostic impacts of respective T and N categories overcoming the limitations of anatomical and mathematical model.
In conclusion, the newly proposed staging system preserves the basic concepts of TNM classification, reflecting different prognostic significance for the T and N categories using a P-score, dividing N3 into N3a and N3b in stage grouping, and unifying stages IA and IB into a single stage I. This system demonstrates homogeneity of each stage group with respect to survival and a distinct difference in the survival rates between stages. Therefore, the TNM system for gastric cancer based on this P-score appears to be more scientifically accurate than the 7th AJCC/UICC staging system for gastric cancer.
ACKNOWLEDGEMENTS
We thank all the members of Daegu Gastric Cancer Study Group for their contribution to the manuscript.
References
1. Jemal A, Bray F, Center MM, Ferlay J, Ward E, Forman D. Global cancer statistics. CA Cancer J Clin. 2011; 61:69–90.
2. Edge SB, Byrd DR, Compton CC, Fritz AG, Greene FL, Trotti A, editors. AJCC cancer staging manual. 7th ed. New York: Springer;2010.
3. Fang WL, Huang KH, Chen JH, Lo SS, Hsieh MC, Shen KH, et al. Comparison of the survival difference between AJCC 6th and 7th editions for gastric cancer patients. World J Surg. 2011; 35:2723–2729.
4. Wang W, Sun XW, Li CF, Lv L, Li YF, Chen YB, et al. Comparison of the 6th and 7th editions of the UICC TNM staging system for gastric cancer: results of a Chinese single-institution study of 1,503 patients. Ann Surg Oncol. 2011; 18:1060–1067.
5. Zhang J, Zhou Y, Jiang K, Shen Z, Ye Y, Wang S. Evaluation of the seventh AJCC TNM staging system for gastric cancer: a meta-analysis of cohort studies. Tumour Biol. 2014; 35:8525–8532.
6. Graziosi L, Marino E, Cavazzoni E, Donini A. Prognostic value of the seventh AJCC/UICC TNM classification of non-cardia gastric cancer. World J Surg Oncol. 2013; 11:103.
7. McGhan LJ, Pockaj BA, Gray RJ, Bagaria SP, Wasif N. Validation of the updated 7th edition AJCC TNM staging criteria for gastric adenocarcinoma. J Gastrointest Surg. 2012; 16:53–61.
8. Yoon HM, Ryu KW, Nam BH, Cho SJ, Park SR, Lee JY, et al. Is the new seventh AJCC/UICC staging system appropriate for patients with gastric cancer? J Am Coll Surg. 2012; 214:88–96.
9. Marano L, Boccardi V, Braccio B, Esposito G, Grassia M, Petrillo M, et al. Comparison of the 6th and 7th editions of the AJCC/UICC TNM staging system for gastric cancer focusing on the "N" parameter-related survival: the monoinstitutional NodUs Italian study. World J Surg Oncol. 2015; 13:215.
10. Yeh CN, Wang SY, Hsu JT, Chiang KC, Cheng CT, Tsai CY, et al. N3 subclassification incorporated into the final pathologic staging of gastric cancer: a modified system based on current AJCC staging. Medicine (Baltimore). 2015; 94:e575.
11. Jun KH, Lee JS, Kim JH, Kim JJ, Chin HM, Park SM. The rationality of N3 classification in the 7th edition of the International Union Against Cancer TNM staging system for gastric adenocarcinomas: a case-control study. Int J Surg. 2014; 12:893–896.
12. Spolverato G, Ejaz A, Kim Y, Squires MH, Poultsides G, Fields RC, et al. Prognostic performance of different lymph node staging systems after curative intent resection for gastric adenocarcinoma. Ann Surg. 2015; 262:991–998.
13. Wang J, Dang P, Raut CP, Pandalai PK, Maduekwe UN, Rattner DW, et al. Comparison of a lymph node ratio-based staging system with the 7th AJCC system for gastric cancer: analysis of 18,043 patients from the SEER database. Ann Surg. 2012; 255:478–485.
14. Wu XJ, Miao RL, Li ZY, Bu ZD, Zhang LH, Wu AW, et al. Prognostic value of metastatic lymph node ratio as an additional tool to the TNM stage system in gastric cancer. Eur J Surg Oncol. 2015; 41:927–933.
15. Li X, Liu Y, Cao B, Liu B, Bai T, Li X, et al. Metastatic lymph node ratio and prognosis of gastric cancer at different pT stages. Hepatogastroenterology. 2015; 62:507–511.
16. Rocken C, Behrens HM. Validating the prognostic and discriminating value of the TNM-classification for gastric cancer: a critical appraisal. Eur J Cancer. 2015; 51:577–586.
17. Reim D, Loos M, Vogl F, Novotny A, Schuster T, Langer R, et al. Prognostic implications of the seventh edition of the international union against cancer classification for patients with gastric cancer: the Western experience of patients treated in a single-center European institution. J Clin Oncol. 2013; 31:263–271.
18. Zhang J, Jiang K, Liu Y, Ye Y, Lv L, Shen Z, et al. Proposal of a new lymph node staging system for gastric cancer: study from two institutions in China. Med Oncol. 2014; 31:229.
19. Warneke VS, Behrens HM, Hartmann JT, Held H, Becker T, Schwarz NT, et al. Cohort study based on the seventh edition of the TNM classification for gastric cancer: proposal of a new staging system. J Clin Oncol. 2011; 29:2364–2371.
20. Patel MI, Rhoads KF, Ma Y, Ford JM, Visser BC, Kunz PL, et al. Seventh edition (2010) of the AJCC/UICC staging system for gastric adenocarcinoma: is there room for improvement? Ann Surg Oncol. 2013; 20:1631–1638.
21. Sun Z, Wang ZN, Zhu Z, Xu YY, Xu Y, Huang BJ, et al. Evaluation of the seventh edition of American Joint Committee on Cancer TNM staging system for gastric cancer: results from a Chinese monoinstitutional study. Ann Surg Oncol. 2012; 19:1918–1927.
22. Kikuchi S, Futawatari N, Sakuramoto S, Katada N, Yamashita K, Shibata T, et al. Comparison of staging between the old (6th edition) and new (7th edition) TNM classifications in advanced gastric cancer. Anticancer Res. 2011; 31:2361–2365.
23. Dikken JL, van de Velde CJ, Gonen M, Verheij M, Brennan MF, Coit DG. The New American Joint Committee on Cancer/International Union Against Cancer staging system for adenocarcinoma of the stomach: increased complexity without clear improvement in predictive accuracy. Ann Surg Oncol. 2012; 19:2443–2451.