Journal List > Korean J Pain > v.36(3) > 1516083088

Hong: P value, it is just not enough
According to the instructions for authors in The Korean Journal of Pain (KJP), it is recommended to provide an effect size and its corresponding estimates. Additionally, it suggests being cautious about reporting P values alone.
“Confidence intervals or effect sizes should be presented with P values. P values should not be presented alone and should be presented with confidence intervals.”
Furthermore, in the consolidated standards of reporting trials (CONSORT) statement, which serves as the reporting guidelines for randomized controlled trials (RCTs), the following reporting items are recommended to be included [1].
“For each primary and secondary outcome, results for each group, and the estimated effect size and its precision (such as 95% confidence interval). For binary outcomes, presentation of both absolute and relative effect sizes is recommended.”
From 2020 onwards, a search in the KJP archive with the terms "randomized" or "randomised" in the title yielded a total of 23 RCTs (Supplementary Table 1). Among these studies, only three reported the effect size and confidence interval (CI) for the primary outcome [2-4]. In some studies, only the effect size according to the measurement time points within the groups was reported without the effect size and CI between the groups.
A P value is a statistical measure employed in null hypothesis significance testing to indicate the likelihood of obtaining the observed results assuming the null hypothesis is true. It also serves as a criterion for accepting or rejecting the alternative hypothesis, based on a predetermined threshold. However, this binary decision-making process has created an obsession and misuse of P values, often treating a P value < 0.05 as a guarantee that the hypothesis being tested is true [5]. Paradoxically, this practice has led to an increase in false positive studies and raised concerns about the reproducibility of research findings. As a result, there is a growing consensus advocating for a reduction in the threshold for statistical significance [6].
However, it is important to note that a P value < 0.05 (in general) does not represent the probability of the alternative hypothesis being true. Moreover, the P value itself does not provide information about the extent of the observed difference, regardless of the chosen threshold for significance. Therefore, it is not appropriate to determine the significance of the research results solely based on the magnitude of the P value. In 2016, the American Statistical Association released principles that addressed the appropriate use of P values [7]. Among them, I believe the following sentence carries significant implications.
“A P-value, or statistical significance, does not measure the size of an effect or the importance of a result.”
The limitations of the P value can be overcome by providing the effect size and its estimated CI, which can more effectively express research findings [8,9].
Furthermore, clinical significance in research findings can be evaluated by establishing a “minimally clinically important difference (MCID)” prior to the study commencement and comparing the CI of the effect size to that predetermined difference [10]. The term MCID refers to the smallest change in a clinical outcome that patients perceive as meaningful and significant [11]. The emphasis is on recognizing clinically important or meaningful changes from the patient's perspective, rather than simply focusing on statistically significant differences. For example, if the MCID for a pain score is determined to be ‘1’ based on prior evidence [12], and the lower limit of the CI in the study is greater than ‘1’, it indicates that the study findings are not only statistically significant but also hold clinical significance. In well-designed RCTs, readers should be able to assess the clinical significance of the research findings independently of statistical significance, and the effect size and CI can play a role in facilitating this assessment [13].
Although readers can calculate the effect size directly using the provided data (mean, standard deviation, and sample size for continuous outcome, or a contingency table for binary outcome), providing the effect size can alleviate this burden for readers. Moreover, it enhances the clarity of research findings, promotes transparency, and facilitates a better understanding of the study’s impact.
Given these considerations, I strongly advocate for a more robust and explicit recommendation regarding reporting effect size measures and their corresponding estimates.

Notes

SUPPLEMENTARY MATERIALS

Supplementary materials can be found via https://doi.org/10.3344/kjp.23173.

DATA AVAILABILITY

Data sharing is not applicable to this article as no datasets were generated or analyzed for this paper.

CONFLICT OF INTEREST

No potential conflict of interest relevant to this article was reported.

AUTHOR CONTRIBUTIONS

Boohwi Hong: Writing/manuscript preparation.

REFERENCES

1. Schulz KF, Altman DG, Moher D. CONSORT Group. 2010; CONSORT 2010 statement: updated guidelines for reporting parallel group randomised trials. BMJ. 340:c332. DOI: 10.1136/bmj.c332. PMID: 20332509. PMCID: PMC2844940.
crossref
2. Govil N, Parag K, Arora P, Khandelwal H, Singh A. Ruchi. 2020; Perioperative duloxetine as part of a multimodal analgesia regime reduces postoperative pain in lumbar canal stenosis surgery: a randomized, triple blind, and placebo-controlled trial. Korean J Pain. 33:40–7. DOI: 10.3344/kjp.2020.33.1.40. PMID: 31888316. PMCID: PMC6944370.
crossref
3. Jeong H, Choi JW, Sim WS, Kim DK, Bang YJ, Park S, et al. 2022; Ultrasound-guided erector spinae plane block for pain management after gastrectomy: a randomized, single-blinded, controlled trial. Korean J Pain. 35:303–10. DOI: 10.3344/kjp.2022.35.3.303. PMID: 35768985. PMCID: PMC9251398.
crossref
4. Lee GY, Lee JW, Lee E, Yeom JS, Kim KJ, Shin HI, et al. 2022; Evaluation of the efficacy and safety of epidural steroid injection using a nonparticulate steroid, dexamethasone or betamethasone: a double-blind, randomized, crossover, clinical trial. Korean J Pain. 35:336–44. DOI: 10.3344/kjp.2022.35.3.336. PMID: 35768989. PMCID: PMC9251387.
crossref
5. Hadjipavlou G, Siviter R, Feix B. 2021; What is the true worth of a P-value? Time for a change. Br J Anaesth. 126:564–7. DOI: 10.1016/j.bja.2020.10.042. PMID: 33419530.
crossref
6. Chuang Z, Martin J, Shapiro J, Nguyen D, Neocleous P, Jones PM. 2023; Minimum false-positive risk of primary outcomes and impact of reducing nominal P-value threshold from 0.05 to 0.005 in anaesthesiology randomised clinical trials: a cross-sectional study. Br J Anaesth. 130:412–20. DOI: 10.1016/j.bja.2022.11.001. PMID: 36503825.
crossref
7. Wasserstein RL, Lazar NA. 2016; The ASA statement on p-values: context, process, and purpose. Am Stat. 70:129–33. DOI: 10.1080/00031305.2016.1154108.
8. Lee DK. 2016; Alternatives to P value: confidence interval and effect size. Korean J Anesthesiol. 69:555–62. DOI: 10.4097/kjae.2016.69.6.555. PMID: 27924194. PMCID: PMC5133225. PMID: ce5d9e7a1f674dea89090d92a9c8c0b8.
crossref
9. Sullivan GM, Feinn R. 2012; Using effect size-or why the P value is not enough. J Grad Med Educ. 4:279–82. DOI: 10.4300/JGME-D-12-00156.1. PMID: 23997866. PMCID: PMC3444174.
10. Lee S. 2016; Avoiding negative reviewer comments: common statistical errors in anesthesia journals. Korean J Anesthesiol. 69:219–26. DOI: 10.4097/kjae.2016.69.3.219. PMID: 27274365. PMCID: PMC4891532. PMID: 741d6494a6f64058bc4e615d3111b963.
crossref
11. Muñoz-Leyva F, El-Boghdadly K, Chan V. 2020; Is the minimal clinically important difference (MCID) in acute pain a good measure of analgesic efficacy in regional anesthesia? Reg Anesth Pain Med. 45:1000–5. DOI: 10.1136/rapm-2020-101670. PMID: 32900985.
crossref
12. Myles PS, Myles DB, Galagher W, Boyd D, Chew C, MacDonald N, et al. 2017; Measuring acute postoperative pain using the visual analog scale: the minimal clinically important difference and patient acceptable symptom state. Br J Anaesth. 118:424–9. DOI: 10.1093/bja/aew466. PMID: 28186223.
crossref
13. Park S. 2016; Significant results: statistical or clinical? Korean J Anesthesiol. 69:121–5. DOI: 10.4097/kjae.2016.69.2.121. PMID: 27066201. PMCID: PMC4823405. PMID: 6613dbd08d704885bf44a164e9ac2ec3.
crossref
TOOLS
Similar articles