We performed simulations to investigate the finite-sample performance of the proposed methods in terms of bias and standard deviation (SD) of the cutpoint estimator and the empirical power of the maximally selected test statistics. We generated a time to the event of interest (
X) and time to a CR (
Y) using Gumbel's bivariate exponential distribution
1314 with degree of dependency α set as 0 and 0.3. We generated a non-informative censoring time C from an exponential distribution with a hazard rate of λ(>0), which was determined to satisfy
P(
C<
X⋀
Y)=
p, where
p denotes the censoring fraction. We set
p=0 and 0.3. We also generated the prognostic factor
Z from a uniform distribution
U(0,1) and set the true cutpoint value µ as 0.5. We set ε
1=0.1 and ε
2=0.9. The effect size θ=exp(β) was the relative risk between the two groups of patients, where
Z was greater than or less than or equal to µ. We set β as 0, 1, 1.5, 2, and 3: β=0 corresponded to the null hypothesis and β=1, 1.5, 2, and 3 corresponded to the alternative hypotheses. We considered four combinations of α and
p: (α,
p)=(0,0), (0,0.3), (0.3,0), (0.3,0.3). We performed 500 replications for each configuration of α and
p with sample sizes of 50, 100, and 200. We also permuted each sample 250 times to obtain the empirical null distribution of
Q(ε
1,ε
2).
Fig. 2 depicts the empirical
p values of the proposed test
Q(ε
1,ε
2) based on a simulated sample against the number
b of permutation times for each combination of α and
p under
H0:β=0. As shown in
Fig. 2, the number
B=250 of permutation times was chosen as acceptable regardless of the sample size and the combinations of α and
p. See the work
6 for details of the data generation procedures.
Fig. 3 depicts the empirical distribution function of μ̂ based on 2000 replications, with the sample size of 100 for each combination of α and
p when β=0, 1, 1.5, 2, and 3. As expected, regardless of the combinations of α and
p, the distribution of μ̂ is centered around the true cutpoint value 0.5 of µ as β increases from 0 to 3.
Table 2 displays the proportion of CR, the bias (Bias) and SD of the cutpoint estimator μ̂, and the approximated (A.Pow) and permutation-based (P.Pow) power of the proposed test statistic
Q(ε
1,ε
2). These are based on 500 replications and 250 permutations for each combination of α and
p when
n=50, 100, and 200. As expected, as β increases, the SD of μ̂ decreases gradually, and both A.Pow and P.Pow converge to 1 regardless of the combination of α and
p and sample size
n. For a fixed value of β, as
n increases, both Bias and the SD of μ̂ decrease; also, both A.Pow and P.Pow increase for any combination of α and
p. The permutation-based test (P.Pow), included in the interval (0.031,0.069), satisfies a significance level of 0.05 for most cases, while the approximated test (A.Pow) is very conservative. Furthermore, P.Pow is larger than A.Pow regardless of the combination of α and
p and sample size
n.