As will be demonstrated momentarily, a version of Theorem 1 does not hold for the linear trend test. This test, versions of which were first proposed by Cochran (1954) and Armitage (1955), uses the linear contrast to define the linear component of . A direct interpretation of the test is that the null holds if and only if the least-squares slope when is regressed upon is zero.
To gain a general sense of how this test is affected by misclassification, we compare to for a large ensemble of , , and P values. Particularly, we fix exposure categories and also fix . Then values of are drawn from 12 different probability distributions. The first three distributions arise from fixing , thinking of a uniform distribution for X as being a simple but important special case. A distribution on is assigned by fixing without loss of generality, and then taking the increments to be distributed as , so that is increasing and is fixed. In our experiments we take , which allows the possibility of far from linear patterns (as c increases the distribution puts more weight on relationships which are closer to linear). Then the three distributions are based on sampling P from a specified distribution over tridiagonal classification matrices, and conditioning on either (i) P being non-monotone, (ii) P being monotone but not tapered, or (iii) P being tapered. The distribution on P generates tridiagonal matrices by independently drawing , , and , for . We use , giving a mean of 0.667 and a standard deviation of 0.085 for each diagonal element of P, before conditioning. This ensures that a wide range of classification matrices is being generated, with the typical extent of misclassification being quite considerable.
The next three distributions arise exactly as per the first three, except is now also taken as random, with , so that each has mean . This allows departures from X being uniformly distributed (which would correspond to ). We wish to take d small enough to engender substantial departures from uniformity, but not so small as to yield distributions that place almost no mass on one or more values. Thus we set , which induces a standard deviation of 0.053 for each .
Results for arising from these six distributions appear in Figure 1. For both the general test and the trend test, results are given in terms of relative power, defined as the power achieved from misclassified data at the sample size for which ideal data yield 80% power. That is, if denotes the distribution function for the noncentral distribution with noncentrality t, then the relative power is
where solves . For the general test, Theorem 1 guarantees that the relative power cannot exceed 80%. Figure 1 shows contrary behavior for the trend test. At least when the distribution of X can depart from uniformity, there are values of P (of all three types) and and for which misclassification induces power gain. Of course these results are based on first-order asymptotic theory; in Appendix A we present a small simulation indicating that the theory does indeed translate to the finite-sample case. Also, while our simulated scenarios never produced a power gain when X is uniformly distributed, in Appendix B we exhibit a and a tridiagonal, tapered P such that there is power gain in this case.
Figure 1 Relative power for the trend test and for the general test, with tridiagonal misclassification matrices. The panels give 1,000 realizations from each of the six distributions over described in the text, with fixed (random) in the upper (lower) panels. Quartiles of relative power are indicated (diamonds), and the reference lines correspond to 80% relative power
The remaining six distributions for are constructed as per the first six described above, but with the underlying distribution of P modified so as to generate non-banded matrices. This is achieved by taking the rows of P to be independently distributed as Dirichlet over all k categories. To mimic the earlier construction, for each row the parameters of the Dirichlet distribution are taken to be a for the correct category and to sum to b over the incorrect categories, implying the same distribution for diagonal elements as previously. For all but the first and last rows, sub-sums of are forced, both below and above the correct category. The specification is completed by fixing a value of r such that the Dirichlet parameters for a given row decay geometrically by a factor of r as we move further away from the correct category. We take in generating the results of Figure 2. Qualitatively, we see very similar behavior as before with tridiagonal misclassification.
Figure 2 Relative power for the trend test and for the general test, with non-banded misclassification matrices. The panels give 1,000 realizations from each of the six distributions over described in the text, with fixed (random) in the upper (lower) panels. Quartiles of relative power are indicated (diamonds), and the reference lines correspond to 80% relative power
To seek further insight into when misclassification increases power, we focus on the situation of non-uniform X distributions and tapered misclassification matrices, i.e., the settings underlying the lower right panels of both Figures 1 and 2. For these simulated values, in Figure 3 we first plot the relative power of the trend test against the absolute ratio of to . Values of this ratio larger than one correspond to the misclassification inflating the slope which summarizes the linear component of the exposure–disease relationship. In situations involving linear regression of a continuous outcome on a continuous exposure, nondifferential exposure misclassification is quite widely guaranteed to attenuate the regression slope being estimated [see, for instance, Gustafson (2004); Carroll et al. (2006)]. Figure 3 shows that such a guarantee does not apply to the present situation, but it is perhaps not surprising that attenuation is much more common than inflation across the simulated scenarios. What is more surprising is that slope inflation is neither a necessary condition nor a sufficient condition for power gain. That is, in some scenarios the misclassification inflates the slope yet power is lost for detecting that the slope is not zero. And in some situations, the misclassification attenuates the slope, yet power is gained for detecting that the slope is not zero. So the behavior of the slopes alone is not determining whether power is lost or gained.
Next in Figure 3 we plot the relative power against , to see whether the worst probability of correct classification across exposure categories is a driving force behind the power of the test based on misclassified data. The plots provide a negative answer here, indicating very little association, if any, between this characteristic of the classification matrix and the relative power.
Finally, we consider the extent to which misclassification pushes the exposure distribution toward or away from uniformity. We define the log sum-of-squares ratio (LSSR) as
So, for instance, a large positive value of LSSR describes a case where the misclassification induces a distribution of that is much further from uniform than the distribution of X. Figure 3 exhibits quite strong negative associations between relative power and LSSR, in both the tridiagonal and non-banded cases. Thus scenarios with power gain tend to have misclassification which makes much more uniformly distributed than X. But again this is only a stochastic tendency with respect to the chosen distribution of . A low LSSR is neither a necessary nor sufficient condition for power gain. In general, there seems to be considerable complexity in how the distribution of X, the distribution of , and the misclassification matrix P combine to determine the performance of the trend test applied to misclassified data.
Figure 3 Relative power of the trend test versus (i) absolute ratio of slopes (top panels), (ii) smallest diagonal element of P (middle panels), and (iii) LSSR (bottom panels). The misclassification matrices are tapered throughout. The left (right) panels correspond to tridiagonal (non-banded) misclassification matrices. For both distributions of , 1,000 realizations are simulated
Comments (0)