Genotype Relative Risks:
Methods for Design and Analysis of Candidate-gene
Association Studies
D.J. Schaid, S.S. Sommer
Department of Health Sciences Research, Mayo
Clinic/Foundation, Rochester, MN 55905.
American Journal of Human Genetics ,
53(5), 1114-1126 (1993)
Abstract
Design and analysis methods are presented for studying the association of a candidate
gene with a disease by using parental data in place of nonrelated controls. This alternative design
eliminates spurious differences in allele frequencies between cases and nonrelated controls
resulting from different ethnic origins and population stratification for these two groups. We
present analysis methods which are based on two genetic relative risks: (1) the relative risk of
disease for homozygotes with two copies of the candidate gene versus homozygotes without the
candidate gene and (2) the relative risk for heterozygotes with one copy of the candidate gene
versus homozygotes without the candidate gene. In addition to estimating the magnitude of these
relative risks, likelihood methods allow specific hypotheses to be tested, namely, a test for overall
association of the candidate gene with disease, as well as specific genetic hypotheses, such as
dominant or recessive inheritance. Two likelihood methods are presented: (1) a likelihood method
appropriate when Hardy-Weinberg equilibrium holds and (2) a likelihood method in which we
condition on parental genotype data when Hardy-Weinberg equilibrium does not hold. The
results for the relative efficiency of these two methods suggest that the conditional approach may
at times be preferable, even when equilibrium holds. Sample-size and power calculations are
presented for a multitiered design. The purpose of tier 1 is to detect the presence of an abnormal
sequence for a postulated candidate gene among a small group of cases. The purpose of tier 2 is to
test for association of the abnormal variant with disease, such as by the likelihood methods
presented. The purpose of tier 3 is to confirm positive results from tier 2. Results indicate that
required sample sizes are smaller when expression of disease is recessive, rather than dominant,
and that, for recessive disease and large relative risks, necessary sample sizes may be feasible,
even if only a small percentage of the disease can be attributed to the candidate gene.