Coalescent-based association mapping and fine mapping of complex trait loci

Genetics. 2005 Feb;169(2):1071-92. doi: 10.1534/genetics.104.031799. Epub 2004 Oct 16.

Abstract

We outline a general coalescent framework for using genotype data in linkage disequilibrium-based mapping studies. Our approach unifies two main goals of gene mapping that have generally been treated separately in the past: detecting association (i.e., significance testing) and estimating the location of the causative variation. To tackle the problem, we separate the inference into two stages. First, we use Markov chain Monte Carlo to sample from the posterior distribution of coalescent genealogies of all the sampled chromosomes without regard to phenotype. Then, averaging across genealogies, we estimate the likelihood of the phenotype data under various models for mutation and penetrance at an unobserved disease locus. The essential signal that these models look for is that in the presence of disease susceptibility variants in a region, there is nonrandom clustering of the chromosomes on the tree according to phenotype. The extent of nonrandom clustering is captured by the likelihood and can be used to construct significance tests or Bayesian posterior distributions for location. A novelty of our framework is that it can naturally accommodate quantitative data. We describe applications of the method to simulated data and to data from a Mendelian locus (CFTR, responsible for cystic fibrosis) and from a proposed complex trait locus (calpain-10, implicated in type 2 diabetes).

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alleles
  • Bayes Theorem
  • Calpain / genetics
  • Chromosome Mapping / methods*
  • Chromosome Mapping / statistics & numerical data
  • Computer Simulation
  • Cystic Fibrosis / genetics
  • Cystic Fibrosis Transmembrane Conductance Regulator / genetics
  • Diabetes Mellitus, Type 2 / genetics
  • Disease Susceptibility
  • Genetic Variation
  • Haplotypes
  • Humans
  • Likelihood Functions
  • Linkage Disequilibrium
  • Markov Chains
  • Models, Genetic
  • Monte Carlo Method
  • Mutation
  • Pedigree
  • Penetrance
  • Polymorphism, Single Nucleotide
  • Quantitative Trait, Heritable*
  • Recombination, Genetic

Substances

  • CFTR protein, human
  • Cystic Fibrosis Transmembrane Conductance Regulator
  • Calpain
  • calpain 10