Causal Genetic Inference Using Haplotypes as Instrumental Variables

Genet Epidemiol. 2016 Jan;40(1):35-44. doi: 10.1002/gepi.21940. Epub 2015 Dec 1.

Abstract

In genomic studies with both genotypes and gene or protein expression profile available, causal effects of gene or protein on clinical outcomes can be inferred through using genetic variants as instrumental variables (IVs). The goal of introducing IV is to remove the effects of unobserved factors that may confound the relationship between the biomarkers and the outcome. A valid inference under the IV framework requires pairwise associations and pathway exclusivity. Among these assumptions, the IV expression association needs to be strong for the casual effect estimates to be unbiased. However, a small number of single nucleotide polymorphisms (SNPs) often provide limited explanation of the variability in the gene or protein expression and can only serve as weak IVs. In this study, we propose to replace SNPs with haplotypes as IVs to increase the variant-expression association and thus improve the casual effect inference of the expression. In the classical two-stage procedure, we developed a haplotype regression model combined with a model selection procedure to identify optimal instruments. The performance of the new method was evaluated through simulations and compared with the IV approaches using observed multiple SNPs. Our results showed the gain of power to detect a causal effect of gene or protein on the outcome using haplotypes compared with using only observed SNPs, under either complete or missing genotype scenarios. We applied our proposed method to a study of the effect of interleukin-1 beta (IL-1β) protein expression on the 90-day survival following sepsis and found that overly expressed IL-1β is likely to increase mortality.

Keywords: Mendelian randomization; causal effect estimate; haplotype; instrumental variable (IV).

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computer Simulation
  • Haplotypes
  • Humans
  • Interleukin-1beta / genetics
  • Mendelian Randomization Analysis / methods*
  • Models, Genetic*
  • Polymorphism, Single Nucleotide*
  • Shock, Septic / genetics
  • Shock, Septic / mortality

Substances

  • Interleukin-1beta