Comparison of induced and cancer-associated mutational spectra using multivariate data analysis

Carcinogenesis. 2008 Apr;29(4):772-8. doi: 10.1093/carcin/bgn053. Epub 2008 Feb 22.

Abstract

One of the most useful tools for investigating the aetiopathology of cancer is the mutation spectrum, which comprises the type and distribution of mutations within a gene sequence. Many studies have generated mutagen-induced spectra using in vitro or in vivo model systems in an attempt to find correlations with those observed in cancer-associated genes such as the TP53 tumour suppressor gene. Consequently, meaningful similarities in the types of mutation found in induced and human spectra have been demonstrated. However, it is more difficult to draw such conclusions about the distribution or sequence context of mutations when they arise in different target sequences. We have developed an analytical approach for base substitution spectra that capture information for both sequence context and mutation type simultaneously. The resulting mutation signature is a fixed set of data points that allows comparison of multiple mutation spectra regardless of sequence. We have applied this method to a mixed set of mutation spectra observed in exons 5, 7 and 8 of TP53 from cancers of brain, breast, skin, colon, oesophagus, liver, head and neck, stomach and lung (smokers and non-smokers) and spectra induced by benzo[a]pyrene diol epoxide, ultraviolet (UV) B, UVC, simulated sunlight and hydroxyl radicals in the cII, supF and yeast p53 model systems. We demonstrate that this approach allows human cancer and mutagen-induced signatures to be grouped together according to similarity. Specifically, the analysis reveals key differences between smoking- and non-smoking-related lung cancer for TP53 mutations and the mutability of CpG sites between exons in skin cancer.

MeSH terms

  • Animals
  • Animals, Genetically Modified
  • Carcinogens
  • DNA Mutational Analysis*
  • Humans
  • Lung Neoplasms / etiology
  • Mice
  • Multivariate Analysis
  • Mutagens
  • Mutation*
  • Neoplasms / etiology*
  • Neoplasms / genetics*
  • Neoplasms / pathology
  • Smoking / adverse effects

Substances

  • Carcinogens
  • Mutagens