GRSDB: a database of quadruplex forming G-rich sequences in alternatively processed mammalian pre-mRNA sequences

Nucleic Acids Res. 2006 Jan 1;34(Database issue):D119-24. doi: 10.1093/nar/gkj073.

Abstract

Guanine-rich nucleic acids are known to form highly stable G-quadruplex structures, also known as G-quartets. Recently, there has been a tremendous amount of interest in studying G-quadruplexes owing to the realization of their biological importance. G-rich sequences (GRSs) capable of forming G-quadruplexes are found in the vicinity of polyadenylation regions and are involved in regulating 3' end processing of mammalian pre-mRNAs. G-rich motifs are also known to play an important role in alternative, tissue-specific splicing by interacting with hnRNP H protein subfamily. Whether quadruplex structure directly plays a role in regulating RNA processing events requires further investigation. To date there has not been a comprehensive effort to study G-quadruplexes near RNA processing sites. We have applied a computational approach to map putative Quadruplex forming GRSs within the transcribed regions of a large number of alternatively processed human and mouse gene sequences that were obtained as fully annotated entries from GenBank and RefSeq. We have used the computed data to build the GRSDB database that provides a unique avenue for studying G-quadruplexes in the context of RNA processing sites. GRSDB website offers visual comparison of G-quadruplex distribution patterns among all the alternative RNA products of a gene with the help of dynamic graphics. At present, GRSDB contains data from 1310 human and mouse genes, of which 1188 are alternatively processed. It has a total of 379,223 predicted G-quadruplexes, of which 54,252 are near RNA processing sites. GRSDB is a good resource for researchers interested in investigating the functional relevance of G-quadruplexes, especially in the context of alternative RNA processing. It can be accessed at http://bioinformatics.ramapo.edu/grsdb/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Animals
  • Base Sequence
  • Computer Graphics
  • Databases, Nucleic Acid*
  • Guanine / chemistry*
  • Humans
  • Internet
  • Mice
  • RNA 3' End Processing
  • RNA 3' Polyadenylation Signals
  • RNA Precursors / chemistry*
  • RNA Splice Sites
  • RNA, Messenger / chemistry*
  • Regulatory Sequences, Ribonucleic Acid*
  • User-Computer Interface

Substances

  • RNA Precursors
  • RNA Splice Sites
  • RNA, Messenger
  • Regulatory Sequences, Ribonucleic Acid
  • Guanine