An update on LNCipedia: a database for annotated human lncRNA sequences

Nucleic Acids Res. 2015 Jan;43(Database issue):D174-80. doi: 10.1093/nar/gku1060. Epub 2014 Nov 5.

Abstract

The human genome is pervasively transcribed, producing thousands of non-coding RNA transcripts. The majority of these transcripts are long non-coding RNAs (lncRNAs) and novel lncRNA genes are being identified at rapid pace. To streamline these efforts, we created LNCipedia, an online repository of lncRNA transcripts and annotation. Here, we present LNCipedia 3.0 (http://www.lncipedia.org), the latest version of the publicly available human lncRNA database. Compared to the previous version of LNCipedia, the database grew over five times in size, gaining over 90,000 new lncRNA transcripts. Assessment of the protein-coding potential of LNCipedia entries is improved with state-of-the art methods that include large-scale reprocessing of publicly available proteomics data. As a result, a high-confidence set of lncRNA transcripts with low coding potential is defined and made available for download. In addition, a tool to assess lncRNA gene conservation between human, mouse and zebrafish has been implemented.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Databases, Nucleic Acid*
  • HEK293 Cells
  • Humans
  • Internet
  • Mice
  • Molecular Sequence Annotation
  • Proteins / genetics
  • RNA, Long Noncoding / chemistry*
  • RNA, Long Noncoding / genetics
  • Sequence Analysis, RNA
  • Zebrafish / genetics

Substances

  • Proteins
  • RNA, Long Noncoding