BioXpress: an integrated RNA-seq-derived gene expression database for pan-cancer analysis

Database (Oxford). 2015 Mar 28:2015:bav019. doi: 10.1093/database/bav019. Print 2015.

Abstract

BioXpress is a gene expression and cancer association database in which expression levels are mapped to genes using RNA-seq data obtained from The Cancer Genome Atlas, the International Cancer Genome Consortium, Expression Atlas and publications. The database includes expression data from 64 cancer types, 6361 patients and 17 469 genes, 9513 of which display differential expression between tumor and normal samples. In addition to data retrieved directly from RNA-seq data repositories, manual biocuration of publications supplements the cancer association annotations available in the database. All cancer types are mapped to Disease Ontology terms to facilitate a uniform pan-cancer analysis. The database can be searched by HUGO Gene Nomenclature Committee gene symbol or UniProtKB/RefSeq accession, or queried by cancer type with specified significance filters. This interface, together with pre-computed downloadable files containing differentially expressed genes in multiple cancers, enables straightforward retrieval and display of a broad set of cancer-related genes.
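
As an illustration of querying by cancer type with significance filters, the following minimal Python sketch filters a tab-separated table of differential expression results. The column names (`gene_symbol`, `cancer_type`, `log2_fold_change`, `adj_p_value`), the function names, the default thresholds and the sample rows are all assumptions made for this example; they are placeholders, not the actual layout or contents of the BioXpress download files, which should be checked before adapting the code.

```python
import csv
import io

# Placeholder table. Column names and values are invented for illustration
# and do NOT come from BioXpress; the real files may be laid out differently.
SAMPLE_TSV = """gene_symbol\tcancer_type\tlog2_fold_change\tadj_p_value
GENE_A\tcancer_X\t2.5\t0.001
GENE_B\tcancer_X\t-0.3\t0.40
GENE_C\tcancer_Y\t-1.8\t0.02
GENE_A\tcancer_Y\t1.1\t0.04
"""


def load_rows(text):
    """Parse tab-separated text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def filter_differential(rows, cancer_type, max_adj_p=0.05, min_abs_log2fc=0.0):
    """Return rows for one cancer type that pass both significance filters.

    A row is kept when its adjusted p-value is at most `max_adj_p` and the
    absolute log2 fold change is at least `min_abs_log2fc`.
    """
    selected = []
    for row in rows:
        if row["cancer_type"] != cancer_type:
            continue
        if float(row["adj_p_value"]) > max_adj_p:
            continue
        if abs(float(row["log2_fold_change"])) < min_abs_log2fc:
            continue
        selected.append(row)
    return selected


rows = load_rows(SAMPLE_TSV)
hits = filter_differential(rows, "cancer_X")
print([row["gene_symbol"] for row in hits])
```

To apply the same filter to a downloaded file, the sample string could be replaced with the file's contents, after adjusting the column names to match the real header.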

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Genetic*
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Neoplasms* / genetics
  • Neoplasms* / metabolism
  • RNA, Neoplasm* / biosynthesis
  • RNA, Neoplasm* / genetics

Substances

  • RNA, Neoplasm