Amino acid sequence of human histidine-rich glycoprotein derived from the nucleotide sequence of its cDNA

Biochemistry. 1986 Apr 22;25(8):2220-5. doi: 10.1021/bi00356a055.

Abstract

A λgt11 library containing cDNA inserts prepared from human liver mRNA was screened first with an affinity-purified antibody to human histidine-rich glycoprotein (HRG) and then with a restriction fragment isolated from the 5' end of the largest cDNA insert obtained by antibody screening. A number of positive clones were identified and shown by DNA sequence analysis to code for HRG. A total of 2067 nucleotides were determined by sequencing three overlapping cDNA clones. This sequence included 121 nucleotides of 5'-noncoding sequence, 54 nucleotides coding for a leader sequence of 18 amino acids, 1521 nucleotides coding for the mature protein of 507 amino acids, a TAA stop codon, and 352 nucleotides of 3'-noncoding sequence followed by a poly(A) tail of 16 nucleotides. The length of the 3'-noncoding sequence differed among several clones, but each contained the polyadenylylation or processing sequence AATAAA followed by a poly(A) tail. More than half of the amino acid sequence of HRG consisted of five different types of internal repeats. Within the last three internal repeats (type V), there were 12 tandem repetitions of a five amino acid segment with the consensus sequence Gly-His-His-Pro-His. This repeated portion, referred to as a "histidine-rich region", contained 53% histidine and showed a high degree of similarity to a histidine-rich region of high molecular weight kininogen.
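The segment lengths reported in the abstract can be checked against the stated total with a short tally. The sketch below is illustrative only: the dictionary keys and the helper `codons` are hypothetical names introduced here, and the only inputs are the counts quoted in the abstract (with the stop codon taken as 3 nucleotides).

```python
# Segment lengths in nucleotides, as quoted in the abstract.
# The key names are illustrative labels, not terms from the paper.
segments = {
    "5' noncoding": 121,
    "leader peptide": 54,
    "mature protein": 1521,
    "stop codon (TAA)": 3,
    "3' noncoding": 352,
    "poly(A) tail": 16,
}

def codons(n_nt: int) -> int:
    """Return the number of codons in a coding stretch of n_nt nucleotides."""
    if n_nt % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return n_nt // 3

total_nt = sum(segments.values())
leader_aa = codons(segments["leader peptide"])
mature_aa = codons(segments["mature protein"])

print(total_nt)   # 2067
print(leader_aa)  # 18
print(mature_aa)  # 507
```

The tally reproduces the 2067 nucleotides stated in the abstract, and the two coding stretches divide evenly into the reported 18 and 507 amino acids.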

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Blood Proteins
  • Cloning, Molecular
  • DNA / metabolism*
  • DNA Restriction Enzymes
  • Glycoproteins / genetics*
  • Humans
  • Liver / metabolism
  • Proteins / genetics*
  • Proteins / isolation & purification

Substances

  • Blood Proteins
  • Glycoproteins
  • Proteins
  • histidine-rich proteins
  • DNA
  • DNA Restriction Enzymes

Associated data

  • GENBANK/M13149