cpnDB: a chaperonin sequence database

Genome Res. 2004 Aug;14(8):1669-75. doi: 10.1101/gr.2649204.

Abstract

Type I chaperonins are molecular chaperones present in virtually all bacteria, some archaea and the plastids and mitochondria of eukaryotes. Sequences of cpn60 genes, encoding 60-kDa chaperonin protein subunits (CPN60, also known as GroEL or HSP60), are useful for phylogenetic studies and as targets for detection and identification of organisms. Conveniently, a 549-567-bp segment of the cpn60 coding region can be amplified with universal PCR primers. Here, we introduce cpnDB, a curated collection of cpn60 sequence data collected from public databases or generated by a network of collaborators exploiting the cpn60 target in clinical, phylogenetic, and microbial ecology studies. The growing database currently contains approximately 2000 records covering over 240 genera of bacteria, eukaryotes, and archaea. The database also contains over 60 sequences for the archaeal Type II chaperonin (thermosome, a homolog of eukaryotic cytoplasmic chaperonin) from 19 archaeal genera. As the largest curated collection of sequences available for a protein-encoding gene, cpnDB provides a resource for researchers interested in exploiting the power of cpn60 as a diagnostic or as a target for phylogenetic or microbial ecology studies, as well as those interested in broader subjects such as lateral gene transfer and codon usage. We built cpnDB from open source tools and it is available at http://cpndb.cbr.nrc.ca.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Bacteria / genetics
  • Base Sequence
  • Chaperonin 60 / genetics*
  • Computational Biology
  • Databases, Genetic
  • Molecular Sequence Data
  • Sequence Homology, Nucleic Acid

Substances

  • Chaperonin 60