Identification and mapping of self-assembling protein domains encoded by the Escherichia coli K-12 genome by use of lambda repressor fusions

Self-assembling proteins and protein fragments encoded by the Escherichia coli genome were identified from E. coli K-12 strain MG1655. Libraries of random DNA fragments cloned into a series of lambda repressor fusion vectors were subjected to selection for immunity to infection by phage lambda. Survivors were identified by sequencing the ends of the inserts, and the fused protein sequence was inferred from the known genomic sequence. Four hundred sixty-three nonredundant open reading frame-encoded interacting sequence tags (ISTs) were recovered from sequencing 2,089 candidates. These ISTs, which range from 16 to 794 amino acids in length, were clustered into families of overlapping fragments, identifying potential homotypic interactions encoded by 232 E. coli genes. Repressor fusions identified ISTs from genes in every protein-based functional category, but membrane proteins were underrepresented. The IST-containing genes were enriched for regulatory proteins and for proteins that form higher-order oligomers. Forty-eight (20.7%) homotypic proteins identified by ISTs are predicted to contain coiled coils. Although most of the IST-containing genes are identifiably related to proteins in other bacterial genomes, more than half of the ISTs do not have identifiable homologs in the Protein Data Bank, suggesting that they may include many novel structures. The data are available online at
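
The clustering of ISTs into families of overlapping fragments can be illustrated with a minimal sketch. The abstract does not state the exact clustering criterion, so this assumes that two ISTs belong to the same family when they come from the same gene and their amino-acid coordinate ranges overlap. The function name `cluster_ists`, the gene names, and the coordinates are all hypothetical and are not taken from the study's data.

```python
from collections import defaultdict


def cluster_ists(ists):
    """Group ISTs into families of overlapping fragments.

    `ists` is an iterable of (gene, start, end) tuples with inclusive
    amino-acid coordinates. Assumption: fragments of the same gene whose
    ranges overlap are placed in the same family.
    """
    # Collect the fragment ranges observed for each gene.
    by_gene = defaultdict(list)
    for gene, start, end in ists:
        by_gene[gene].append((start, end))

    families = []
    for gene, spans in by_gene.items():
        spans.sort()  # order fragments by start coordinate
        current = [spans[0]]
        current_end = spans[0][1]
        for start, end in spans[1:]:
            if start <= current_end:
                # Overlaps the running family: extend it.
                current.append((start, end))
                current_end = max(current_end, end)
            else:
                # Gap found: close the family and start a new one.
                families.append((gene, current))
                current = [(start, end)]
                current_end = end
        families.append((gene, current))
    return families


# Hypothetical fragments, for illustration only.
example = [
    ("geneA", 10, 120),
    ("geneA", 90, 200),
    ("geneA", 350, 420),
    ("geneB", 1, 60),
]
families = cluster_ists(example)
genes_with_ists = {gene for gene, _ in families}
```

In this sketch, a gene can contribute more than one family when its fragments fall in non-overlapping regions, which is one way a single protein could show more than one candidate self-assembling domain.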

author list (cited authors)
Marino-Ramirez, L., Minor, J. L., Reading, N., & Hu, J. C.
keywords
  • DNA-Binding Proteins
  • Bacteriophage Lambda
  • Artificial Gene Fusion
  • Viral Proteins
  • Escherichia Coli
  • Genomic Library
  • Sequence Tagged Sites
  • Protein Structure, Tertiary
  • Protein Interaction Mapping
  • Repressor Proteins
  • Viral Regulatory And Accessory Proteins
  • Genome, Bacterial
  • Escherichia Coli Proteins