Ribosome-Related
Proteins:
Alignments, Phylogeny and Structure
Introduction
The ever expanding wealth of genomic data is providing
new opportunities for more consistent and reliable phylogenetic reconstructions.
While many of the translation- related proteins, such as the tRNA synthetases,
appear to lack consistent phylogenetic relationship, perhaps due to horizontal
gene transfer and convergent evolution, we have shown that there are a
number of translational proteins which provide clear and very consistant
phylogenetic implications (Vishwanath et al. " Ribosomal Protein Sequence
Block Structure Suggests Complex Prokaryotic Evolution with Implications
for the Origin of Eukaryotes." Molecular Phylogenetics and Evolution
2004 Dec;33(3):615-25 ; Hartman et al.
Archaea v.2 pXXXX 2006
entitled: " The archaeal origins of the eukarkotic translational
system" ). Among these proteins are the ribosomal proteins, some
initiation and elongation factors, some RNA polymerase subunits, the SRP
proteins and a few others. The phylogenetic implications are in
large part the result of a taxon- specific block structure,
with blocks unique to, and distinctive of, all sequenced bacterial and
separately of all sequenced archaeal genomes. The proteins common to the
prokaryotes and the eukaryotes share the archaeal signature blocks, along
with various N- and C-terminal additions. This is in parallel with what
is seen in the eukaryotic rRNAes, which appear most similar to the archaeal
rRNA with multiple insertions.
Overview
One of the main objectives of this study was to gain
a deeper understanding of the Evolution of living organisms (Bacteria,
Archaea and Eukarya). In order to achieve this goal, we analyzed the molecules
that play an important role in the biological information system, specifically
ribosomal proteins, polymerase and translation factors. Through
profile methods, Blast and Psi Blast, we identified taxa containing representative
proteins encompassing the widest taxonomic and ecological range possible
and so as to include organisms for which structural and functional information
was available. Initial Multiple sequence
alignments were generated using various software and then refined
so that the hydrophobicity, polarity and turn induction amino acid patterns
were preserved in so far as possible. In addition, alignment gaps
were restricted to positions between secondary structure elements.
Finally amino acid making RNA contacts or involved in protein-protein
interactions were aligned to produce conserved positions across the entire
alignment if possible without additional gaps.
The blocks identified in the alignments have a length
varying from 6 to 70 aa, have no consistent correlation with RNA contacts
or protein protein interaction sites, are shorter than typical protein
domains but longer than segments associated with enzyme active-sites,
and their transitions are clear and well defined.
To investigate deeply the implications of the unusual block structure
found, maximum likelihood and maximum parsimony phylogenetic
trees were built by concatenating each type of blocks separately
for all ribosomal and translation factor proteins respectively. The core
branching obtained from the sequence alignments is the same: the trees
clearly resolves not only the three domains, but also the separation between
Crenarchaea and Euryarchaea, with an implied Crenarchaea "origin" for
the eukaryotic translational system. More interesting is the suggested
single last common ancestor for all extant Bacteria and all extant Archaea,
which in turn opens the possibility of a very ancient and major prokaryotic
bottleneck around two billion years ago!
Contact
BMERC - BioMolecular Engineering
Research Center
36 Cummington St. room 302
Boston University
Boston MA 02215 - USA
last modified July
19, 2004 11:09 AM
© BMERC 2004 |