Domain rearrangement and creation in protein evolution

University dissertation from Stockholm : Institutionen för biokemi och biofysik

Abstract: Proteins are composed of domains, recurrent protein fragments with distinct structure, function and evolutionary history. Some domains exist only as single domain proteins, however, a majority of them are also combined with other domains. Domain rearrangements are important in the evolution of new proteins as new functionalities can arise in a single evolutionary event. In addition, the domain repertoire can be expanded through mutations of existing domains and de novo creation. The processes of domain rearrangement and creation have been the focus of this thesis.According to our estimates about 65% of the eukaryotic and 40% of the prokaryotic proteins are of multidomain type. We found that insertion of a single domain at the N- or C-terminus was the most common event in the creation of novel multidomain architectures. However, domain repeats deviate from this pattern and are often expanded through duplications of several domains. Next, by mapping domain combinations onto an evolutionary tree we estimated that roughly one domain architecture has been created per million years, with the highest rates in metazoa. Much of this so called explosion of new architectures in metazoa seems to be explained by a set of domains amenable to exon shuffling. In contrast to domain architectures, most known domain families evolved early. However, many proteins have incomplete domain coverage, and could hence contain de novo created domains. In Saccharomyces cerevisiae, however, species specific sequences constitute only a minor fraction of the proteome, and are often short, disordered sequences located at the protein termini.

  This dissertation MIGHT be available in PDF-format. Check this page to see if it is available for download.