Bioinformatics Methods for Topology Prediction of Membrane Proteins

University dissertation from Stockholm : Department of Biochemistry and Biophysics, Stockholm University

Abstract: Membrane proteins are key elements of the cell since they are associated with a variety of very important biological functions crucial to its survival. They are implicated in cellular recognition and adhesion, act as molecular receptors, transport substrates through membranes and exhibit specific enzymatic activity.This thesis is focused on integral membrane proteins, most of which contain transmembrane segments that form an alpha helix and are composed of mainly hydrophobic residues, spanning the lipid bilayer. A more specialized and less well-studied case, is the case of integral membrane proteins found in the outer membrane of Gram-negative bacteria and (presumably) in the outer envelope of mitochondria and chloroplasts, proteins whose transmembrane segments are formed by amphipathic beta strands that create a closed barrel (beta-barrels). The importance of transmembrane proteins, as well as the inherent difficulties in crystallizing and obtaining three-dimensional structures of these, dictates the need for developing computational algorithms and tools that will allow for a reliable and fast prediction of their structural and functional features. In order to elucidate their function, we must acquire knowledge about their structure and topology with relation to the membrane. Therefore, a large number of computational methods have been developed in order to predict the transmembrane segments and the overall topology of transmembrane proteins. In this thesis, I initially describe a large-scale benchmark of many topology prediction tools in order to devise a strategy that will allow for better detection of alpha-helical membrane proteins in a proteome. Then, I give a description of construction of improved machine-learning algorithms and computer software for accurate topology prediction of transmembrane proteins and discrimination of such proteins from non-transmembrane proteins. Finally, I introduce a fast way to obtain a position-specific scoring matrix, which is essential for modern topology prediction methods.

  CLICK HERE TO DOWNLOAD THE WHOLE DISSERTATION. (in PDF format)