Modeling the Interaction Space of Biological Macromolecules: A Proteochemometric Approach Applications for Drug Discovery and Development

University dissertation from Uppsala : Acta Universitatis Upsaliensis

Abstract: Molecular interactions lie at the heart of myriad biological processes. Knowledge of molecular recognition processes and the ability to model and predict interactions of any biological molecule to any chemical compound are the key for better understanding of cell functions and discovery of more efficacious medicines.This thesis presents contributions to the development of a novel chemo-bioinformatics approach called proteochemometrics; a general method for interaction space analysis of biological macromolecules and their ligands. In this work we explore proteochemometrics-based interaction models over broad groups of protein families, evaluate their validity and scope, and compare proteochemometrics to traditional modeling approaches.Through the proteochemometric analysis of large interaction data sets of multiple retroviral proteases from various viral species we investigate complex mechanisms of drug resistance in HIV-1 and discover general physicochemical determinants of substrate cleavage efficiency and binding in retroviral proteases. We further demonstrate how global proteochemometric models can be used for design of protease inhibitors with broad activity on drug-resistant viral mutants, for monitoring drug resistance mechanisms in the physicochemical sense and prediction of potential HIV-1 evolution trajectories. We provide novel insights into the complexity of HIV-1 protease specificity by constructing a generalized IF-THEN rule model based on bioinformatics analysis of the largest set of HIV-1 protease substrates and non-substrates.We discuss how proteochemometrics can be used to map recognition sites of entire protein families in great detail and demonstrate how it can incorporate target variability into drug discovery process. Finally, we assess the utility of the proteochemometric approach in evaluation of ADMET properties of drug candidates with a special focus on inhibition of cytochrome P450 enzymes and investigate application of the approach in the pharmacogenomics field.