Bioinformatic approaches to gene expression in leukemia. Networks and deconvolution

Abstract: The aim of this thesis is develop methods to extract information from high-dimensionality data and to apply them in looking at gene-gene and gene-protein interactions in Acute Myeloid Leukemia (AML). The in silico methods developed can be used with data from other systems. The thesis gives an overview of the field for network inference and deconvolution methods, as well as a brief background on the biology concerning AML. Paper I develops a method, Ultranet, for inferring Gaussian Graphical Models in an efficient manner. The models can created by solving the sparse inverse covariance selection (SICS) problem be used to identify gene networks from data containing a large number of variables but a proportionately low sample number. We apply Ultranet to data from several blood disorders and show that the models capture blood related gene interactions. In paper II, we investigate how the cellular heterogeneity of many tissue samples can be taken into consideration when examining gene expression of said samples. Cell hierarchies are thought to be present in leukemia and some solid tumors, sustained by cancer stem cells (CSCs). We developed a computational approach that extracts gene expression patterns and cell type proportions in silico. Paper III and IV looks at how the DEK and WT1 proteins, respectively, interact with DNA using chromatin immunoprecipitation followed by sequencing. The proteins are known to be altered in a range of different cancer forms, including Acute Myeloid Leukemia. We find, in paper III, that DEK binds close to transcription start sites of actively transcribed genes as indicated by the presence of the histone markers. DEK binds to genes that are ubiquitously expressed across tissues, represented by presence of RNA polymerase II binding sites in cell lines. We also show that knockdown of DEK by shRNA results in both significant down- and up-regulations of DEK-bound genes, suggesting complex interactions. In paper IV, we study how the binding patterns of the WT1 isoforms differ. There is an alternative splice site between zinc finger three and four which leads to inclusion or exclusion of three amino acids (+/-KTS). We show that WT1 -KTS binds to transcription start sites, while WT1 +KTS bind inside gene bodies.

  CLICK HERE TO DOWNLOAD THE WHOLE DISSERTATION. (in PDF format)