Genetic Cartography at Massively Parallel Scale

Abstract: Massively parallel sequencing (MPS) is revolutionizing genomics. In this work we use, refine, and develop new tools for the discipline.MPS has led to the discovery of multiple novel subtypes in Acute Lymphoblastic Leukemia (ALL). In Study I we screen for fusion genes in 134 pediatric ALL patients, including patients without an assigned subtype. In approximately 80% of these patients we detect novel or known fusion gene families, most of which display distinct methylation and expression patterns. This shows the potential for improvements in the clinical stratification of ALL. Large sample sizes are important to detect recurrent somatic variation. In Study II we investigate if a non-index overlapping pooling schema can be used to increase sample size and detect somatic variation. We designed a schema for 172 ALL samples and show that it is possible to use this method to call somatic variants.Around the globe there are many ongoing and completed genome projects. In Study III we sequenced the genome of 1000 Swedes to create a reference data set for the Swedish population. We identified more than 10 million variants that were not present in publicly available databases, highlighting the need for population-specific resources. Data, and the tools developed during this study, have been made publicly available as a resource for genomics in Sweden and abroad.The increased amount of sequencing data has created a greater need for automation. In Study IV we present Arteria, a computational automation system for sequencing core facilities. This system has been adopted by multiple facilities and has been used to analyze thousands of samples. In Study V we developed CheckQC, a program that provides automated quality control of Illumina sequencing runs. These tools make scaling up MPS less labour intensive, a key to unlocking the full future potential of genomics.The tools, and data presented here are a valuable contribution to the scientific community. Collectively they showcase the power of MPS and genomics to bring about new knowledge of human health and disease.

  CLICK HERE TO DOWNLOAD THE WHOLE DISSERTATION. (in PDF format)