Search for dissertations about: "corpus linguistics and translation"
Showing results 1-5 of 9 Swedish dissertations containing the words corpus linguistics and translation.
-
1. Clefts in English and Swedish: A contrastive study of IT-clefts and WH-clefts in original texts and translations
Abstract : This study investigates the use of cleft constructions in English and Swedish on the basis of a bidirectional translation corpus consisting of original English and Swedish texts and their translations into the other language. This design minimizes the problems inherent in corpora of original texts alone, viz.
-
2. A Halich Karaim Bible Translation
Abstract : This dissertation is a corpus-based analysis of a translation of the Hebrew Bible into Halich Karaim, a Kipchak Turkic variety previously spoken in present-day Ukraine. The corpus analysed in the thesis comprises sixty selected pages of a 596-page manuscript written in the Hebrew alphabet.
-
3. Discourse in Statistical Machine Translation
Abstract : This thesis addresses the technical and linguistic aspects of discourse-level processing in phrase-based statistical machine translation (SMT). Connected texts can have complex text-level linguistic dependencies across sentences that must be preserved in translation. However, the models and algorithms of SMT are pervaded by locality assumptions.
-
4. Unsupervised Learning of Morphology: Survey, Model, Algorithm and Experiments
Abstract : This thesis contains work on a specific problem in the field of Language Technology. The problem can be described as follows: "Can a computer extract a description of word conjugation in a natural language using only written text in the language?" The problem is often referred to as Unsupervised Learning of Morphology (ULM) and has a wide variety of Language Technology applications, including Machine Translation, Document Categorization and Information Retrieval.
-
5. The Multilingual Forest: Investigating High-quality Parallel Corpus Development
Abstract : This thesis explores the development of parallel treebanks, collections of language data consisting of texts and their translations, with syntactic annotation and alignment, linking words, phrases, and sentences to show translation equivalence. We describe the semi-manual annotation of the SMULTRON parallel treebank, consisting of 1,000 sentences in English, German and Swedish.