Search for dissertations about: "character-based statistical machine translation"

Found 2 Swedish dissertations containing the words character-based statistical machine translation.

  1. Text Harmonization Strategies for Phrase-Based Statistical Machine Translation

    Author : Sara Stymne; Lars Ahrenberg; Joakim Nivre; Nizar Habash; Linköpings universitet
    Keywords : NATURAL SCIENCES; HUMANITIES; Statistical machine translation; text harmonization; compound words; definiteness; reordering; unknown words

    Abstract : In this thesis I aim to improve phrase-based statistical machine translation (PBSMT) in a number of ways by the use of text harmonization strategies. PBSMT systems are built by training statistical models on large corpora of human translations. This architecture generally performs well for languages with similar structure.

  2. Spelling Normalisation and Linguistic Analysis of Historical Text for Information Extraction

    Author : Eva Pettersson; Joakim Nivre; Beáta Megyesi; Michael Piotrowski; Uppsala universitet
    Keywords : NLP for historical text; spelling normalisation; digital humanities; information extraction; character-based statistical machine translation; SMT; Levenshtein edit distance; language technology; computational linguistics

    Abstract : Historical text constitutes a rich source of information for historians and other researchers in the humanities. Many texts are, however, not available in an electronic format, and even when they are, there is a lack of NLP tools designed to handle historical text.