Search for dissertations about: "treebank"

Found 5 swedish dissertations containing the word treebank.

  1. 1. The Multilingual Forest : Investigating High-quality Parallel Corpus Development

    Author : Yvonne Adesam; Martin Volk; Joakim Nivre; Koenraad de Smedt; Stockholms universitet; []
    Keywords : NATURAL SCIENCES; NATURVETENSKAP; NATURVETENSKAP; NATURAL SCIENCES; treebank; syntax; alignment; corpus; annotation projection; multilingual; tagging; parsing; datorlingvistik; Computational Linguistics;

    Abstract : This thesis explores the development of parallel treebanks, collections of language data consisting of texts and their translations, with syntactic annotation and alignment, linking words, phrases, and sentences to show translation equivalence. We describe the semi-manual annotation of the SMULTRON parallel treebank, consisting of 1,000 sentences in English, German and Swedish. READ MORE

  2. 2. Morphosyntactic Corpora and Tools for Persian

    Author : Mojgan Seraji; Joakim Nivre; Carina Jahani; Jan Hajic; Uppsala universitet; []
    Keywords : NATURAL SCIENCES; NATURVETENSKAP; NATURVETENSKAP; NATURAL SCIENCES; Persian; language technology; corpus; treebank; preprocessing; segmentation; part-of-speech tagging; dependency parsing; Computational Linguistics; Datorlingvistik;

    Abstract : This thesis presents open source resources in the form of annotated corpora and modules for automatic morphosyntactic processing and analysis of Persian texts. More specifically, the resources consist of an improved part-of-speech tagged corpus and a dependency treebank, as well as tools for text normalization, sentence segmentation, tokenization, part-of-speech tagging, and dependency parsing for Persian. READ MORE

  3. 3. Inductive Dependency Parsing of Natural Language Text

    Author : Joakim Nivre; Walter Daelemans; Växjö universitet; []
    Keywords : NATURAL SCIENCES; NATURVETENSKAP; NATURVETENSKAP; NATURAL SCIENCES; natural language parsing; dependency parsing; memory-based learning; treebank parsing; Systems engineering; Systemteknik; Computer and Information Sciences Computer Science; Data- och informationsvetenskap;

    Abstract : This thesis investigates new methods for syntactic parsing of unrestricted natural language text under requirements of robustness and disambiguation. A parsing system is required to assign to every sentence in a text at least one analysis (robustness) and at most one analysis (disambiguation). READ MORE

  4. 4. Tree Transformations in Inductive Dependency Parsing

    Author : Jens Nilsson; Joakim Nivre; Pierre Nugues; Växjö universitet; []
    Keywords : NATURAL SCIENCES; NATURVETENSKAP; NATURVETENSKAP; NATURAL SCIENCES; Inductive Dependency Parsing; Dependency Structure; Tree Transformation; Non-projectivity; Coordination; Verb Group; Language technology; Språkteknologi; Computer and Information Sciences Computer Science; Data- och informationsvetenskap;

    Abstract : This licentiate thesis deals with automatic syntactic analysis, or parsing, of natural languages. A parser constructs the syntactic analysis, which it learns by looking at correctly analyzed sentences, known as training data. The general topic concerns manipulations of the training data in order to improve the parsing accuracy. READ MORE

  5. 5. The Architecture of Result Relations : Corpus and experimental approaches to Result coherence relations in English

    Author : Marta Andersson; Nils-Lennart Johannesson; Ted Sanders; Stockholms universitet; []
    Keywords : HUMANITIES; HUMANIORA; RESULT; PURPOSE; discourse connectives; disambiguation; subjectivity; nonveridicality; English; engelska;

    Abstract : Two fundamental components of causality are the Cause and the Result. In linguistic work the distinction between these aspects is commonly blurred, presumably because the primary research focus has been on describing how language encodes causality. READ MORE