Search for dissertations about: "Corpus Language"

Showing result 1 - 5 of 148 swedish dissertations containing the words Corpus Language.

  1. 1. Why the pond is not outside the frog? Grounding in contextual representations by neural language models

    Author : Mehdi Ghanimifard; Göteborgs universitet; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; Computational linguistics; Language grounding; Spatial language; Distributional semantics; Computer vision; Language modelling; Vision and language; Neural language model; Grounded language model;

    Abstract : In this thesis, to build a multi-modal system for language generation and understanding, we study grounded neural language models. Literature in psychology informs us that spatial cognition involves different aspects of knowledge that include visual perception and human interaction with the world. READ MORE

  2. 2. The Sango Language and Its Lexicon (S�nd�-y�ng� t� S�ng�)

    Author : Christina Thornell; Allmän språkvetenskap; []
    Keywords : HUMANIORA; HUMANITIES; Linguistics; lexical semantics; lexicology; language planning; language contact; functional linguistics; language typology; Ubangi language; pidgin creole; Sango; Central African Republic; Lingvistik;

    Abstract : This doctoral dissertation is an overview of the recently arisen Sango language spoken in the Central African Republic. The overview contains a sociolinguistic and linguistic dimension with a lexical-semantic focus. READ MORE

  3. 3. Splitting rocks: Learning word sense representations from corpora and lexica

    Author : Luis Nieto Piña; Göteborgs universitet; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; language technology; natural language processing; distributional models; semantic representations; distributed representations; word senses; embeddings; word sense disambiguation; linguistic resources; corpus; lexicon; machine learning; neural networks;

    Abstract : The representation of written language semantics is a central problem of language technology and a crucial component of many natural language processing applications, from part-of-speech tagging to text summarization. These representations of linguistic units, such as words or sentences, allow computer applications that work with language to process and manipulate the meaning of text. READ MORE

  4. 4. The Multilingual Forest : Investigating High-quality Parallel Corpus Development

    Author : Yvonne Adesam; Martin Volk; Joakim Nivre; Koenraad de Smedt; Stockholms universitet; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; treebank; syntax; alignment; corpus; annotation projection; multilingual; tagging; parsing; datorlingvistik; Computational Linguistics;

    Abstract : This thesis explores the development of parallel treebanks, collections of language data consisting of texts and their translations, with syntactic annotation and alignment, linking words, phrases, and sentences to show translation equivalence. We describe the semi-manual annotation of the SMULTRON parallel treebank, consisting of 1,000 sentences in English, German and Swedish. READ MORE

  5. 5. Morphosyntactic Corpora and Tools for Persian

    Author : Mojgan Seraji; Joakim Nivre; Carina Jahani; Jan Hajic; Uppsala universitet; []
    Keywords : NATURVETENSKAP; NATURAL SCIENCES; Persian; language technology; corpus; treebank; preprocessing; segmentation; part-of-speech tagging; dependency parsing; Computational Linguistics; Datorlingvistik;

    Abstract : This thesis presents open source resources in the form of annotated corpora and modules for automatic morphosyntactic processing and analysis of Persian texts. More specifically, the resources consist of an improved part-of-speech tagged corpus and a dependency treebank, as well as tools for text normalization, sentence segmentation, tokenization, part-of-speech tagging, and dependency parsing for Persian. READ MORE