Search for dissertations about: "Natural Sciences Computer and Information Science Language Technology Computational Linguistics"
Showing result 1 - 5 of 11 swedish dissertations containing the words Natural Sciences Computer and Information Science Language Technology Computational Linguistics.
-
1. MaltParser -- An Architecture for Inductive Labeled Dependency Parsing
Abstract : This licentiate thesis presents a software architecture for inductive labeled dependency parsing of unrestricted natural language text, which achieves a strict modularization of parsing algorithm, feature model and learning method such that these parameters can be varied independently. The architecture is based on the theoretical framework of inductive dependency parsing by Nivre \citeyear{nivre06c} and has been realized in MaltParser, a system that supports several parsing algorithms and learning methods, for which complex feature models can be defined in a special description language. READ MORE
-
2. Compound Processing for Phrase-Based Statistical Machine Translation
Abstract : In this thesis I explore how compound processing can be used to improve phrase-based statistical machine translation (PBSMT) between English and German/Swedish. Both German and Swedish generally use closed compounds, which are written as one word without spaces or other indicators of word boundaries. READ MORE
-
3. Bootstrapping Named Entity Annotation by Means of Active Machine Learning
Abstract : This thesis describes the development and in-depth empirical investigation of a method, called BootMark, for bootstrapping the marking up of named entities in textual documents. The reason for working with documents, as opposed to for instance sentences or phrases, is that the BootMark method is concerned with the creation of corpora. READ MORE
-
4. Semantic Spaces of Clinical Text : Leveraging Distributional Semantics for Natural Language Processing of Electronic Health Records
Abstract : The large amounts of clinical data generated by electronic health record systems are an underutilized resource, which, if tapped, has enormous potential to improve health care. Since the majority of this data is in the form of unstructured text, which is challenging to analyze computationally, there is a need for sophisticated clinical language processing methods. READ MORE
-
5. Disfluency in Swedish human–human and human–machine travel booking dialogues
Abstract : This thesis studies disfluency in spontaneous Swedish speech, i.e., the occurrence of hesitation phenomena like eh, öh, truncated words, repetitions and repairs, mispronunciations, truncated words and so on. READ MORE