Dependency-based Semantic Analysis of Natural-language Text

University dissertation from Department of Computer Science, Lund University

Abstract: Semantic roles, logical relations such as Agent or Instrument that hold between events and their participants and circumstances, need to be determined automatically by several types of applications in natural language processing. This process is referred to as semantic role labeling. This dissertation describes how to construct statistical models for semantic role labeling of English text, and how role semantics is related to surface syntax. It is generally agreed that the problem of semantic role labeling is closely tied to syntactic analysis. Most previous implementations of semantic role labelers have used constituents as the syntactic input, while dependency representations, in which the syntactic structure is viewed as a graph of labeled word-to-word relations, has received very little attention in comparison. Contrary to previous claims, this work demonstrates empirically that dependency representations can serve as the input for semantic role labelers and achieve similar results. This is important theoretically since it makes the syntactic-semantic interface conceptually simpler and more intuitive, but also has practical significance since there are languages for which constituent annotation is infeasible. The dissertation devotes considerable effort to investigating the relation between syntactic representation and semantic role labeling performance. Apart from the main result that dependency-based semantic role labeling rivals its constituent-based counterpart, the empirical experiments support two findings: First, that the dependency-syntactic representation has to be well-designed in order to achieve a good performance in semantic role labeling. Secondly, that the choice of syntactic representation affects the substages of the semantic role labeling task differently; above all, the role classification task, which relies strongly on lexical features, is shown to benefit from dependency representations. The systems presented in this work have been evaluated in two international open evaluations, in both of which they achieved the top result.

  CLICK HERE TO DOWNLOAD THE WHOLE DISSERTATION. (in PDF format)