Search for dissertations about: "Audio speech interaction"
Showing result 1 - 5 of 10 swedish dissertations containing the words Audio speech interaction.
-
1. Mining Speech Sounds : Machine Learning Methods for Automatic Speech Recognition and Analysis
Abstract : This thesis collects studies on machine learning methods applied to speech technology and speech research problems. The six research papers included in this thesis are organised in three main areas. The first group of studies were carried out within the European project Synface. READ MORE
-
2. Design foundations for content-rich acoustic interfaces : Investigating audemes as referential non-speech audio cues
Abstract : To access interactive systems, blind and visually impaired users can leverage their auditory senses by using non-speech sounds. The current structure of non-speech sounds, however, is geared toward conveying user interface operations (e.g., opening a file) rather than large theme-based information (e. READ MORE
-
3. Designing guidance along audio-haptically augmented paths in a city environment
Abstract : The main purpose of this thesis is to show in a case study how it is possible to inform, with Activity Theory, the design and evaluation of a pedestrian navigation system that uses audio-tactile feedback. The case study consists of an iterative design process that results in a tourist guide application used in a mobile phone. READ MORE
-
4. Developing and evaluating co-speech gesture-synthesis models for embodied conversational agents
Abstract : A large part of our communication is non-verbal: humans use non-verbal behaviors to express various aspects of our state or intent. Embodied artificial agents, such as virtual avatars or robots, should also use non-verbal behavior for efficient and pleasant interaction. READ MORE
-
5. Predictive Modeling of Turn-Taking in Spoken Dialogue : Computational Approaches for the Analysis of Turn-Taking in Humans and Spoken Dialogue Systems
Abstract : Turn-taking in spoken dialogue represents a complex cooperative process wherein participants use verbal and non-verbal cues to coordinate who speaks and who listens, to anticipate speaker transitions, and to produce backchannels (e.g., “mhm”, “uh-huh”) at the right places. READ MORE