Now showing items 469-488 of 627

  • The role of prosody in a voicemail summarization system 

    Koumpis, Konstantinos; Renals, Steve (International Speech Communication Association, 2001)
    When a speaker leaves a voicemail message there are prosodic cues that emphasize the important points in the message, in addition to lexical content. In this paper we compare and visualize the relative contribution of these ...
  • Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis 

    Yamagishi, Junichi; Watts, Oliver; King, Simon; Usabaev, Bela (ISCA, 2010)
    In speaker-adaptive HMM-based speech synthesis, there are a few speakers whose synthetic speech sounds worse than that of other speakers, despite having the same amount of adaptation data from within the same corpus. This ...
  • Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis 

    Yamagishi, Junichi; Watts, Oliver; King, Simon; Usabaev, Bela (2010)
    In speaker-adaptive HMM-based speech synthesis, there are typically a few speakers for which the output synthetic speech sounds worse than that of other speakers, despite having the same amount of adaptation data from ...
  • The RPC Calculus 

    Cooper, Ezra; Wadler, Philip (2009)
    Several recent language designs have offered a unified language for programming a distributed system, with explicit notation of locations; we call these “location-aware” languages. These languages provide constructs ...
  • Rural and Urban Road Network Generalisation: Deriving 1:250,000 from OS MasterMap 

    Chaudhry, Omair; Mackaness, William (Institute of Geography. The School of Geosciences.The University of Edinburgh, 2006)
    Roads are essential component of topographic maps and spatial databases. The challenge in automated generalisation of road networks is to derive a connected network while maintaining the structure for the intended ...
  • SABLE: A Standard for TTS Markup 

    Sproat, Richard; Hunt, Andrew; Ostendorf, Mari; Taylor, Paul; Black, Alan W; Lenzo, Kevin; Edgington, Mike (International Speech Communication Association, 1998-11)
    Currently, speech synthesizers are controlled by a multitude of proprietary tag sets. These tag sets vary substantially across synthesizers and are an inhibitor to the adoption of speech synthesis technology by developers. ...
  • SABLE: A Standard For TTS Markup 

    Sproat, Richard; Hunt, Andrew; Ostendorf, Mari; Taylor, Paul; Black, Alan W; Lenzo, Kevin; Edgington, Mike (International Speech Communication Association, 1998-12)
    Currently, speech synthesizers are controlled by a multitude of proprietary tag sets. These tag sets vary substantially across synthesizers and are an inhibitor to the adoption of speech synthesis technology by developers. ...
  • Segmental Duration Control Based on an Articulatory Model 

    Shiga, Yoshinori; Matsuura, Hiroshi; Nitta, Tsuneo (International Speech Communication Association, 1998-12)
    This paper proposes a new method that determines segmental duration for text-to-speech conversion based on the movement of articulatory organs which compose an articulatory model. The articulatory model comprises four ...
  • Selection of Pronunciation Variants in Spontaneous Speech: Comparing the Performance of Man and Machine 

    Wester, Mirjam; Kessens, Judith M; Cucchiarini, Catia; Strik, Helmer (International Speech Communication Association, 1998-09)
    Dans cet article, les performances d'un outil de transcription automatique sont évaluées. L'outil de transcription est un reconnaisseur de parole continue (CSR) fonctionnant en mode de reconnaissance forcée. Pour l'évaluation ...
  • The Selection of Pronunciation Variants: Comparing the Performance of Man and Machine 

    Kessens, Judith M; Wester, Mirjam; Cucchiarini, Catia; Strik, Helmer (International Speech Communication Association, 1998-12)
    In this paper the performance of an automatic transcription tool is evaluated. The transcription tool is a Continuous Speech Recognizer (CSR) running in forced recognition mode. For evaluation the performance of the CSR ...
  • Semantic Support for Visualisation in Collaborative AI Planning 

    Lino, N; Tate, Austin; Chen-Burger, Y-H (International Conference on Automated Planning and Scheduling, 2005-06)
    In the last decades, many advances have been made in intelligent planning systems. Significant improvements related to core problems, providing faster search algorithms and shortest plans have been proposed. However, there ...
  • A Semantic-based Workflow Choreography for Integrated Sensing and Processing 

    Chen-Burger, Y-H; Lin, F-P (IEEE Computer Society, 2005-05)
    Real-time automatic and continuous information gathering through EcoGrid offers an unique and immense opportunity for long term ecological monitoring and planning. However, the vast amount of raw data gathered must be ...
  • Sentence Boundary Detection in Broadcast Speech Transcripts 

    Gotoh, Yoshihiko; Renals, Steve (International Speech Communication Association, 2000-09)
    This paper presents an approach to identifying sentence boundaries in broadcast speech transcripts. We describe finite state models that extract sentence boundary information statistically from text and audio sources. An ...
  • A Shrinkage Estimator for Speech Recognition with Full Covariance HMMs 

    Bell, Peter; King, Simon (2008)
    We consider the problem of parameter estimation in full-covariance Gaussian mixture systems for automatic speech recognition. Due to the high dimensionality of the acoustic feature vector, the standard sample covariance ...
  • The significance of the senses 

    Nudds, Matthew (2003)
    Standard accounts of the senses attempt to answer the question how and why we count five senses (the counting question); none of the standard accounts is satisfactory. Any adequate account of the senses must explain ...
  • Signing for the deaf using virtual humans 

    Bangham, J A; Cox, Stephen; Lincoln, Michael; Marshall, I; Tutt, M; Wells, Mark (IEEE Signal Processing Society Press, 2000-04)
    Research at Televirtual (Norwich) and the University of East Anglia, funded predominantly by the Independent Television Commission and more recently by the UK Post Office also, has investigated the feasibility of using ...
  • Simple methods for improving speaker-similarity of HMM-based speech synthesis 

    Yamagishi, Junichi; King, Simon (2010)
    In this paper we revisit some basic configuration choices of HMM based speech synthesis, such as waveform sampling rate, auditory frequency warping scale and the logarithmic scaling of F0, with the aim of improving ...
  • Single Speaker Segmentation and Inventory Selection Using Dynamic Time Warping Self Organization and Joint Multigram Mapping 

    Aylett, Matthew; King, Simon (2008)
    In speech synthesis the inventory of units is decided by inspection and on the basis of phonological and phonetic expertise. The ephone (or emergent phone) project at CSTR is investigating how self organisation techniques ...
  • Slice sampling covariance hyperparameters of latent Gaussian models 

    Murray, Iain; Adams, Ryan Prescott (2010)
    The Gaussian process (GP) is a popular way to specify dependencies between random variables in a probabilistic model. In the Bayesian framework the covariance structure can be specified using unknown hyperparameters. ...
  • Slow Frequency Hopping for Mitigating Tidal Fading on Rural Long Distance Over-Water Wireless Links 

    Macmillan, Alexander; Triana, Jhair Tocancipa; Marina, Mahesh (2010-05)
    We consider the scenario where long-distance point-to-point wireless links are established over tidal estuaries to bring Internet access to rural communities living in coastal villages. Using real signal strength measurements ...