
  • Retrieval of broadcast news documents with the THISL system. 

    Abberley, Dave; Renals, Steve; Cook, Gary; Robinson, Tony (1999)
    This paper describes the THISL system that participated in the TREC-7 evaluation, Spoken Document Retrieval (SDR) Track, and presents the results obtained, together with some analysis. The THISL system is based on the Abbot ...
  • Revisiting the security of speaker verification systems against imposture using synthetic speech 

    De Leon, P. L.; Apsingekar, V. R.; Pucher, M.; Yamagishi, Junichi (2010)
    In this paper, we investigate imposture using synthetic speech. Although this problem was first examined over a decade ago, dramatic improvements in both speaker verification (SV) and speech synthesis have renewed ...
  • Robust Constraint-consistent Learning 

    Howard, Matthew; Klanke, Stefan; Gienger, Michael; Goerick, Christian; Vijayakumar, Sethu (2009)
    Many everyday human skills can be framed in terms of performing some task subject to constraints imposed by the environment. Constraints are usually unobservable and frequently change between contexts. In this paper, ...
  • Robust LTS rules with the Combilex speech technology lexicon 

    Richmond, Korin; Clark, Robert A J; Fitt, Susan (2009)
    Combilex is a high quality pronunciation lexicon aimed at speech technology applications that has recently been released by CSTR. Combilex benefits from several advanced features. This paper evaluates one of these: the ...
  • Robust Pitch Detection by Narrow Band Spectrum Analysis 

    Shimodaira, Hiroshi; Nakai, Mitsuru (International Speech Communication Association, 1992-10)
    This paper proposes a new technique for detecting pitch patterns which is useful for automatic speech recognition, by using a narrow band spectrum analysis. The motivation of this approach is that humans perceive some kind ...
  • Robustness of HMM-based Speech Synthesis 

    Yamagishi, Junichi; Ling, Zhenhua; King, Simon (2008)
    As speech synthesis techniques become more advanced, we are able to consider building high-quality voices from data collected outside the usual highly-controlled recording studio environment. This presents new challenges ...
  • Robustness of VOR and OKR adaptation under kinematics and dynamics transformations 

    Haith, Adrian; Vijayakumar, Sethu (2007)
    Many computational models of vestibulo-ocular reflex (VOR) adaptation have been proposed; however, none of these models have explicitly highlighted the distinction between adaptation to dynamics transformations, in which ...
  • The role of higher-level linguistic features in HMM-based speech synthesis 

    Watts, Oliver; Yamagishi, Junichi; King, Simon (2010)
    We analyse the contribution of higher-level elements of the linguistic specification of a data-driven speech synthesiser to the naturalness of the synthetic speech which it generates. The system is trained using various ...
  • The role of human rights in safeguarding the exercise of IP 

    Grosheide, Willem (AHRC Research Centre for Studies in Intellectual Property and Technology Law, 2004-12)
    Paper presented by Grosheide at the "Towards Utopia" expert meeting chaired by Waelde and Brown, held in December 2004.
  • The role of prosody in a voicemail summarization system 

    Koumpis, Konstantinos; Renals, Steve (International Speech Communication Association, 2001)
    When a speaker leaves a voicemail message there are prosodic cues that emphasize the important points in the message, in addition to lexical content. In this paper we compare and visualize the relative contribution of these ...
  • Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis 

    Yamagishi, Junichi; Watts, Oliver; King, Simon; Usabaev, Bela (ISCA, 2010)
    In speaker-adaptive HMM-based speech synthesis, there are a few speakers whose synthetic speech sounds worse than that of other speakers, despite having the same amount of adaptation data from within the same corpus. This ...
  • Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis 

    Yamagishi, Junichi; Watts, Oliver; King, Simon; Usabaev, Bela (2010)
    In speaker-adaptive HMM-based speech synthesis, there are typically a few speakers for which the output synthetic speech sounds worse than that of other speakers, despite having the same amount of adaptation data from ...
  • The RPC Calculus 

    Cooper, Ezra; Wadler, Philip (2009)
    Several recent language designs have offered a unified language for programming a distributed system, with explicit notation of locations; we call these “location-aware” languages. These languages provide constructs ...
  • Rural and Urban Road Network Generalisation: Deriving 1:250,000 from OS MasterMap 

    Chaudhry, Omair; Mackaness, William (Institute of Geography. The School of Geosciences. The University of Edinburgh, 2006)
    Roads are an essential component of topographic maps and spatial databases. The challenge in automated generalisation of road networks is to derive a connected network while maintaining the structure for the intended ...
  • SABLE: A Standard for TTS Markup 

    Sproat, Richard; Hunt, Andrew; Ostendorf, Mari; Taylor, Paul; Black, Alan W; Lenzo, Kevin; Edgington, Mike (International Speech Communication Association, 1998-11)
    Currently, speech synthesizers are controlled by a multitude of proprietary tag sets. These tag sets vary substantially across synthesizers and are an inhibitor to the adoption of speech synthesis technology by developers. ...
  • SABLE: A Standard For TTS Markup 

    Sproat, Richard; Hunt, Andrew; Ostendorf, Mari; Taylor, Paul; Black, Alan W; Lenzo, Kevin; Edgington, Mike (International Speech Communication Association, 1998-12)
    Currently, speech synthesizers are controlled by a multitude of proprietary tag sets. These tag sets vary substantially across synthesizers and are an inhibitor to the adoption of speech synthesis technology by developers. ...
  • SDA: Survey Document and Analysis 

    Ruus, Laine (2016-10)
    Poster presented by Laine Ruus of EDINA at RepoFringe 2016. What Windows Media Player does for sound and video files, SDA does for numeric microdata, although it can also handle aggregate and time-series data.
  • Segmental Duration Control Based on an Articulatory Model 

    Shiga, Yoshinori; Matsuura, Hiroshi; Nitta, Tsuneo (International Speech Communication Association, 1998-12)
    This paper proposes a new method that determines segmental duration for text-to-speech conversion based on the movement of articulatory organs which compose an articulatory model. The articulatory model comprises four ...
  • Selection of Pronunciation Variants in Spontaneous Speech: Comparing the Performance of Man and Machine 

    Wester, Mirjam; Kessens, Judith M; Cucchiarini, Catia; Strik, Helmer (International Speech Communication Association, 1998-09)
    In this article, the performance of an automatic transcription tool is evaluated. The transcription tool is a continuous speech recognizer (CSR) operating in forced recognition mode. For the evaluation ...
  • The Selection of Pronunciation Variants: Comparing the Performance of Man and Machine 

    Kessens, Judith M; Wester, Mirjam; Cucchiarini, Catia; Strik, Helmer (International Speech Communication Association, 1998-12)
    In this paper the performance of an automatic transcription tool is evaluated. The transcription tool is a Continuous Speech Recognizer (CSR) running in forced recognition mode. For evaluation the performance of the CSR ...