Festvox Festival & Dialog Systems

Notes:

Festvox is a project that provides free software and tools for building custom synthetic voices. The Festvox Festival is a software package that includes a text-to-speech synthesis system that can be used to create synthetic voices for use in applications such as speech recognition systems, dialog systems, and text-to-speech systems. The software allows users to create synthetic voices by recording and analyzing speech data, and then using that data to generate new synthetic speech. The goal of Festvox is to make it easy for people to create custom synthetic voices that can be used in a variety of applications.

Wikipedia:

See also:

Blizzard Challenge

Doing research on a deployed spoken dialogue system: one year of let’s go! experience. A Raux, D Bohus, B Langner, AW Black, M Eskenazi – INTERSPEECH, 2006 – cs.cmu.edu … Public! Taking a Spoken Dialog System to the Real World, Interspeech 2005 (Eurospeech), Lis- bon, Portugal, 2005. … 1992. 4 Black, A. and Lenzo, K., Building Voices in the Festi- val Speech System, http://festvox.org/bsv/, 2000. … Cited by 95 Related articles All 18 versions Cite Save More

Olympus: an open-source framework for conversational spoken language interface research D Bohus, A Raux, TK Harris, M Eskenazi… – Proceedings of the …, 2007 – dl.acm.org … Building Voices in the Festival Speech System, http://festvox.org/bsv/, 2000. Bohus, D., Grau Puerto, S., Huggins-Daines, D., Keri, V., Krishna, G., Kumar, K., Raux, A., Tomko, S., 2007. Conquest – an Open-Source Dialog System for Conferences, in Proc. … Cited by 99 Related articles All 31 versions Cite Save

Responding to student uncertainty in spoken tutorial dialogue systems H Pon-Barry, K Schultz, EO Bratt, B Clark… – International Journal of …, 2006 – IOS Press … answer(s) held by the tutor. 2 http://www.nuance.com 3 http://festvox.org Page 7. H. Pon-Barry et al. / Responding to Student Uncertainty in Spoken Tutorial Dialogue Systems 177 Tutor The tutor component contains the tutorial … Cited by 67 Related articles All 16 versions Cite Save

Comparing spoken dialog corpora collected with recruited subjects versus real users H Ai, A Raux, D Bohus, M Eskenazi… – Proc. of the 8th SIGdial …, 2007 – cs.cmu.edu … Figure 1: Example Dialog with Let’s Go. mation system, a telephone-based dialog system that provides schedule information for buses in the Pitts- burgh area (Raux et al., 2005). … Building Voices in the Festival Speech System. http://festvox.org/bsv/ D. Bohus and A. Rudnicky. … Cited by 49 Related articles All 14 versions Cite Save More

Improving the understandability of speech synthesis by modeling speech in noise B Langner, AW Black – Proc. ICASSP, 2005 – cs.cmu.edu … The CMU SIN database is publicly avail- able at http://festvox.org/cmu sin/. … 7. REFERENCES [1] A. Raux, B. Langner, A. Black, and M. Eskenazi, “LET’S GO: Improving spoken dialog systems for the elderly and non-native,” in Eurospeech, Geneva, Switzerland., 2003. … Cited by 27 Related articles All 11 versions Cite Save More

The RavenClaw dialog management framework: Architecture and systems D Bohus, AI Rudnicky – Computer Speech & Language, 2009 – Elsevier … Full-size image (29 K) Full-size image (29 K) Fig. 1. A classical dialog system architecture. … For instance, a dialog system that assists users in making flight reservations must communicate with a database to obtain information and to perform the required transactions. … Cited by 109 Related articles All 4 versions Cite Save

The African Speech Technology Project: An Assessment. JC Roux, PH Louw, T Niesler – LREC, 2004 – comp.nus.edu.sg … understanding A finite-state architecture was adopted for the natural language understanding component of the spoken dialogue system (Niesler & … Speech synthesis A software interface was developed to the Festival toolbox (www.festvox.org) to achieve high quality limited … Cited by 36 Related articles All 9 versions Cite Save More

Creating a database of speech in noise for unit selection synthesis B Langner, AW Black – Fifth ISCA Workshop on Speech Synthesis, 2004 – isca-speech.org … 5. REFERENCES [1] A. Raux, B. Langner, A. Black, and M. Eskenazi, “LET’S GO: Improving spoken dialog systems for the elderly and non-native,” in Eurospeech … [3] A. Black and K. Lenzo, “Building voices in the Festival speech synthesis system,” http://festvox.org/bsv/, 2000. … Cited by 14 Related articles All 10 versions Cite Save

Non-Native Users in the Let’s Go!! Spoken Dialogue System: Dealing with Linguistic Mismatch. A Raux, M Eskenazi – HLT-NAACL, 2004 – acl.ldc.upenn.edu … 5 Conclusion and Future Directions In this paper, we described the Let’s Go!! bus information system, a dialogue system targetted at non-native speak- ers of English. … 1998. The Festival speech synthesis system. http://festvox.org/festival. D. Bohus and A. Rudnicky. 2003. … Cited by 19 Related articles All 18 versions Cite Save More

Speech synthesis for educational technology A Black – Proc. ISCA Workshop on Speech and Language …, 2007 – isca-speech.org … China, 2000. [4] Raux, A., Langner, B., Bohus, D., Black, A. and Eskenazi, M. Let s Go Public Taking a Spoken Dialog System to the Real World , Interspeech 2005, Lisbon, Portugal, 2005. [5 … Portugal, 2005. http://festvox.org/blizzard. [10 … Cited by 14 Related articles All 10 versions Cite Save

Multilingual spoken language processing P Fung, T Schultz – Signal Processing Magazine, IEEE, 2008 – ieeexplore.ieee.org … decade, the performance of spoken language understanding systems has improved dramatically, including speech recognition, dialog systems, speech summarization … collecting several hours of well-recorded speech in the target language (Black and Lenzo, http://festvox.org). … Cited by 30 Related articles All 3 versions Cite Save

Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. H Cuayáhuitl, S Renals, O Lemon, H Shimodaira – INTERSPEECH, 2006 – cstr.ed.ac.uk … faster learning than the traditional RL framework.}, categories = {reinforcement learning, spoken dialogue systems} } @article{vepa_king_tsap05 … Challenge Workshop (Interspeech Satellite)}, year = {2006}, month = {September}, note = {(http://festvox.org/blizzard/blizzard2006 … Cited by 22 Related articles All 10 versions Cite Save More

Evaluating the effectiveness of Scot: a spoken conversational tutor H Pon-Barry, B Clark, EO Bratt… – Proc. of ITS …, 2004 – people.cs.umass.edu … com 3 http://festvox.org Page 5. 3.1 Participants Thirty native English speakers were recruited to participate in this experiment (16 male, 14 female). All subjects were novices in the domain of damage control, twenty-nine had no prior experience in dialogue system studies. … Cited by 24 Related articles All 9 versions Cite Save More

The local language speech technology initiative R Tucker, K Shalonova – SCALLA Conf, 2004 – evalda.org … technologies required in a TTS system are also useful for other speech and language-related applications such as Dialogue Systems, Machine Translation … and open-source code such as Festival (http://www.cstr.ed.ac.uk/projects/festival) and Festvox (http://festvox.org/) to build … Cited by 12 Related articles All 9 versions Cite Save More

Prominence prediction for supersentential prosodic modeling based on a new database JY Zhang, AR Toth, K Collins-Thompson… – Fifth ISCA Workshop …, 2004 – isca-speech.org … a larger context, be that a dialog system or longer prose, the limita- tions in prosodic continuity become clear. Specifically, it … it fully. Our database is named Facts and Fables and is available at http://www.festvox.org/cmu faf/. In ad … Cited by 12 Related articles All 10 versions Cite Save

Automatic prediction of speaker age using CART S Schötz – Working Papers, Lund University, Dept. of Linguistics …, 2005 – lup.lub.lu.se … In which situations would it be acceptable for a spoken dialog system to behave like a human being, and in which would it be completely out of the question? These questions remain to be answered. … Website: http://www.festvox.org/festvox/index.html. 1999-2003. … Cited by 11 Related articles All 2 versions Cite Save More

A research platform for multi-agent dialogue dynamics TK Harris, S Banerjee, AI Rudnicky… – Robot and Human …, 2004 – ieeexplore.ieee.org … The Festival speech synthesis system. (http://festvox.org/festival) Bohus, D., & Rxdnicky, A. (2002, November). Inte- grating multiple knowledge sources for utterance- level confidence annotation in the CMU Com- municator spoken dialog system (Tech. R,ep. No. … Cited by 8 Related articles All 9 versions Cite Save

Festvox: Tools for Creation and Analyses of Large Speech Corpora GK Anumanchipalli, K Prahallad… – Workshop on Very Large …, 2011 – www-2.cs.cmu.edu … 6. References [1] http://www.festvox.org, [Online]. … 20–4. [5] A. Raux, B. Langner, D. Bohus, AW Black, and M. Eskenazi, “Lets go public! taking a spoken dialog system to the real world,” in in Proc. of Interspeech 2005, Lisbon, Portugal, 2005. … Cited by 4 Related articles All 9 versions Cite Save More

Voice creation for conversational fairy-tale characters J Gustafson, K Sjölander – Fifth ISCA Workshop on Speech …, 2004 – isca-speech.org … 5. THE NICE FAIRY-TALE GAME SYSTEM The NICE Fairytale System is a multimodal spoken dialogue system where the user can explore a fairy-tale world inspired by the … [9] Black A., Taylor P., and Caley R., “The Festival speech synthesis system,” http://festvox.org/festival. … Cited by 5 Related articles All 10 versions Cite Save

A corpus-based approach to expressive speech synthesis E Eide, A Aaron, R Bakis, W Hamza… – Fifth ISCA Workshop …, 2004 – isca-speech.org … The use of markup as an interface between the user and the engine enables our expressive TTS engine to be easily integrated into a dialog system. … http://www.festvox.org [3] http://www.research. att.com/projects/tts/demo.html [4] http://www.rhetorical.com/cgi-bin/demo.cgi … Cited by 62 Related articles All 9 versions Cite Save

Contextualizing Reflective Dialogue in a Spoken Conversational Tutor. H Pon-Barry, B Clark, K Schultz… – Journal of …, 2005 – search.ebscohost.com … The system responds to the student via a FestVox (festvox.org) limited domain synthesized voice. SCoT’s Tutorial Architecture … Aleven, V., Koedinger, K., & Popescu, O. (2003). A tutorial dialog system to support self-explanation: Evaluation and open questions. … Cited by 4 Related articles All 21 versions Cite Save

Segmentation of monologues in audio books for building synthetic voices K Prahallad, AW Black – Audio, Speech, and Language …, 2011 – ieeexplore.ieee.org … BUILDING VOICES FROM AUDIO BOOKS The methods of segmenting long speech files using SFA- 1 and SFA-2 are implemented as a package named IN- TERSLICE. This is integrated in FestVox (www.festvox.org), which is an open source toolkit for building synthetic voices. … Cited by 19 Related articles All 8 versions Cite Save

An analysis of machine translation and speech synthesis in speech-to-speech translation system K Hashimoto, J Yamagishi, W Byrne… – … , Speech and Signal …, 2011 – ieeexplore.ieee.org … [7] C. Boidin, V. Rieser, LVD Plas, O. Lemon, and J. Chevelu “Predicting how it sounds: Re-ranking dialogue prompts based on TTS quality for adaptive Spoken Dialogue Systems,” Proc Interspeech, pp.2487–2490, 2009. … [16] Festival, http://www.festvox.org/festival/ [17] JS White … Cited by 3 Related articles All 5 versions Cite Save

Empirical foundations for intelligent coaching systems EO Bratt, K Schultz, S Peters, T Chen… – The Interservice/Industry …, 2005 – NTSA … Heather Pon-Barry is a Research Engineer at the Research and Technology Center of Robert Bosch Corp, working on in-car dialogue systems She holds an AM in Symbolic Systems from Stanford University, where her thesis focused on tutoring … http://festvox.org) limited domain … Cited by 4 Related articles All 2 versions Cite Save

Acceptance testing of a spoken language translation system R Banchs, A Bonafonte, J Pérez – LREC’06, Genoa, Italy, 2006 – rbanchs.com … Acoustic models for ASR for Spanish and Catalan have been trained using either the TALP-tourism corpus or a 1Available on-line at: “http://www.festvox.org/”. Page 2. … 1995. A task-based evaluation of the TRAINS-95 dialog system. Technical report, Uni- versity of Rochester. … Cited by 3 Related articles All 11 versions Cite Save More

New Parametrizations for Emotional Speech Synthesis AW Black, HT Bunnell, Y Dou, D Perry… – Final Report for …, 2011 – old-site.clsp.jhu.edu … of more controllable emotion speech synthesis is necessary in the improvement of human machine communication in spoken dialog systems, speech to … Kominek J. and Black, A. “CMU Arctic Speech Databases” http://festvox.org/cmu_arctic Metze, F. “Articulatory Features for … Cited by 2 Related articles All 2 versions Cite Save More

Speech synthesis for language tutoring systems R Delmonte – The path of speech technologies in computer- …, 2008 – books.google.com … To generate an appropriate feedback message for a user, a dialogue system with TTS needs a series of linguistic components that roughly parallel those required to analyze and understand a user’s spoken utterance, as Table 6.1 illustrates. … Cited by 3 Related articles Cite Save

Adaptation techniques for speech synthesis in under-resoured languages GK Anumanchipalli, AW Black – SLTU, Spoken Language Technologies …, 2010 – cs.cmu.edu … there is an increasing use and acceptance of text-to-speech(TTS) technologies in the internet, mobile phones and dialogue systems. … Black “CMU ARCTIC databases for speech synthesis”, Tech Report CMU-LTI-03-177, Carnegie Mellon Unversity http://festvox.org/cmu arctic … Cited by 2 Related articles All 6 versions Cite Save More

Phonemic Similarity Metrics to Compare Pronunciation Methods. B Hixon, E Schneider, SL Epstein – INTERSPEECH, 2011 – homes.cs.washington.edu … [5] RJ Passonneau, SL Epstein, T. Ligorio, J. Gordon, and P. Bhutada, “Learning about voice search for spoken dialogue systems,” presented at … http://festvox.org/bsv [23] S. Fitt, “Documentation and User Guide to UNISYN Lexicon and Post-Lexical Rules,” University of Edinburgh … Cited by 3 Related articles All 9 versions Cite Save

Topic-Dependent Language Model Switching for Embedded Automatic Speech Recognition M Santos-Pérez, E González-Parada… – Ambient Intelligence- …, 2012 – Springer … database consists of 530 utterances and is used for the dialog system used in the CMU Darpa Com- municator [4]. Communicator is an automated telephone based dialog system for booking … (1998) 7. CMU Communicator limited domain website, http://festvox.org/dbs/dbs_com … Cited by 1 Related articles All 4 versions Cite Save

Survey on Speech, Machine Translation and Gesture in Ambient Assisted Living D Anastasiou – 2011 – lodel.irevues.inist.fr … Dialog systems and their components will be also pointed out. … 1 Automatic Speech Recognition of speech recognition and synthesis, and Machine Translation (MT); speech-to-speech translation, dialog systems, and gesture recognition and localization will also be discussed. … Cited by 1 Related articles Cite Save More

Quality of service and communicative competence in NLG evaluation K Jokinen – Proceedings of the Eleventh European Workshop on …, 2007 – dl.acm.org … One of the frameworks used in dialogue system evaluation is the PARADISE framework (Walker et al. … Evaluation can be performed via web-based interaction (cf. the Blizzard Challenge for evalu- ating corpus-based speech synthesis: http://festvox.org/blizzard/). … Cited by 1 Related articles All 10 versions Cite Save

Automatic estimation of speaker age using CART S Schötz – Lund Working Papers in Linguistics, 2009 – cts.lub.lu.se … In which situations would it be acceptable for a spoken dialog system to behave like a human being, and in which would it be completely out of the question? These questions remain to be answered. … Website: http://www.festvox.org/festvox/index.html. 1999-2003. … Cited by 1 Related articles All 7 versions Cite Save More

Machine learning for text selection with expressive unit-selection voices. D Espinosa, M White… – …, 2010 – 20.210-193-52.unknown.qala.com. … … Figure 1 shows the re- duction in overall costs after adding the top n utterances, for 5http://festvox.org 1127 … deliverables/ annex2.5/apml-howto.pdf [11] M. Foster and M. White, “Assessing the impact of adaptive gen- eration in the COMIC multimodal dialogue system,” in … Cited by 1 Related articles All 3 versions Cite Save

Improving the Understandability of Speech Synthesis by Modeling Speech B Langner, AW Black – in noise,” in ICASSP05, 2005 – Citeseer … 2000. 51, The Festival speech synthesis system,” http://festvox.org/festival – Black, Taylor, et al. – 1998. 47, … al. – 1995. 20, LET’S GO: Improving spoken dialog systems for the elderly and non-native – Raux, Langner, et al. – 2003. 11, … All 3 versions Cite Save More

Applause: A Learning Tool for Low-Resource Languages N Wolfe, VV Vemuri, LJ Martin, F Metze, AW Black – celebrate-language.com … From CLAP resources collected by the first author, we have the foundations of a dialog system with which we plan to integrate our pronunciation scoring tool. … References [1] Black, AW Festvox: CMU ARCTIC speech synthesis databases. http://festvox.org/cmu_arctic/. … Cite Save More

Using Speaker ID to Discover Repeat Callers of a Spoken Dialog System. A Fandrianto, B Langner… – …, 2011 – 20.210-193-52.unknown.qala.com. … … A. Black, and M. Eskenazi, “Doing research on a deployed spoken dialogue system: One year of Let’s Go! experience,” in Interspeech 2006, Pittsburgh, PA, 2006. [3] A. Black, P. Taylor, and R. Caley, “The Festival speech synthesis system,” 1998, http://festvox.org/festival. 1320 Related articles All 5 versions Cite Save

Phonetically balanced Bangla speech corpus SM Murtoza, F Alam, R Sultana, S Absar, M Khan – cle.org.pk … Building voices in the Festival speech synthesis system, http://festvox.org/bsv. … Handbook of Multimodal and Spoken Dialogue Systems: Resources, Terminology and Product Evaluation (The Springer International Series in Engineering and Computer Science), Springer, August … Related articles All 2 versions Cite Save More

Limited domain synthesis for a speech-enabled MP3 player R Jonson – 2005 – ling.gu.se … better results. FestVox [www.festvox.org,2005] presents a demo of an experimental limited domain synthesis built for the dialogue system Communicator [Rudnicky et al, 2000]: a telephone-based travel planner. FestVox also … Related articles Cite Save More

Natural Interactive Communication for Edutainment J Gustafson, J Boye, L Bell, M Wirén, JC Martin… – 2004 – speech.kth.se Page 1. NICE project (IST-2001-35293) Natural Interactive Communication for Edutainment NICE Deliverable D3.7b Multimodal Output Generation Module for the NICE fairy-tale game 8 December 2004 Authors Joakim Gustafson and Johan Boye, … Related articles All 4 versions Cite Save More

Virtual conversation with a real talking head O Gambino, A Augello, A Caronia… – … 2008 Conference on, 2008 – ieeexplore.ieee.org … In the last years has been growing interest towards the use of conversational agents, called chatbots, as an alternative to advanced dialogue system. … IKP Working Papers, 2005. [15] www.loquendo.com 267 Page 6. [16] Festvox: <http://festvox.org/festtut/notes/festtut 7.html> [17 … Related articles Cite Save

Learning Multi-Goal Dialogue Strategies Using Reinforcement Learning With Reduced State-Action Spaces S Fitt, K Richmond, H Kawai, T Toda… – Proc. of …, 2006 – cstr.inf.ed.ac.uk … turns) with faster learning than the traditional RL framework.}, categories = {reinforcement learning, spoken dialogue systems}, month = dec … Blizzard Challenge Workshop (Interspeech Satellite)}, address = {Pittsburgh, USA}, note = {(http://festvox.org/blizzard/blizzard2006.html … Cite Save More

HLTs in Second Language Learning–Can we optimise TTS technology to be a better teaching tool? L Mohasi, ME Dlodlo, CT Rondebosch – researchgate.net … According to [2], in dialogue systems, it would be a simple matter to convey the intended speech act to a TTS system designed to use that information at various … Available: http://festvox.org/blizzard/ blizzard2008.html [8] N. Zulu, “Measuring Language Distance”, Poster, 2007. … Related articles All 3 versions Cite Save More

The VUB Blizzard Challenge 2009 Entry L Latacz, W Mattheyses, W Verhelst – esat.kuleuven.be … In the ES3 task, listeners were also asked to rate the appropriateness of the synthesis in a dialog system. … Special Issue on “Animating Virtual Speakers or Singers from Audio: Lip-Synching Facial Animation” [4] http://www.spraak.org [5] http://www.festvox.org [6] Clark, Robert AJ … Cited by 1 Related articles All 5 versions Cite Save More

Automatic prediction of speaker age using CART IV NILSSON – ling.gu.se … In which situations would it be acceptable for a spoken dialog system to behave like a human being, and in which would it be completely out of the question? These questions remain to be answered. … Website: http://www.festvox.org/festvox/index.html. 1999-2003. … Related articles All 2 versions Cite Save More

Report on available data and annotations, identification of experimental procedures R op den Akker, J Carletta, M Mehu, C Pelachaud… – dcs.gla.ac.uk … Except for corpora set up to inform a spoken dialogue systems application by showing what the system needs to produce, dialogue corpus designers usually aim to capture completely natural, uncontrolled conversations. … http://festvox.org/cmu arctic/cmu arctic report.pdf … Related articles Cite Save More

Survey on Speech, Machine Translation and Gestures in Ambient Assisted Living. http://webcast. in2p3. fr/videos- … D Anastasiou – 2011 – lodel.irevues.inist.fr … We cover initiatives and tools both from Academia and Industry. We also refer to speech-to-speech translation systems which combine speech recognition, machine translation, and text-to-speech. Dialog systems and their components will be also pointed out. … Dialog systems. … Related articles All 2 versions Cite Save More

Speech Interfaces for Equitable Access to Information Technology. M Plauché, U Nallasamy – Information Technologies & …, 2007 – search.ebscohost.com … farmers. Speech interfaces, or spoken dialog systems (SDS), allow users to control computer output (graphics, texts, or audio) by uttering key words or phrases that are interpreted using automatic speech recognition (ASR). … Cited by 31 Related articles All 33 versions Cite Save

Challenges in Interpreting Spoken Military Commands and Tutoring Session Responses Elizabeth Owen Bratt, Karl Schultz, and S Peters, TH King, EM Bender, A Copestake – people.cs.umass.edu … Black, Alan W. and Lenzo, Kevin A. 2003. Building Synthetic Voices for FestVox 2.0 Edition. Available at http://www.festvox.org/bsv/ Bulitko, Vadim V., & Wilkins., David C. 1999. … 2001. The WITAS Multi-Modal Dialogue System I. In Proceedings of Eu- roSpeech 2001. … Related articles All 7 versions Cite Save More

An examination of speech in noise and its effect on understandability for natural and synthetic speech B Langner, A Black – … Institute, CMU, Pittsburgh PA, Tech. Rep. …, 2004 – www-2.cs.cmu.edu … The CMU SIN speech corpus is available at http://www.festvox.org/cmu_sin/. … project [4], we are developing methods to improve spoken dialog systems for non-native speakers and the elderly; specifically, we are working to improve the spoken output to make it more … Cited by 5 Related articles All 7 versions Cite Save More

An Architecture of a Telephone-based System for Speech Data Collection IT Schultz, Z Mihaylova, DIT Schlippe – csl.anthropomatik.kit.edu … 24 5 Implementation 25 5.1 Existing dialog system software tools . . . . . … To summarize, Figure 2.1 gives an overview of a telephone-based dialog system architecture. IP PBX Dialog System components Vo IP G a te wa y PSTN SPEAKER HOLD SEND virtual PBX … Related articles All 4 versions Cite Save More

A Web-Oriented Java3D Talking Head O Gambino, A Augello, A Caronia, G Pilato… – Human-Computer …, 2009 – Springer … Chatbots represent an alternative to ad- vanced dialogue system, which analyze in depth the semantic and syntactic struc- ture of the language. … Festvox, http://www.festvox. org/festtut/notes/festtut_7.html (accessed April 24, 2009) 45. … Related articles All 4 versions Cite Save

Interface design strategies for computer-assisted speech transcription S Luz, M Masoodian, B Rogers, C Deering – Proceedings of the 20th …, 2008 – dl.acm.org … Error correction methods employed in dictation systems [21], for instance, is very different from repair in dialogue systems [10]. … Further investigation of the errors after word addi- tions and training in this small test set shows that 100% 2http://festvox.org/cmu arctic/ speaker … Cited by 5 Related articles All 10 versions Cite Save

Impacts of machine translation and speech synthesis on speech-to-speech translation K Hashimoto, J Yamagishi, W Byrne, S King… – Speech …, 2012 – Elsevier … The rest of this paper is organized as follows. Section 2 reviews related work on integrating natural language generation and speech synthesis for a single-language spoken dialog system and integrating machine translation and speech synthesis for S2ST. … Cited by 2 Related articles All 4 versions Cite Save

Vietnamese Text To Speech NT Hoang, NV Tuyen, TT Son, VD Do, VC Huy – 2012 – ds.libol.fpt.edu.vn MINISTRY OF EDUCATION AND TRAINING. CAPSTONE PROJECT DOCUMENT. Vietnamese Text To Speech. TTS Group. Group Members, Nguy?n Tu?n Hoàng – 00843 Nguy?n V?n Tuy?n – 00965. Tr?n Tr??ng S?n – 01015. V? ?ông ?ô – 01024. T? Công Huy – 01408. … All 4 versions Cite Save

Multilingual and multimodal corpus-based text-to-speech system-PLATTOS M Rojc, I Mlakar – Speech Technologies/Book, 2011 – intechopen.com … The need to repeat and the misinterpretation of speaking terms are common features regarding the majority of users. Such behaviour usually leads towards less-functional and less-efficient spoken dialogue systems (Cassell, 2000). … Cited by 7 Related articles All 5 versions Cite Save More

The Path of Speech Technologies in CALL VM Holland, FP Fisher – The Path of Speech Technologies in …, 2007 – books.google.com … Proceedings of InSTIL 2000 (Integrating Speech Technology in (Lan- guage) Learning), 123–128, Dundee, Scotland. Gao, Y.(2005). Portability challenges in developing interactive dialogue systems. Proceedings of ICASSP05, Philadelphia. Garrett, N.(1992). … Related articles Cite Save

Successful Conclusion of the 2010 Summer Workshop J Du – old-site.clsp.jhu.edu … of more controllable emotion speech synthesis is necessary in the improvement of human machine communication in spoken dialog systems, speech to … Kominek J. and Black, A. “CMU Arctic Speech Databases” http://festvox.org/cmu_arctic Metze, F. “Articulatory Features for … Related articles All 2 versions Cite Save More

Implementation of a Text-to-Speech System with Machine Learning Algorithms in Turkish Z GÖRMEZ – 2009 – nlp.ceng.fatih.edu.tr … There is no teacher every time to teach. TTS applications help to learn pronunciation (González, 2007). • Spoken dialog system, is a dialog system delivered through voice. It has two essential components that do not exist in a text dialog system: A speech … Cited by 1 Related articles Cite Save More

Text-to-speech in vocabulary acquisition and student knowledge models: A classroom study using the REAP intelligent tutoring system C Sisson – Learning, 2007 – cs.cmu.edu … It is documented at http://festvox.org. … Applications such as dialogue systems and storytelling, which require highly accurate intonation, test the capabilities of prosodic modeling in current synthesis technology. 1.4.2 TTS and Computer Aided Language Learning (CALL) … Cited by 2 Related articles All 8 versions Cite Save More

Nonlinear emotional prosody generation and annotation J Tao, J Yu, Y Kang – Chinese Spoken Language Processing, 2006 – Springer … The final emotion state is determined based on the emotion outputs from text-content module. The results were used in the dialogue systems to improve the naturalness/ expressive- eness of the answering speech. … http://festvox.org/docs/speech_tools-1.2.0/x3475.htm 20. … Related articles All 7 versions Cite Save

Existing evaluation and validation of LRs. FLaReNet Deliverable D5. 1 J Odijk, A Toral – 2009 – igitur-archive.library.uu.nl … This campaign was to feature a novel method for the evaluation of prosody in synthesized speech. • MEDIA : Evaluation of Man-Machine dialogue systems. In this case, the task of hotel room reservation (including some local tourist information) is envisaged. … Related articles All 5 versions Cite Save

Statistical analysis of filled pauses’ rhythm for disfluent speech synthesis J Adell, A Bonafonte, D Escudero – Proc. of the 6th IWSS, 2007 – 147.83.50.50 … But nowadays new applications of text-to-speech (TTS) systems like film dubbing, robotics, dialogue systems, speech translation or … [Online]. Available: http://www.festvox.org/blizzard/ blizzard2006.html [3] M. Shröder, “Emotional Speech Synthesis: A Review,” in Proceedings of … Cited by 4 Related articles All 11 versions Cite Save More

Attuning speech-enabled interfaces to user and context for inclusive design: technology, methodology and practice MA Neerincx, AHM Cremers, JM Kessens… – Universal Access in the …, 2009 – Springer Logo Springer. Search Options: … Cited by 7 Related articles All 11 versions Cite Save

The listening room: a speech-based interactive art installation A Wright, A Evans, A Linney, M Lincoln – Proceedings of the 15th …, 2007 – dl.acm.org … misbehaviour of the system, combined with ‘Heather’s’ persistence in continuing the conversation regardless, which differentiate The Listening Room from more typical goal oriented dialogue systems which lack … http://festvox.org/blizzard/ bc2006/cereproc blizzard2006.pdf, 2006 … Cited by 1 Related articles All 7 versions Cite Save

Interactive visualisation techniques for dynamic speech transcription, correction and training S Luz, M Masoodian, B Rogers – Proceedings of the 9th ACM SIGCHI …, 2008 – dl.acm.org … Error correction mechanisms have been studied in the human-factors literature mainly in connection with dictation and dialogue systems [20, 11, 1, 9]. In general, these error correction facilities can be divided into two groups: 1. interactive visual facilities for locating and … Cited by 5 Related articles All 8 versions Cite Save

A Java3D Talking Head for a Chatbot S Gaglio, G Pilato, R Pirrone… – … , 2008. CISIS 2008. …, 2008 – ieeexplore.ieee.org … In the last years have been devel- oped simple dialogue systems called chatbots as an alter- native to advanced dialogue system analyzing in depth the semantic and syntactic structure of the language … [12] Loquendo: [13] Festvox: <festvox.org/festtut/notes … Related articles All 4 versions Cite Save

Unifying Speech Resources for Tone Languages: A Computational Perspective ME Ekpenyong, EAE Urua, VJ Ekong, OU Obot… – International Journal of …, 2011 – ijcis.info … for speech recognition and synthesis researchers (iii) Corpora and corpora processing tools: for researchers working on dialog systems (iv) Corpora … including full tools and documentation for building new voices are available at the Carnegie FestVox Project (http://festvox.org). … Related articles All 3 versions Cite Save More

Unsupervised Language Model Adaptation for Automatic Speech Recognition of Broadcast News Using Web 2.0 IT Schultz, L Gren, DIT Schlippe, DINT Vu – csl.anthropomatik.kit.edu Page 1. Unsupervised Language Model Adaptation for Automatic Speech Recognition of Broadcast News Using Web 2.0 Diplomarbeit am Cognitive Systems Lab Prof. Dr.-Ing. Tanja Schultz Fakultät für Informatik Karlsruher Institut für Technologie von cand. inform. Lukasz Gren … Related articles All 2 versions Cite Save More

Syntactic surprisal affects spoken word duration in conversational contexts V Demberg, AB Sayeed, PJ Gorinski… – Proceedings of the …, 2012 – dl.acm.org … 1.2 Implications for Potential Applications Spoken dialogue systems are of increasing eco- nomic and technological importance in recent times, particularly as it is now feasible to include this tech- nology in everything from small consumer devices to industrial equipment. … Cited by 1 Related articles All 10 versions Cite Save

Survey on Swedish language resources K Elenius, E Forsbom, B Megyesi – ØØÔ»» ÛÛÛº× Ô º Ø º×» ÔÖÓ» …, 2008 – stp.lingfil.uu.se … 18 31,6% Search and knowledge mining 17 29,8% Language learning 12 21,1% Speech technologies 10 17,5% Other, please specify: 11 19,3% Others specified were: – dialog systems – multimodal systems – translation Page 15. Elenius, Forsbom and Megyesi 7 … Cited by 5 Related articles All 9 versions Cite Save More

[BOOK] Contributions to Multilingual Low-Footprint TTS System for Hand-Held Devices M Moberg – 2007 – dspace.cc.tut.fi Page 1. Julkaisu 671 Publication 671 Marko Moberg Contributions to Multilingual Low-Footprint TTS System for Hand-Held Devices Tampere 2007 Page 2. Tampereen teknillinen yliopisto. Julkaisu 671 Tampere University of Technology. Publication 671 Marko Moberg … Cited by 3 Related articles All 2 versions Cite Save

Evaluation of speaker verification security and detection of HMM-based synthetic speech PL De Leon, M Pucher, J Yamagishi… – Audio, Speech, and …, 2012 – ieeexplore.ieee.org Page 1. Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, permission must be obtained from the IEEE by emailing pubs-permissions@ieee.org. This article has been accepted for publication in a future issue of this journal, but has not been fully edited. … Cited by 9 Related articles All 4 versions Cite Save

Thai spelling analysis for automatic spelling speech recognition C Pisarn, T Theeramunkong – Information Sciences, 2008 – Elsevier Spelling speech recognition can be applied for several purposes including enhancement of speech recognition systems and implementation of name retrieval systems. Cited by 1 Related articles All 7 versions Cite Save

[BOOK] Speech, Text and Braille Conversion Technology R Hoffmann – 2008 – Springer Page 1. Speech, Text and Braille Conversion Technology 14 Learning Objectives Text in electronic form is a key and increasingly important intermediary in allow- ing access to information by visually impaired and blind people using assistive technology. … Related articles All 2 versions Cite Save

CRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems N Minematsu, R Kuroiwa, K Hirose… – Workshop …, 2007 – pub.uni-bielefeld.de Page 172. 148 6th ISCA Workshop on Speech Synthesis, Bonn, Germany, August 22-24, 2007 CRF-based Statistical Learning of Japanese Accent Sandhi for Developing Japanese Text-to-SpeechSynthesis Systems Nobuaki … Cited by 6 Related articles All 7 versions Cite Save More

An HMM-Based Brazilian Portuguese Speech Synthesizer And Its Characteristics R Maia, H Zen, K Tokuda, T Kitamura… – Revista da …, 2006 – iecom.dee.ufcg.edu.br Page 1. AN HMM-BASED BRAZILIAN PORTUGUESE SPEECH SYNTHESIZER AND ITS CHARACTERISTICS R. Maia, H. Zen, K. Tokuda, T. Kitamura, FGV Resende Jr. Abstract – Research on speech synthesis area has made … Cited by 7 Related articles All 4 versions Cite Save More

Multimodal and Speech Technology JC Martin, T Schultz – Handbook of Technical Communication, 2012 – books.google.com … of speech and language processing technologies has improved dramatically over the past decade, with an increasing number of commercial and research systems being deployed in a large variety of applications, such as spoken dialog systems, speech summarization and … Related articles Cite Save

Evaluation of Speech Synthesis N Campbell – Evaluation of Text and Speech Systems, 2007 – Springer Page 1. Chapter 2 EVALUATION OF SPEECH SYNTHESIS From Reading Machines to Talking Machines Nick Campbell National Institute of Information & Communications Technology, and ATR Spoken Language Communication … Cited by 5 Related articles All 5 versions Cite Save

Spectral mapping using artificial neural networks for voice conversion S Desai, AW Black, B Yegnanarayana… – Audio, Speech, and …, 2010 – ieeexplore.ieee.org Page 1. 954 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 18, NO. 5, JULY 2010 Spectral Mapping Using Artificial Neural Networks for Voice Conversion Srinivas Desai, Alan W. Black … Cited by 61 Related articles All 10 versions Cite Save

Emotional speech synthesis GO Hofer – Master of Science School of Informatics, University of …, 2004 – inf.ed.ac.uk … voice of an actor. Telephone dialogue systems is a technology that depends heavily on speech synthe- 14 Page 22. … 15 sis systems. In larger dialogue systems the text of a conversation might not be known beforehand and therefore requiring speech synthesis. … Cited by 9 Related articles All 6 versions Cite Save More

Spoken Language Translation F Ehsani, R Frederking, M Rayner, P Bouillon – Speech Technology, 2010 – Springer Logo Springer. Search Options: … Related articles All 6 versions Cite Save

Synthesis of listener vocalizations: towards interactive speech synthesis SC Pammi – 2012 – scidok.sulb.uni-saarland.de … Page 4. Page 5. Short Summary Spoken and multi-modal dialogue systems start to use listener vocaliza- tions, such as uh-huh and mm-hm, for natural interaction. Generation of listener vocalizations is one of the major objectives of emotionally colored … Cited by 1 Related articles Cite Save

Speech processing T Dutoit, S Dupont – 2010 – books.google.com … These are obviously important concepts when the ASR is considered as a component in a larger multimodal dialog system, where the out- comes from different modalities can be better integrated using such confidence measures. … Related articles All 4 versions Cite Save

In Search of Bloom’s Missing Sigma: Adding the conversational intelligence of human H Pon-Barry – 2004 – eecs.harvard.edu … 7 2.3 Spoken Dialogue Systems ….. 7 … The motivation for carrying out a project of this nature stems from related work in the areas of human tutoring, computer tutoring, and dialogue systems. In this chapter, I will discuss … Related articles All 5 versions Cite Save More [HTML] from sciencedirect.com

An HMM-based method for Thai spelling speech recognition C Pisarn, T Theeramunkong – Computers & Mathematics with Applications, 2007 – Elsevier Spelling speech recognition can be applied for several purposes including enhancement of speech recognition systems and implementation of name retrieval systems. Cited by 6 Related articles All 9 versions Cite Save

Focus to emphasize tone analysis for prosodic generation L Narupiyakul, V Keselj, N Cercone… – … & Mathematics with …, 2008 – Elsevier Emphasizing prosody of a sentence at its focus part when producing a speaker’s utterance can improve the recognition rate to hearers and reduce its ambiguity. Cited by 1 Related articles All 5 versions Cite Save

Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory T Toda, AW Black, K Tokuda – Audio, Speech, and Language …, 2007 – ieeexplore.ieee.org Page 1. 2222 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 15, NO. 8, NOVEMBER 2007 Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory … Cited by 256 Related articles All 15 versions Cite Save

Data-driven Natural Language Generation: Making Machines Talk Like Humans Using Natural Corpora B Langner – 2010 – lti.cs.cmu.edu … With the significant improvements that have been seen in speech applications, the long-held goal of building machines that can have hu- manlike conversations has begun to seem more reachable; there ex- ist spoken dialog systems which can now be used effectively by much … Cited by 4 Related articles All 7 versions Cite Save More

Syntax and Parsing A Sarkar – cs.sfu.ca … 27], in speech recognition systems as lan- guage models (a language model assigns a probability to a candidate output sentence—syntax is useful in particular for disfluent or error-prone speech in- put) [18], dialog systems [31], text to speech systems (www.festvox.org). … Related articles All 2 versions Cite Save More

Automatic Methods for Building Speech Synthesis Corpora SMGF Paulo – 2009 – l2f.inesc-id.pt Page 1. UNIVERSIDADE T ´ECNICA DE LISBOA INSTITUTO SUPERIOR T ´ECNICO Automatic Methods for Building Speech Synthesis Corpora Sérgio Manuel Gaspar Ferreira Paulo (Licenciado) Dissertaç˜ao para obtenç˜ao do Grau de Doutor em … Cite Save More

Intra-lingual and Cross-lingual Prosody Modelling GK Anumanchipalli – 2013 – cs.cmu.edu Page 1. Intra-Lingual and Cross-Lingual Prosody Modelling Gopala Krishna Anumanchipalli Ph.D. Thesis Draft Version: July 23, 2013 Language Technologies Institute Departamento de Engenharia School of Computer Science Electrotécnica e de Computadores … Cited by 1 Related articles All 2 versions Cite Save More

Speech synthesis based on hidden Markov models K Tokuda, Y Nankaku, T Toda, H Zen, J Yamagishi… – 2013 – ieeexplore.ieee.org … ers, voice-over functions for the visually impaired, and communication aids for the speech impaired. More recent applications include spoken dialog systems, communica- tive robots, singing speech synthesizers, and speech-to- speech translation systems. … Cited by 11 Related articles All 4 versions Cite Save

Statistical morphological disambiguation with application to disambiguation of pronunciations in Turkish MO Külekçi – Doktora Tezi, Sabanc? Üniversitesi, Fen Bilimleri …, 2006 – Citeseer Page 1. STATISTICAL MORPHOLOGICAL DISAMBIGUATION WITH APPLICATION TO DISAMBIGUATION OF PRONUNCIATIONS IN TURKISH by M. O?GUZHAN K¨ULEKC?I Submitted to the Graduate School of Engineering and Natural Sciences … Cited by 4 Related articles All 4 versions Cite Save More

Polyglot voice design for unit selection speech synthesis E Kurtic – 2004 – era.lib.ed.ac.uk Page 1. Polyglot Voice Design for Unit Selection Speech Synthesis Emina Kurtic Supervisors: Dr. Korin Richmond, Dr. Robert Clark T H E U NIVER S I T Y O F E DI NBU R G H Master of Science in Speech and Language Processing Theoretical and Applied Linguistics … Cited by 1 Related articles Cite Save