CMUSphinx 2014


See also:

Best CMUSphinx Videos | CMUSphinx 2011 | CMUSphinx 2012 | CMUSphinx 2013


Speech Recognition for Voice-Based Machine Translation. T Duarte, R Prikladnicki, F Calefato, F Lanubile – IEEE software, 2014 – collab.di.uniba.it … This free platform also allows the implementation of continuous-speech, speaker-independent, and large-vocabulary recognition systems (http://cmusphinx. … languages or testing algorithms (http://cmusphinx.sourceforge.net/sphinx4/doc/Sphinx4Whitepaper.pdf). … Cited by 2 Related articles All 8 versions

Context-aware multimedia encryption in mobile platforms M Fazeen, G Bajwa, R Dantu – Proceedings of the 9th Annual Cyber and …, 2014 – dl.acm.org … Such potential speech engines are: Built-in speech recognition engine in the Android platform, “Dragon AudioMining” by Nuance [12], and CMUSphinx [1]. CMUSphinx is utilized in our algorithm due to its performance and portability. …

YaoTalk: A Conversational System for the IIIS Domain T Shi, S Yang – ml-thu.net … In this 1 Project homepage: http://cmusphinx.sourceforge.net/ 2 Rather old tool, but quite effective. See official website. Page 2. … PortAudio CMUSphinx C++ Recognition Synthesis PlainTalk C++ Interface YaoTalk Dialogue System Python … Related articles

Investigation Amazigh speech recognition using CMU tools H Satori, F ElHaoussi – International Journal of Speech Technology, 2014 – Springer … (2013). Retrieved February 10, 2013, from http://www.cmusphinx.sourceforge.net/html/cmusphinx.php. Cole, R., Fanty, M., Muthusamy, Y., & Gopalakrishnan, M. (1990). … Satori, H., Harti, M., & Chenfour, N. (2007). Arabic Speech Recognition system based on CMUSphinx. … Related articles

Say-it: Design of a Multimodal Game Interface for Children Based on CMU Sphinx 4 Framework M Alsulami – 2014 – scholarworks.gvsu.edu … hears nothing. [Figure 1: CMUSphinx Architecture] Then, I faced another issue where the microphone still had some data even when I called stop recording method, which caused some confusion to the user. Thus …

BODO Speech Recognition based on Hidden Markov Model Toolkit (HTK) LK Thakuria, P Acharjee, A Das, PH Talukdar – ijser.org … 99–145, 2011. [8] SPHINX, Sphinx, available at http://cmusphinx.sourceforge.net/html/cmusphinx.php, 2011. [9] K. Kumar and RK Aggarwal “Hindi Speech Recognition System using HTK” International journal of Computing and Business Research ISSN Vol. 2 issue 2 May 2011. … Cited by 1 Related articles

Comparing Open-Source Speech Recognition Toolkits C Gaida, P Lange, R Petrick, P Proba, A Malatawy… – oeft.de … 6 Available from http://htk.eng.cam.ac.uk/ 7 Available from http://julius.sourceforge.jp/ 8 Available from http://cmusphinx.sourceforge.net/ 9 Available from svn://svn.code.sf.net/p/kaldi/code/ 10 Available from http://shout-toolkit.sourceforge.net/ Page 3. …

Tuning a CMU Sphinx-III Speech Recognition System for Polish Language M PŁONKOWSKI, P URBANOVICH – red.pe.org.pl … http://cmusphinx.sourceforge.net/, Apr 2013 [2] Vertanen K., Baseline WSJ Acoustic Models for HTK and Sphinx: Training Recipes and Recognition Experiments, Cavendish Laboratory, University of Cambridge, 2006 [3] Novak J., Dixon P., Furui S., An Empirical Comparison of … Cited by 1 Related articles All 4 versions

Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish N Vanhainen, G Salvi – training, 2014 – lrec-conf.org Page 1. Free Acoustic and Language Models for Large Vocabulary Continuous Speech Recognition in Swedish Niklas Vanhainen and Giampiero Salvi KTH, School of Computer Science and Communication, Department of … Cited by 1 Related articles

The WaveSurfer Automatic Speech Recognition Plugin G Salvi, N Vanhainen – LREC. European Language Resources …, 2014 – lrec-conf.org Page 1. The WaveSurfer Automatic Speech Recognition Plugin Giampiero Salvi and Niklas Vanhainen KTH, School of Computer Science and Communication, Department of Speech Music and Hearing, Stockholm, Sweden {giampi, niklasva}@kth.se Abstract … Cited by 1 Related articles All 2 versions

lex4all: A language-independent tool for building and evaluating pronunciation lexicons for small-vocabulary speech recognition A Vakil, M Paulus, A Palmer, M Regneri – ACL 2014, 2014 – ling.uni-potsdam.de … data which is by definition not available for LRLs. In efforts to overcome this data scarcity problem … 1 http://lex4all.github.io/lex4all/ 2 http://msdn.microsoft.com/en-us/library/hh361572 3 http://www.cmusphinx.org

Speech Recognition Based on Open Source Speech Processing Software P Kłosowski, A Dustor, J Izydorczyk, J Kotas… – Computer Networks, 2014 – Springer … as mobile phones. More details about the Sphinx project are available at: http://cmusphinx.sourceforge.net. 2.2 SPRACH SPRACH is an abbreviation for Speech Recognition Algorithms for Connectionist Hybrids. It involves … Related articles All 2 versions

Long audio alignment for automatic subtitling using different phone-relatedness measures A Álvarez, H Arzelus, P Ruiz – Acoustics, Speech and Signal …, 2014 – ieeexplore.ieee.org … It was inspired on the tool provided by López (www.aucel.com/pln/), and adapted to our phonelist. The English transcriptor was inferred from the Carnegie Mellon Pronouncing Dictionary (svn.code.sf.net/p/cmusphinx/code/trunk/cmudict/) using … Cited by 2

Telugu Speech Recognition System development using MFCC based Hidden Markov Model technique with Sphinx-4 PV Bhaskar, SRM Rao – 2014 – arph.in … 99–145, 2011. [10] SPHINX, Sphinx, available at http://cmusphinx.sourceforge.net/html/cmusphinx.php, 2011. [11] K. Kumar and RK Aggarwal “Hindi Speech Recognition System using HTK” International journal of Computing and Business Research ISSN Vol. … Related articles

Cloud Application Based on Natural Language Processing–An Example of Taipei City Bus Query CL Wang – 2014 – libetd.shu.edu.tw … [14] M. Jarmasz, “Roget’s Thesaurus as A Lexical Resource for Natural Language Processing,” University of Ottawa, Ottawa, 2003. [15] “CMU Sphinx – Speech Recognition Toolkit.” [Online]. Available: http://cmusphinx.sourceforge.net/. [Accessed: 09-Nov-2012]. …

Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling P Ruiz, A Álvarez, H Arzelus – lrec-conf.org … This improved alignment accuracy vs. a binary matrix. 2 http://www.fp7-savas.eu/ 3 http://www.aucel.com/pln/ 4 http://svn.code.sf.net/p/cmusphinx/code/trunk/cmudict 5 http://code.google.com/p/phonetisaurus 6 https://sites.google.com/site/similaritymatrices/ … Cited by 1 Related articles

Mobile Speech Translation for Multilingual Requirements Meetings: A Preliminary Study F Calefato, F Lanubile, D Romita, R Prikladnicki… – collab.di.uniba.it … 1 http://msdn.microsoft.com/en-us/library/ee125663.aspx 2 http://cmusphinx.sourceforge.net 3 http://www.nuance.com/dragon 4 http://www.apple.com/ios/siri 5 https://www.google.com/intl/en/chrome/demos/speech.html translated into the target language. Wang et al. …

Model and Feature Based Compensation for Whispered Speech Recognition S Ghaffarzadegan, H Boril… – … Annual Conference of …, 2014 – mazsola.iit.uni-miskolc.hu … 119–134, 2013. [21] PJ Moreno, Speech Recognition in Noisy Environments, Ph.D. thesis, ECE Department, CMU, PA, USA, 1996. [22] Carnegie Mellon University, “CMUSphinx – Open source toolkit for speech recognition; http://cmusphinx.sourceforge.net/wiki,” 2013.

Improving a Long Audio Aligner through Phone-Relatedness Matrices for English, Spanish and Basque A Álvarez, P Ruiz, H Arzelus – Text, Speech and Dialogue, 2014 – Springer … The phonesets for all the languages are available on our project’s website5. 1 http://htk.eng.cam.ac.uk/ 2 http://www.aucel.com/pln/ 3 http://svn.code.sf.net/p/cmusphinx/code/trunk/cmudict/ 4 http://code.google.com/p/phonetisaurus/ 5 http://sites.google.com/site/similaritymatrices/ … Cited by 1

Pronunciation Practice Support System for Children who Have Difficulty Correctly Pronouncing Words I Masuda-Katsuse – Fifteenth Annual Conference of the …, 2014 – mazsola.iit.uni-miskolc.hu … Chiba Test Center, 2010. (in Japanese) [4] Sphinx-4, A Speech Recognizer Written Entirely in the Java TM-programming Language, Online: http://cmusphinx.sourceforge.net/sphinx4, accessed on March 18, 2013. [5] Kawahara, T …

Accent versus Mispronunciation: modifying speech recognition software to recognize speech and not a perfect accent in the ESL context T Cooper, A Tsukada, R Matoba, Y Naruse – Society for Information …, 2014 – editlib.org … Carnegie Mellon University. (2013). Retrieved from http://cmusphinx.sourceforge.net/ This work was supported by a KAKENHI Grant-in-Aid for Scientific Research (25370681) from Japan Society for the Promotion of Science. Related articles All 2 versions

Recent Improvements of the SpeeD Romanian LVCSR System H Cucu, A Buzo, L Petrică, D Burileanu… – Proc. Int. Conf. on …, 2014 – speed.pub.ro … 4 CMU Sphinx Toolkit: http://cmusphinx.sourceforge.net low sample likelihood. In this case, a maximum likelihood linear transformation [8] (MLLT), which aims at minimizing the loss in likelihood between full and diagonal covariance models is known to be very effective. … Cited by 2 Related articles

Development Of Multi-Modal Control Interfaces For A Semi-Autonomous Wheelchair GBDVJ RBE – 2014 – wpi.edu … 2.2.2.1. CMU SPHINX AND POCKETSPHINX CMUSphinx is a large vocabulary continuous speech recognizer developed by Carnegie Mellon University and released under BSD style license. It recognizes speech by taking the … Related articles

Using closely-related language to build an ASR for a very under-resourced language: Iban SS Juan, L Besacier, B Lecouteux… – Oriental COCOSDA …, 2014 – hal.archives-ouvertes.fr … Available: https://cmusphinx.svn.sourceforge.net/svnroot/cmusphinx [29] D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlíček, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, “The Kaldi speech recognition toolkit,” in IEEE 2011 …

Ut-Vocal Effort Ii: Analysis And Constrained-Lexicon Recognition Of Whispered Speech S Ghaffarzadegan, H Boril, JHL Hansen – utdallas.edu … 12, pp. 713–716. [23] Carnegie Mellon University, “CMUSphinx – Open source toolkit for speech recognition; http://cmusphinx.sourceforge.net/wiki,” 2013. [24] LabRosa, “RASTA/PLP/MFCC feature calculation and inversion; http://labrosa.ee.columbia.edu/matlab,” 2013. … Cited by 2 Related articles All 2 versions

Crosslanguage mapping for small-vocabulary ASR in under-resourced languages: Investigating the impact of source language choice A Vakil, A Palmer – Spoken Language Technologies for Under- …, 2014 – mica.edu.vn … 12:1–12:6, ACM. [6] Microsoft, “Language support,” in Microsoft Speech Platform SDK 11 Documentation. 2012, http://msdn.microsoft.com/en-us/library/hh378476. [7] “CMUSphinx: Open source toolkit for speech recognition,” http://www.cmusphinx.org. … Cited by 1

Voice-operated Home Automation M HANSSON, E JOHANSSON, J LUNDBERG… – 2014 – publications.lib.chalmers.se Page 1. Voice-operated Home Automation Affordable System using Open-source Toolkits Bachelor of Science Thesis in Computer Science MARIKA HANSSON EMIL JOHANSSON JACOB LUNDBERG MARKUS OTTERBERG …

Virtual Learning System (Miqra’ah) for Quran Recitations for Sighted and Blind Students SAE Mohamed, AS Hassanin… – Journal of Software …, 2014 – file.scirp.org … Sphinx—Speech Recognition Toolkit. http://cmusphinx.sourceforge.net/; Lamere, P., Kwok, P., Gouvea, EB, Raj, B., Singh, R., Walker, W. and Wolf, P. (2003) The CMU SPHINX-4 Speech Recognition System. Proceedings of the … Related articles All 3 versions

25 Research and Development Tools in Affective Computing MS Hussain, SK D’Mello, RA Calvo – The Oxford Handbook of …, 2014 – books.google.com … implement signal processing tools in house. There are some additional tools that were not represented in the survey but could be useful, such as Sphinx (cmusphinx.sourceforge.net), openBliSSART (Schuller, Lehmann, Weninger …

The Age of Confidentiality: A Review of the Security in Social Networks and Internet AJ Sánchez, Y Demazeau – Distributed Computing and Artificial …, 2014 – Springer … found. 13 Microsoft Speech API http://www.microsoft.com/downloads/details.aspx?FamilyID=5e86ec97-40a7-453f-b0ee-6583171b4530 14 Cmu Sphinx. http://cmusphinx.sourceforge.net/sphinx4/ 15 Julius in Sourceforge. http … Related articles All 2 versions

Smart Home Implementation Using Data Mining GD Kulkarni, PV Gode, J Pratapreddy, MH Deshmukh… – ijecs.in … Hidden_Markov model [7] http://en.wikipedia.org/wiki/Sphinx [8] SP Pande, Prof. Pravin Sen, “Home Automation system for disabled people using BCI”, International Conference on advances in Engineering and Technology-2014 (ICAET-2014) [9] http://cmusphinx.sourceforge.net …

Educational System for the Holy Quran and Its Sciences for Blind and Handicapped People Based on Google Speech API SAE Mohamed, AS Hassanin… – Journal of Software …, 2014 – file.scirp.org … Carnegie Mellon University (2010) Sphinx—Speech Recognition Toolkit. http://cmusphinx.sourceforge.net/; Lamere, P., Kwok, P., Gouvea, EB, Raj, B., Singh, R., Walker, W. and Wolf, P. (2003) The CMU SPHINX-4 Speech Recognition System. … Related articles All 6 versions

Seshat: A sync system for Audiobooks and eBooks A Dervisevic, T Oskarsson – 2014 – diva-portal.org Page 1. DEGREE PROJECT Computer Engineering Bachelor level G2E, 15 hec Department of Engineering Science, University West, Sweden 2014-06-16 Seshat – A sync system for Audiobooks and eBooks Adnan Dervisevic Tobias Oskarsson Page 2. DEGREE PROJECT …

Multimedia Keyword Spotting (MKWS) Using Training And Template Based Techniques J Patel, KS Maurya, S Kulkarni, V Sakore, S Khonde – ijetae.com … REFERENCES [1] Willie Walker, Paul Lamere, Philip Kwok, Bhiksha Raj, Rita Singh, Evandro Gouvea, Peter Wolf, Joe Woelfel, “Sphinx-4: A Flexible Open Source Framework for Speech Recognition”, www.cmusphinx.sourceforge.net/sphinx4 [2] R. Ordelman, F. de Jong, and M … Related articles

Domain and Subtask-Adaptive Conversational Agents to Provide an Enhanced Human-Agent Interaction D Griol, JM Molina, AS de Miguel – Advances in Practical Applications of …, 2014 – Springer … The attributes are: Origin, Destination, Departure-Date, Arrival-Date, Class, Departure-Hour, Arrival-Hour, Train-Type, Order-Number, and Services. A total of 51 responses were defined for the system, corresponding to the request of 2 cmusphinx.sourceforge.net Page 6. … Cited by 1 Related articles All 2 versions

Implementation of vision based intelligent home automation and security system M Sefat, AAM Khan, M Shahjahan – Informatics, Electronics & …, 2014 – ieeexplore.ieee.org … CMUSphinx is an open source package for creating natural interface. We used the PocketSphinx which is a version of Sphinx that can be used in embedded systems(eg, based on an ARM processor). It analyzes human voices for speech recognition. …

Exploiting the human computational effort dedicated to message reply formatting for training discursive email segmenters NHS Salim – LAW VIII, 2014 – anthology.aclweb.org … 6 http://www.ietf.org/rfc/rfc3676.txt 7 Sphinx 4 edu.cmu.sphinx.util.NISTAlign http://cmusphinx.sourceforge.net 8 https://github.com/romanows/WordSequenceAligner 9 Sentences labelled with SE or S are turned into True, the other ones into False. …

A Practical Application of Evolving Fuzzy-Rule-Based Classifiers for the Development of Spoken Dialog Systems D Griol, JA Iglesias, A Ledezma, A Sanchis – … Intelligence Applications and …, 2014 – Springer … In this case, the confidence score assigned to the attribute Date is very low. Thus, a “2” value is added in the corresponding position for this attribute. The concept (Hour) and 1 cmusphinx.sourceforge.net Page 7. A Practical Application of Evolving Fuzzy-Rule-Based …

A multistage algorithm for fricative spotting D Ruinskiy, Y Lavner – Pacific Voice Conference (PVC), 2014 …, 2014 – ieeexplore.ieee.org … Available: http://www.sipl.technion.ac.il/Info/Downloads_Matlab ADT_e.shtml. [12] Carnegie Mellon University, “CMU Sphinx – Open Source Toolkit For Speech Recognition,” [Online]. Available: http://cmusphinx.sourceforge.net/. …

Noise Management in Mobile Speech Based Health Tools N Yadav, L Daudet, C Poellabauer, P Flynn – m-lab.cse.nd.edu … boundaries. The recognition accuracy is defined as the number of words correctly recognized within their true timing boundaries in the recorded speech to an accuracy of 10%. If 1 http://cmusphinx.sourceforge.net/ Page 2. the …

Better phone alignment for confidence measures in voice based querying N Prajapati, H Tulsiani, J Gada… – … (NCC), 2014 Twentieth …, 2014 – ieeexplore.ieee.org … REFERENCES [1] CMU-Sphinx: Open Source Toolkit for Speech Recognition. http://cmusphinx.sourceforge.net/. [2] T. Godambe and K. Samudravijaya, “Speech data acquisition for voice based agricultural information retrieval”, Proc. … Related articles All 4 versions

Augmented Reality: An Observational Study Considering the MuCy Model to Develop Communication Skills on Deaf Children J Cadeñanes, MAG Arrieta – Hybrid Artificial Intelligence Systems, 2014 – Springer … Mellon University. Sphinx Speech Recognition Toolkit (2014), http://cmusphinx.sourceforge.net/ (viewed on March 15, 2014) 5. Billinghurst, M., Kato, H., Poupyrev, I.: The Magic Book: A Transitional AR Interface. Computers and … Related articles All 2 versions

Object Learning with Natural Language in a Distributed Intelligent System: A Case Study of Human-Robot Interaction S Heinrich, P Folleher, P Springstübe, E Strahl… – … Practical Applications of …, 2014 – Springer … 3 SMACH is a python-based library for building hierarchical state machines—www.ros.org/wiki/smach/. 4 PocketSphinx is an open source automatic speech recognition (ASR) system, optimised for hand-held devices and robots—www.cmusphinx.sourceforge.net. … Cited by 2 Related articles All 5 versions

Using bands of frequencies for vowel recognition for Polish language M Płonkowski – International Journal of Speech Technology – Springer … Oxford: Oxford University Press. CMU Sphinx (2014). The Carnegie Mellon Sphinx Project. Retrieved from http://cmusphinx.sourceforge.net/ June 2014. Espy-Wilson, CY (2004). Articulatory strategies, speech acoustics and variability (pp. B62–B76). …

Exploring the Role of Stress in Bayesian Word Segmentation using Adaptor Grammars B Börschinger, M Johnson – transacl.org … 1987; Brent, 1999), following the procedure just outlined. This corpus is a de-facto standard for evaluat- … 4 http://svn.code.sf.net/p/cmusphinx/code/trunk/cmudict/cmudict.0.7a … Cited by 3 Related articles All 2 versions

Semi-automated Speaker Adaptation: How to Control the Quality of Adaptation? AV Savchenko – Image and Signal Processing, 2014 – Springer … Automation and Remote Control 74(7), 1225–1232 (2013) 6. Marple Jr, SL: Digital Spectral Analysis: With Applications. Prentice-Hall Series in Signal Processing (1989) 7. CMU Sphinx, http://cmusphinx.sourceforge.net/ Page 9. 646 AV Savchenko … Related articles

An Open Source Corpus and Recording Software for Distant Speech Recognition with the Microsoft Kinect D Schnelle-Walka, S Radeck-Arneth… – … ; 11. ITG Symposium; …, 2014 – ieeexplore.ieee.org … E.g., the three microphones to the right merely span a distance of approximately 10 cm. This 1 http://cmusphinx.sourceforge.net/wiki/tutorialam ITG-Fachbericht 252: Speech Communication, 24. – 26. September 2014 in Erlangen ISBN 978-3-8007-3640-9 … Cited by 1

Investigating stranded GMM for improving automatic speech recognition A Gorin, D Jouvet, E Vincent… – Hands-free Speech …, 2014 – ieeexplore.ieee.org … ICASSP, 2013, pp. 69–74. [16] D. Tran, E. Vincent, and D. Jouvet, “Extension of uncertainty propagation to dynamic MFCCs for noise robust ASR,” in Proc. ICASSP (to appear), 2014. [17] “http://cmusphinx.sourceforge.net,” 2013. … Cited by 1

Robust speech recognition using temporal masking and thresholding algorithm C Kim, KK Chin, M Bacchiani, RM Stern – Fifteenth Annual Conference of …, 2014 – 193.6.4.39 … [Online]. Available: http://cmusphinx.sourceforge.net/wiki/download/ [26] SG McGovern, “A model for room acoustics,” http://www.sgm-audio.com/research/rir/rir.html. [27] J. Allen and D. Berkley, “Image method for efficiently simulating small-room acoustics,” J. Acoust. Soc. …

Classification of a Sequence of Objects with the Fuzzy Decoding Method AV Savchenko, LV Savchenko – Rough Sets and Current Trends in …, 2014 – Springer Page 1. C. Cornelis et al. (eds.): RSCTC 2014, LNAI 8536, pp. 309–318, 2014. © Springer International Publishing Switzerland 2014 Classification of a Sequence of Objects with the Fuzzy Decoding Method Andrey V. Savchenko1 and Lyudmila V. Savchenko2 …

Automatic discovery of a phonetic inventory for unwritten languages for statistical speech synthesis PK Muthukumar, AW Black – Acoustics, Speech and Signal …, 2014 – ieeexplore.ieee.org … 1762–1765. [19] Kishore Prahallad, Naresh Kumar, Venkatesh Keri, S Rajendran, and Alan W Black, “The IIIT-H Indic speech databases,” in INTERSPEECH, 2012. [20] CMU Sphinx, “CMU Sphinx open source speech recognizer,” http://cmusphinx.sourceforge.net/. …

Automatic Recognition and Synthesis System of Arabic Digit H TEBBI, M HAMADOUCHE, H Azzoune – inase.org … 1) Enlargement of the vocabulary for all digits; 2) Recognition of continuous speech; 3) Recognition in speaker independent mode; 4) Use of the HMM, neural networks and hybrid methods. ASR using CMUSphinx [7] 85.55% DTW-Based ArSR [8] 86% …

Text Alignment from Bimodal Mathematical Expression Sources C VIARD-GAUDIN – 2014 – people.sabanciuniv.edu … Interspeech, 2009, pp. 2123-2126. “Cmu sphinx system,” http://cmusphinx.sourceforge.net, Accessed on February, 20th, 2014. PK Atrey, MA Hossain, A. El Saddik, and MS Kankanhalli, “Multimodal fusion for multimedia analysis: A survey,” Multimedia Systems, vol. 16(6), pp. …

About Combining Forward and Backward-Based Decoders for Selecting Data for Unsupervised Training of Acoustic Models D Jouvet, D Fohr – INTERSPEECH 2014, 15th Annual Conference of the …, 2014 – hal.inria.fr … [15] Corpus EPAC: Transcriptions orthographiques, catalogue ELRA (http://catalog.elra.info), reference ELRA-S0305. [16] NIST evaluation tools: http://www.itl.nist.gov/iad/mig//tools/ [17] Sphinx. [Online]: http://cmusphinx.sourceforge.net/, 2011. [18] HTK. …

Component Structuring and Trajectory Modeling for Speech Recognition A Gorin, D Jouvet – Interspeech, 2014 – hal.inria.fr … [14] RG Leonard and G. Doddington, “Tidigits speech corpus,” Texas Instruments, Inc, 1993. [15] CMU, “Sphinx toolkit http://cmusphinx.sourceforge.net,” 2014. [16] DC Burnett and M. Fanty, “Rapid unsupervised adaptation to children’s speech on a connected-digit task,” in Proc. …

Structured GMM based on unsupervised clustering for recognizing adult and child speech A Gorin, D Jouvet – Statistical Language and Speech Processing, 2014 – Springer … In: Proceedings of the ICSLP, vol. 2, pp. 1145–1148. IEEE (1996). 4. CMU: Sphinx toolkit (2014), http://cmusphinx.sourceforge.net. 5. Gales, MJ: Maximum likelihood linear transformations for HMM-based speech recognition. Comput. Speech Lang. …

Determinants of Lengths of Repetition Disfluencies: Probabilistic syntactic constituency in speech production Z Harmon, V Kapatsinski – blogs.uoregon.edu … Psychological Methods 14.323–348. Weide, R. 1998. CMU Pronouncing Dictionary. http://svn.code.sf.net/p/cmusphinx/code/trunk/cmudict/cmudict.0.7a Zipf, GK 1949. Human Behavior and the Principle of Least Effort. Oxford: Addison-Wesley.

A kiosk based model for employment generation in rural areas KP Dipin, J Bose, VG Vivek – Global Humanitarian Technology …, 2014 – ieeexplore.ieee.org … 153-173, April 2009. [15] CMU Sphinx (cmusphinx.sourceforge.net) [16] VOCE (voce.sourceforge.net) [17] Apache OpenNLP. The Apache Software Foundation (opennlp.apache.org) [18] List of Natural Languages Processing Toolkits …

Keyword spotting in a-capella singing AM Kruspe, I Fraunhofer – terasoft.com.tw … The resulting MLPs are then used to recognize phonemes in our a-capella dataset, thus generating phoneme posteriorgrams. 1 http://cmusphinx.sourceforge.net/ 15th International Society for Music Information Retrieval Conference (ISMIR 2014) 272 Page 3. …

[BOOK] Raspberry Pi Robotic Projects R Grimmett – 2014 – books.google.com Page 1. Raspberry Pi Robotic Projects Create amazing robotic projects on a shoestring budget Richard Grimmett PACKT] Page 2. Table of Contents RaspberryPiRobotic Projects Credits About the Author About the Reviewers … All 2 versions

Automated Analysis of Child Phonetic Production Using Naturalistic Recordings D Xu, JA Richards, J Gilkerson – Journal of Speech, Language, and Hearing …, 2014 – ASHA … Singer, JD, & Willett, JB (2003). Applied longitudinal data analysis. New York, New York: Oxford University Press Sphinx Website, Retrieved from http://cmusphinx.sourceforge.net/ Stark, RE (1980). Prespeech segmental feature development. … All 8 versions

Voice Controlled Intelligent Wheelchair using Raspberry Pi A Naeem, A Qadir, W Safdar – International Journal of …, 2014 – techpublications.org … [Accessed 26 April 2014]. [10] “CMU Sphinx Speech Recognition,” http://sourceforge.net/, [Online]. Available: http://sourceforge.net/projects/cmusphinx/. [Accessed 26 April 2014]. [11] “MB1000 LV-MaxSonar-EZ0™,” [Online]. …

VUI Design C Haskell – 2014 – scholarworks.gvsu.edu Page 1. Grand Valley State University ScholarWorks@GVSU Technical Library School of Computing and Information Systems 2014 VUI Design Corey Haskell Grand Valley State University Follow this and additional works at: http://scholarworks.gvsu.edu/cistechlib …

SocRob@ Home: Team Description Paper for the Competition Event RoCKIn@ Home 2014 R Ventura, J Messias, A Ahmad – safe.wachenfeld-golla.de … All user responses are explicitly confirmed by the robot. The outcome of each 6 http://espeak.sourceforge.net/ 7 http://cmusphinx.sourceforge.net/ front LRF rear LRF Mecano wheels Fig. 3. ISR-Cobot’s future navigation platform with laser range finders (LRFs). …

Trainable Videorealistic Facial Animation T Ezzat, G Geiger, T Poggio – ll3.ai.mit.edu Page 1. Trainable Videorealistic Facial Animation Tony Ezzat Gadi Geiger Tomaso Poggio tonebone@ai.mit.edu gadi@ai.mit.edu tp@ai.mit.edu Center for Biological and Computational Learning Massachusetts Institute of Technology Abstract … Related articles All 5 versions

Generation of syllable level templates using dynamic programming for statistical speech synthesis R SRIKANTH – 2014 – web2py.iiit.ac.in Page 1. Generation of syllable level templates using dynamic programming for statistical speech synthesis Thesis submitted in partial fulfillment of the requirements for the degree of Master of Science (by Research) in Electronics and Communications Engineering by …

Unsupervised Acoustic Model Training Using Multiple Seed Asr Systems H Cucu, A Buzo, C Burileanu – speed.pub.ro … For both LMs the 2 CMU Sphinx Toolkit: http://cmusphinx.sourceforge.net 3 SRI-LM Toolkit: http://www-speech.sri.com/projects/srilm Page 5. number of unigrams was limited to the most frequent 64k (this constraint was imposed by the Sphinx4 ASR decoder). … Related articles

Watermelon Team Description Paper RoCKIn 2014 FJR Lera, F Casado, R Rodríguez, V Matellán, FM Rico – safe.wachenfeld-golla.de … REFERENCES [1] Mathieu Labbé, Find Object, Available Online [09/05/2014]: https://code.google.com/p/find-object/ [2] Daniel Di Marco, Rob Janssen, RoboEarth Platform, Available Online [09/05/2014]: http://wiki.ros.org/roboearth stack [3] CMUSphinx software, Available …

An Automatic Speech Recognition Solution with Speaker Identification Support A Buzo, H Cucu, L Petrică, D Burileanu, C Burileanu – speed.pub.ro … CMU SPUD Workshop, 2010. [8] H. Cucu, “Towards a speaker-independent, large-vocabulary continuous speech recognition system for Romanian”, PhD Thesis, University “Politehnica” of Bucharest, 2011. [9] http://cmusphinx.sourceforge.net/, last accessed 26.12.2013. Related articles

Towards an intelligent framework for multimodal affective data analysis (Forthcoming/Available Online) S Poria, E Cambria, A Hussain, GB Huang – 2014 – dspace.stir.ac.uk Page 1. Accepted Manuscript Towards an intelligent framework for multimodal affective data analysis Soujanya Poria, Erik Cambria, Amir Hussain, Guang-Bin Huang PII: S0893-6080(14)00234-2 DOI: http://dx.doi.org/10.1016/j.neunet.2014.10.005 Reference: NN 3407 …

Data partitioning: an approach to preserving data privacy in computation offload in pervasive computing systems M Al-Mutawa, S Mishra – Proceedings of the 10th ACM symposium on …, 2014 – dl.acm.org Page 1. Data Partitioning: An Approach to Preserving Data Privacy in Computation Offload in Pervasive Computing Systems Mohammad Al-Mutawa and Shivakant Mishra Department of Computer Science University of Colorado …

Mobile Imaging and Computing for Intelligent Structural Damage Inspection ZQ Chen, J Chen – Advances in Civil Engineering, 2014 – hindawi.com Advances in Civil Engineering is a peer-reviewed, open access journal that publishes original research articles as well as review articles in all areas of civil engineering.

Modified Mel Filter Bank to Compute MFCC of Subsampled Speech KK Bhuvanagiri, SK Kopparapu – arXiv preprint arXiv:1410.7382, 2014 – arxiv.org … http://cmusphinx.sourceforge.net/sphinx4/javadoc/edu/cmu/sphinx/frontend/frequencywarp/MelFrequencyFilterBank.html [15] S. Sigurdsson, KB Petersen, and TL Schiler, “Mel frequency cepstral coefficients: An evaluation of robustness of MP3 encoded music,” Conference …

Enabling Dynamic Web for Differently-Abled Community: A Universal Web Accessibility Driven Approach D Gunatilake, C Dharmasiri – 2014 – dspace.sliit.lk … An Evaluation with Blind Users of the Usability of Two Interfaces for a Social Networking Platform” in Proc. of the 2011 iConference. [4] “CMUSphinx Wiki – CMUSphinx Wiki.” [Online]. Available: http://cmusphinx.sourceforge.net/wiki/. …

Robot initiative in a team learning task increases the rhythm of interaction but not the perceived engagement S Ivaldi, SM Anzalone, W Rousseau… – Frontiers in …, 2014 – ncbi.nlm.nih.gov … Cited by 2 Related articles All 12 versions

Birmingham Autonomous Robot Club (BARC)-Team Description Paper L Mudrova, M Becerra, M Chiou, S Bastable – rockincompetition.eu … research/groupings/robotics/, accessed May 2014. [2] “CMU Sphinx- Open Source Toolkit For Speech Recognition,” http://cmusphinx.sourceforge.net, accessed May 2014. [3] STRANDS, STRANDS – Spatio-Temporal Representations …

Unsupervised Topic Modelling on South African Parliament Audio Data N Kleynhans – prasa.org … [15] CM University, “CMU sphinx,” 2014. [Online]. Available: http://cmusphinx.sourceforge.net/ [16] B.-J. Hsu and J. Glass, “Iterative language model estimation: efficient data structure & algorithms,” in Proceedings of Interspeech, vol. 8, Brisbane, Australia, September 2008, pp. …

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language–Amharic MY Tachbelie, ST Abate, L Besacier – Speech Communication, 2014 – Elsevier State-of-the-art large vocabulary continuous speech recognition systems use mostly phone based acoustic models (AMs) and word based lexical and language models. Cited by 3 Related articles All 4 versions

A Mobile Virtual Butler to Bridge the Gap between Users and Ambient Assisted Living: A Smart Home Case Study N Costa, P Domingues, F Fdez-Riverola, A Pereira – Sensors, 2014 – mdpi.com Ambient Intelligence promises to transform current spaces into electronic environments that are responsive, assistive and sensitive to human presence. Those electronic environments will be fully populated with dozens, hundreds or even thousands of connected devices that share …

Towards an intelligent framework for multimodal affective data analysis S Poria, E Cambria, A Hussain, GB Huang – Neural Networks, 2014 – Elsevier An increasingly large amount of multimodal content is posted on social media websites such as YouTube and Facebook everyday. In order to cope with the growth of.

A low-power accuracy-configurable floating point multiplier H Zhang, W Zhang, J Lach – Computer Design (ICCD), 2014 …, 2014 – ieeexplore.ieee.org A Low-Power Accuracy-Configurable Floating Point Multiplier Hang Zhang, Wei Zhang and John Lach Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, USA, 22904 {hz9xa, wz6pc, jlach}@virginia.edu …

Characterizing and detecting spontaneous speech: Application to speaker role recognition R Dufour, Y Estève, P Deléglise – Speech Communication, 2014 – Elsevier Processing spontaneous speech is one of the many challenges that automatic speech recognition systems have to deal with. The main characteristics of this kind o. Related articles All 2 versions

Automatic speech recognition for under-resourced languages: A survey L Besacier, E Barnard, A Karpov, T Schultz – Speech Communication, 2014 – Elsevier Speech processing for under-resourced languages is an active field of research, which has experienced significant progress during the past decade. We propose, i. Cited by 19 Related articles All 4 versions

SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian H Cucu, A Buzo, L Besacier, C Burileanu – Speech Communication, 2014 – Elsevier This study investigates the possibility of using statistical machine translation to create domain-specific language resources. We propose a methodology that aim. Cited by 4 Related articles All 4 versions

Content Based Lecture Video Retrieval Using Speech and Video Text Information H Yang, C Meinel – 2014 – ieeexplore.ieee.org … A drawback of this kind of approach is that the salt and pepper noise of video signal can affect the segmentation accuracy. After observing the content of lecture slides, we realize 2. http://cmusphinx.sourceforge.net/ Page 5. This … Related articles All 2 versions

Development of Phonetic Engine for Punjabi Language S Mittal – 2014 – dspace.thapar.edu Development of Phonetic Engine for Punjabi Language Dissertation Submitted in partial fulfillment of the requirements for the award of degree of Masters of Technology in Computer Science and Applications Submitted By Sakshi Mittal (Roll No. 601203023) …

Flexible context aware interface for ambient assisted living J McNaull, JC Augusto, M Mulvenna… – … -centric Computing and …, 2014 – hcis-journal.com A Multi Agent System that provides a (cared for) person, the subject, with assistance and support through an Ambient Assisted Living Flexible Interface (AALFI) during the day while complementing the night time assistance offered by NOCTURNAL with feedback assistance, is presented … Cited by 3 Related articles All 3 versions

Knowledge Discovery Through Spoken Dialog A Pappu – 2014 – speech.cs.cmu.edu Knowledge Discovery Through Spoken Dialog Aasish Pappu Ph.D. Thesis Language Technologies Institute School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 Thesis Committee Alexander …

A domain-independent statistical methodology for dialog management in spoken dialog systems D Griol, Z Callejas, R López-Cózar… – Computer Speech & …, 2014 – Elsevier This paper proposes a domain-independent statistical methodology to develop dialog managers for spoken dialog systems. Our methodology employs a data-driven cla. Cited by 1 Related articles All 3 versions

A Generic and Scalable Architecture for a Large Acoustic Model and Large Vocabulary Speech Recognition Accelerator Using Logic on Memory OA Bapat, PD Franzon, RM Fastow – ieeexplore.ieee.org This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE Transactions on Very Large Scale Integration (VLSI) Systems … Related articles

Optimizing the Control of a Wi-Fi based Teleoperated Mobile Wheelchair A Fathima – 2014 – wpi.edu Optimizing the Control of a Wi-Fi based Teleoperated Mobile Wheelchair China Project Center, E-term, 2014 A Major Qualifying Project (MQP) Report Submitted to the Faculty of the Worcester Polytechnic Institute …

Handbook of BigDataBench (Version 3.1)—A Big Data Benchmark Suite C Luo, W Gao, Z Jia, R Han, J Li, X Lin, L Wang, Y Zhu… – prof.ict.ac.cn Handbook of BigDataBench (Version 3.1)—A Big Data Benchmark Suite Chunjie Luo, Wanling Gao, Zhen Jia, Rui Han, Jingwei Li, Xinlong Lin, Lei Wang, Yuqing Zhu, and Jianfeng Zhan, Institute of Computing …

Audio-video based character recognition for handwritten mathematical content in classroom videos S Vemulapalli, M Hayes – Integrated Computer-Aided Engineering – IOS Press Integrated Computer-Aided Engineering 00 (2014) 1–16, DOI 10.3233/ICA-140460 … Related articles