Commercial Speech Recognition APIs

Notes:

A commercial speech recognition API (application programming interface) is a software interface that lets developers integrate speech recognition into their applications. Users can then interact with the application by voice, which allows hands-free control and input. These APIs use machine learning models to transcribe spoken words into written text, and many can be customized for specific accents, dialects, or languages. Examples include Google Cloud Speech-to-Text, Amazon Transcribe, and Microsoft Azure Speech Services.
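
As a rough illustration of what integrating such an API involves, the sketch below builds a request body and reads a transcript out of a response, using the JSON field names of the Google Cloud Speech-to-Text v1 REST interface as commonly documented. The helper names (build_recognize_request, extract_transcript), the placeholder audio, and the sample response are assumptions made for illustration; no network call is made, and a real request would also need credentials.

```python
import base64
import json

# REST endpoint for Google Cloud Speech-to-Text v1 synchronous recognition.
# It is shown for context only; this sketch never contacts it.
RECOGNIZE_URL = "https://speech.googleapis.com/v1/speech:recognize"


def build_recognize_request(audio_bytes, language_code="en-US", sample_rate_hertz=16000):
    """Build the JSON body for a synchronous recognition request.

    Hypothetical helper: assumes raw 16-bit linear PCM audio (LINEAR16).
    """
    return {
        "config": {
            "encoding": "LINEAR16",
            "sampleRateHertz": sample_rate_hertz,
            "languageCode": language_code,
        },
        # Audio is sent inline as base64-encoded text.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


def extract_transcript(response):
    """Join the top-ranked alternative of each result into one string."""
    pieces = []
    for result in response.get("results", []):
        alternatives = result.get("alternatives", [])
        if alternatives:
            pieces.append(alternatives[0].get("transcript", "").strip())
    return " ".join(p for p in pieces if p)


# Placeholder audio (silence) and a made-up response, for illustration only.
request_body = build_recognize_request(b"\x00\x00" * 160)
sample_response = {
    "results": [
        {"alternatives": [{"transcript": "turn on the lights", "confidence": 0.94}]},
        {"alternatives": [{"transcript": " in the kitchen", "confidence": 0.91}]},
    ]
}
print(json.dumps(request_body["config"]))
print(extract_transcript(sample_response))
```

Other vendors follow the same general pattern (audio plus a language setting in, ranked transcript alternatives out), though their field names and authentication differ.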

ASR (Automatic Speech Recognition) refers to the technology that allows computers to recognize and transcribe spoken words into written text.
Speech-to-text refers to the process of converting spoken words into written text using ASR technology.
Voice recognition is the ability of a computer or device to identify and respond to a specific individual’s voice, as opposed to recognizing what was said. It includes speaker identification, which determines which of a set of known speakers is talking, and speaker verification, which confirms whether a speaker is the person they claim to be.
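
The difference between the two speaker tasks can be sketched in a few lines: identification is a one-to-many search over enrolled speakers, while verification is a one-to-one check against a claimed identity. The voiceprint vectors, speaker names, and the 0.8 threshold below are toy assumptions; a real system would derive voiceprints from audio with a trained model.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def identify_speaker(sample, enrolled):
    """One-to-many: return the enrolled name whose voiceprint is most similar."""
    return max(enrolled, key=lambda name: cosine_similarity(sample, enrolled[name]))


def verify_speaker(sample, claimed_name, enrolled, threshold=0.8):
    """One-to-one: accept or reject a claimed identity."""
    if claimed_name not in enrolled:
        return False
    return cosine_similarity(sample, enrolled[claimed_name]) >= threshold


# Toy voiceprints, invented for illustration.
enrolled = {"alice": [0.9, 0.1, 0.3], "bob": [0.1, 0.8, 0.5]}
sample = [0.85, 0.15, 0.35]

print(identify_speaker(sample, enrolled))
print(verify_speaker(sample, "alice", enrolled))
print(verify_speaker(sample, "bob", enrolled))
```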

Resources:

aispeech.com .. Chinese language
dev.smt.docomo.ne.jp .. Japanese language
gracenote.com .. music recognition
laniervoice.com .. medical transcription
mashape.com .. general API marketplace
nexiwave.com .. voicemail to text
nexmo.com .. VoiceXML API
phonetag.com .. voicemail to text
programmableweb.com .. general API directory
talkscribe.com .. voice to text transcription
telapi.com .. IVR API
textshark.com .. voice to text transcription
tradeharbor.com .. voice signature service
venstar.com .. internet of things
yactraq.com .. speech2topics
