NOTICE: The Processors Wiki will End-of-Life on January 15, 2021. It is recommended to download any files or other content you may need that are hosted on processors.wiki.ti.com. The site is now set to read only.

Text-To-Speech-and-Speech-Recognition-on-Android

From Texas Instruments Wiki
Jump to: navigation, search

Text-To-Speech (TTS) on Android[edit]

The Android platform includes a Text-to-Speech (TTS) capability. Also known as "speech synthesis", TTS enables an Android device to "speak" text in various languages. Although all Android-powered devices that support the TTS functionality ship with the TTS-engine (for ex: pico), some devices have limited storage and may lack the language-specific resource files.

Complete TTS Android developer reference.


Enabling TTS on Android[edit]

com.svox.langpack.installer.apk contains speech synthesis data required by the TTS-engine. The following languages are supported:

 - English (US)
 - English (UK)
 - French
 - German
 - Italian
 - Spanish

After successful installation, the Android TTS-engine can be configured in the following menu:

 Settings > Voice input and output > Text to speech settings >

Android-tts-settings.jpg

For example a sample TTS demo can be heard by using the following option:

 Settings > Voice input and output > Text to speech settings > Listen to an Example

Once the speech synthesis data is installed, ANY application running on android can utilise the android TTS-engine to "read out loud" a piece of text.

Attached is a sample application Text_To_Speech_Reloaded_v1.0.apk which can read text typed by the user or from any file.

NOTE: Both the APKs listed are available for free on the android market. They can be installed on the device via adb or from an sdcard.

Speech Recognition on Android[edit]

Android is an open platform, so applications can potentially make use of any speech recognition service on the device that's registered to receive a RecognizerIntent. Google's Voice Search application, which is pre-installed on many Android devices, responds to a RecognizerIntent by displaying the "Speak now" dialog and streaming audio to Google's servers -- the same servers used when a user taps the microphone button on the search widget or the voice-enabled keyboard.

For speech input to be as accurate as possible, it's helpful to have an idea of what words are likely to be spoken. While a message like "Mom, I'm writing you this message with my voice!" might be appropriate for an email or SMS message, you're probably more likely to say something like "weather in Mountain View" if you're using Google Search. You can make sure your users have the best experience possible by requesting the appropriate language model: free_form for dictation, or web_search for shorter, search-like phrases. We developed the "free form" model to improve dictation accuracy for the voice keyboard, while the "web search" model is used when users want to search by voice.

Google's servers support many languages for voice input, with more arriving regularly. You can use the ACTION_GET_LANGUAGE_DETAILS broadcast intent to query for the list of supported languages. The web search model is available for all languages, while the free-form model may not be optimized for all languages.

Complete Speech Input Android developer reference.


Enabling Speech Recognition on Android[edit]

TODO


Third-party alternatives[edit]

A couple of Third-party alternatives exist for Android:

  • iSpeech Text to Speech (TTS) and Speech Recognition (ASR) SDK for Android lets you Speech-enable any Android App quickly and easily with iSpeech Cloud. The SDK has a small footprint and supports 27 TTS and ASR languages and 15 for free-form dictation voice recognition. (Last updated: 10/13/2011)
E2e.jpg {{
  1. switchcategory:MultiCore=
  • For technical support on MultiCore devices, please post your questions in the C6000 MultiCore Forum
  • For questions related to the BIOS MultiCore SDK (MCSDK), please use the BIOS Forum

Please post only comments related to the article Text-To-Speech-and-Speech-Recognition-on-Android here.

Keystone=
  • For technical support on MultiCore devices, please post your questions in the C6000 MultiCore Forum
  • For questions related to the BIOS MultiCore SDK (MCSDK), please use the BIOS Forum

Please post only comments related to the article Text-To-Speech-and-Speech-Recognition-on-Android here.

C2000=For technical support on the C2000 please post your questions on The C2000 Forum. Please post only comments about the article Text-To-Speech-and-Speech-Recognition-on-Android here. DaVinci=For technical support on DaVincoplease post your questions on The DaVinci Forum. Please post only comments about the article Text-To-Speech-and-Speech-Recognition-on-Android here. MSP430=For technical support on MSP430 please post your questions on The MSP430 Forum. Please post only comments about the article Text-To-Speech-and-Speech-Recognition-on-Android here. OMAP35x=For technical support on OMAP please post your questions on The OMAP Forum. Please post only comments about the article Text-To-Speech-and-Speech-Recognition-on-Android here. OMAPL1=For technical support on OMAP please post your questions on The OMAP Forum. Please post only comments about the article Text-To-Speech-and-Speech-Recognition-on-Android here. MAVRK=For technical support on MAVRK please post your questions on The MAVRK Toolbox Forum. Please post only comments about the article Text-To-Speech-and-Speech-Recognition-on-Android here. For technical support please post your questions at http://e2e.ti.com. Please post only comments about the article Text-To-Speech-and-Speech-Recognition-on-Android here.

}}

Hyperlink blue.png Links

Amplifiers & Linear
Audio
Broadband RF/IF & Digital Radio
Clocks & Timers
Data Converters

DLP & MEMS
High-Reliability
Interface
Logic
Power Management

Processors

Switches & Multiplexers
Temperature Sensors & Control ICs
Wireless Connectivity