Java Speech Development Kit: A Tutorial

 in
The authors show how to get started developing voice-activated interfaces using the Speech for Java Development Kit.
The API

The API that is being studied is composed by three packages that form the voice Speech for Java SDK:

  • package javax.speech: has the basic interfaces and implementations for the Engine interface. It also implements the Central class providing access to all the other functions for creating the most important entities of the technology, the synthesizers and recognizers. It is the basis for the remaining packages, remembering that the recognizer and synthesizer interfaces extend the Engine interface. This package also has resources for managing the audio received by the Engine and, as a last feature, its SpeechEvent class is inherited by all the other events of the API.

  • package javax.speech.synthesis: has all the classes and interfaces used for the synthesis process. It also provides the Voice class that determines how the synthesis will take place.

  • package javax.speech.recognition: the largest of the packages, it implements the recognizing and grammar classes, given their close relation. Also, there are the result classes that are very important in this architecture.

The packages are presented in a jar file named ibmjs.jar and the compilation is possible only after adding its full filesystem pathname to the CLASSPATH system variable, as follows:

                   CLASSPATH = %CLASSPATH% ; .../ibmjs.jar

After that and the proper installation of the IBM ViaVoice software, it is necessary to execute the following command:

                      java com.ibm.speech.util.Install
References

You can download the free IBM ViaVoice SDK Java at www.alphaworks.ibm.com/tech/speech.

You can download the free IBM ViaVoice SDK for Linux at www-4.ibm.com/software/speech/dev/sdk_linux.html.

Java Speech Interface from Sun is available at java.sun.com/products/java-media/speech/.

See java.icmc.usp.br for more information.

The authors acknowledge the support for this work from the The Research Foundation of the São Paulo State-FAPESP .

José Rodrigues Júnior is a graduate student at the ICMC-USP University of São Paulo. Dr. Dilvan Moreira is a lecturer at the ICMC-USP.

______________________

Comments

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Reference Links are Dead

Gobinath's picture

hi there
this is a very nice article. i really liked it and i planned to implement this in my project but unfortunately all the references links are dead.

kindly help me out . u can reach me at gobicse@gmail.com....

any help will be of great help..

thanks

Reference Links are Dead

Gobinath's picture

hi there
this is a very nice article. i really liked it and i planned to implement this in my project but unfortunately all the references links are dead.

kindly help me out . u can reach me at gobicse@gmail.com....

any help will be of great help..

thanks

This is really useful. I

siva's picture

This is really useful. I successfully completed my project voice controlled wheel chair with the help of this concept... It works well....

Re: Java Speech Development Kit: A Tutorial

Anonymous's picture

Is there lib to use text to speak in portuguese language ?

Re: Java Speech Development Kit: A Tutorial

Anonymous's picture

i have read your tutorial about the Java Speech Development Kit, it is truely very interesting, i would realy love to develop a programe of my own on this context. please help me im a graduate from the University of Botswana in Computer Science in Botswana, i realy ineterested in speech program but i dont know where to start and what i need. more especialy that i dont have any of the classies that i can use for statup training plz give me an advice.

my Email is matikitim2@yahoo.cu.com

thank you in advace

Moathodi Excellent Matikiti

Re: Java Speech Development Kit: A Tutorial

Anonymous's picture

I am in the same situation as Moathodi, were I would like to develope a program on this subject. I would really appreciate it if you could aid me in kicking it of as I am not sure were to start.

My email is adil_rehman@hotmail.com

Many Thanks

Adil

Speech recognition engines

Mario's picture

I have done a lot of research on the speech development architecture of java..but i am more interested in some package that might make the development of a speech to text application faster. Thanx...hope i get a reply...my emaill is mario_ramotar@yahoo.com

White Paper
Fabric-Based Computing Enables Optimized Hyperscale Data Centers

Today’s modular x86 servers are compute-centric, designed as a least common denominator to support a wide range of IT workloads. Those generic, virtualized IT workloads have much different resource optimization requirements than hyperscale and cloud applications. They have resulted in a “one size fits all” enterprise IT architecture that is not optimized for a specific set of IT workloads, and especially not emerging hyperscale workloads, such as web applications, big data, and object storage. In this report, you will learn how shifting the focus from traditional compute-centric IT architectures to an innovative disaggregated fabric-based architecture can optimize and scale your data center.

Learn More

Sponsored by AMD

White Paper
Red Hat White Paper: Using an Open Source Framework to Catch the Bad Guy

Built-in forensics, incident response, and security with Red Hat Enterprise Linux 6

Every security policy provides guidance and requirements for ensuring adequate protection of information and data, as well as high-level technical and administrative security requirements for a system in a given environment. Traditionally, providing security for a system focuses on the confidentiality of the information on it. However, protecting the data integrity and system and data availability is just as important. For example, when processing United States intelligence information, there are three attributes that require protection: confidentiality, integrity, and availability.

Learn more about catching the bad guy in this free white paper.

Learn More

Sponsored by DLT Solutions