Linux-Based Voice Recognition

February 3rd, 2003 by Janine M. Lodato in

For the medical community especially, better voice recognition solutions could mean time, money and lives.
Your rating: None

Editors' Note: The following article is from Linux Journal's sister on-line publication, Linux Gazette.

Let's look at Linux-based voice recognition software from the perspective of China. It would behoove Linux computer makers to begin manufacturing their computers in China. China offers a low-cost method of manufacturing and provides them with a large market for their hardware, which can also be exported to other important markets around the world.

Linux computers have the capacity to accommodate voice recognition systems, such as IBM ViaVoice. This is especially advantageous to Chinese speakers because both Mandarin and Cantonese are complex languages in their written forms. Using a keyboard is next to impossible for Chinese languages, because so many characters are involved in typing a document. Documents could be produced more easily, therefore, using voice recognition software running on a Linux platform.

Other languages also would benefit from using voice recognition software for purposes of speed. Hands-busy, eyes-busy professionals can benefit greatly from voice recognition so they don't have to use a mouse and keyboard to document their findings. In this manner, voice-activated, easily-used telephone systems might benefit all walks of life. Anyone driving a car, for example, will find voice recognition a much more effective way of communicating while manipulating a vehicle.

The health-care market alone may justify the Linux-based voice recognition project. Health-care services are the largest expense of the Group of Ten nations, and it is the fastest growing sector as well. Health-care workers would benefit from using their voices to document patients' treatments. Voice recognition would allow them a hands-free environment in which to analyze, treat and write about particular cases easily and quickly.

In addition, electronically connected medical devices, using a wireless LAN, could benefit:

  • Hospital administration staff

    • Improve the usage efficiency of resources

    • Achieve standardized, quality patient management

    • Dramatically reduce data recording (transcription) errors

    • Lower costs

    • Make any room a telemetry room on demand (that is, do laboratory measurements in any room regardless of where the central equipment is located)

  • Medical staff

    • Be empowered with a 24/7 complete set of vital-sign data

    • Have more time for hands-on care

    • See changes in patient status immediately to enable quicker responses

For life sciences fields, the simplicity, reliability and low cost of using Linux for servers, tablets, embedded devices and desktops is paramount. Currently, only about 10% of the documents in the health-care field in the USA are produced electronically, due to the cumbersome and unreliable nature of the Windows environment. Thirty percent of the cost of health-care is a direct result of manual creation of the documents. Furthermore, many malpractice cases are caused by imprecise transcriptions of manually scribbled medical records and directives, as anybody who looks at a prescription can attest.

Obviously, the market for these new technologies exists. What remains is for a hungry company with aggressive sales people to tap into that market. Once those sales people distribute the technology, the needs of many will be met and a new mass market will open up that Microsoft is not now filling: assistive technology (AT). Although the field already exists, it needs to be expanded to include both physically disabled and functionally disabled people.

Yes, voice recognition offers great promise for the future. However, it isn't perfect and needs to be improved. One area that needs improving is lip reading, which would bolster its accuracy. Another area for improvement is multi-tonal voice input; another is directional microphones. In conclusion, every generation of voice recognition software will improve as the hardware for Linux gets bigger and stronger.

__________________________


Special Magazine Offer -- 2 Free Trial Issues!
Receive 2 free trial issues of Linux Journal as well as instant online access to current and past issues. There's NO RISK and NO OBLIGATION to buy. CLICK HERE for offer

Linux Journal: delivering readers the advice and inspiration they need to get the most out of their Linux systems since 1994.

Sorry, offer available in the US only. International orders, click here.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Re: Linux-Based Voice Recognition

On March 27th, 2003 Anonymous says:

Hi your voice recognition really sucks ass large

so does yours

On January 12th, 2005 Anonymous (not verified) says:

so does yours

Re: Linux-Based Voice Recognition

On February 9th, 2003 Anonymous says:

Isn't the answer obvious - get WINE finished so we can reliably access software written for windows..?!

Re: Linux-Based Voice Recognition

On July 19th, 2004 Anonymous says:

whoever said get WINE finished is a complete noob to computers and emulation in general.... if you get WINE finished...which was pointed out as damn near impossible any time soon (plan to take out microsoft in the next five years O.o...good luck)...how much of your processor speed is taken up in the emulation of windows. How much ram is wasted to the WINE program itself. Lets take into example x box emulation which is just starting to pick up. The x box uses a p III modified 733 mhz processor and has 64 ram. To get even half way decent results you will need to be running a 2.8 ghz p4 machine with 1 gb of ram ... and even then expect errors. Voice recognition technology, although obviously not as graphic intensive, requires A LOT of processing speed. Perhaps if you have a uber super computer then emulation will seem to make sense...but seeing as how these tend to cost a lot more than any company, or especially desktop owner, will be willing to pay... LETS DROP WINE. Wine is great for minimal integration between the two systems for undeveloped software in linux. It is meant to be used while Linux is still in the process of developing all this excellent software. Also ...have you even used WINE its the most unstable thing ever...its purpose was educational and not meant to be used as a cure all for the linux flaws. Oh and if you really still want to get WINE finished... , pardon my language, get off your lazy ass and finish it yourself.

Re: Linux-Based Voice Recognition

On October 25th, 2003 Anonymous says:

No, it definitely is not obvious.

First, WINE will never be finished. One, because "finished" would mean supporting every Windows program, and windows programs are constantly being developed. Two, because Windows is still being, em, "developed", with something close to a new version every two years or so (which is absurd), implementing new options, thingies, and lots of bugs. WINE has to accomodate all this, and then it will never be finished. Now, I will contradict myself: WINE will be finished the day Windows is. That is, the day Linux (or alike) rule. And this carries to the next point.

Second, for Linux to be the great platform it is now, and conquer further frontiers, native development is essential. This does not mean that programs existing in windows cannot be copycatted in free software versions, but that they have to be -- precisely -- free software and native in Linux.

Third, related to second, native software in Linux will have way much higher stability and performance than imperialist-based one. Now imagine taking all the fails and flukes of Windows software, and adding them an extra emulation (alright, "translation") layer. Unreliability rises exponentially.

Not that WINE is bad, don't get me wrong. It is just that, in my oppinion, it is only a temporal solution. If I can accept that in a perfect future Linux may not be the only OS (after all, full unification means stancation), it is _imperative_ that the GNU ideal of free software and portability triumphs. This way, all software developed will be avaliable for every platform, and to everybody.

It is only reasonable, isn't it?

Re: Linux-Based Voice Recognition

On February 14th, 2003 Anonymous says:

So what's stopping you?

Re: Linux-Based Voice Recognition

On February 9th, 2003 Anonymous says:

Before something can be applied effectively to the desktop, it may have applications elsewhere. Voice recognition can be used for security, telecom automation and assisting the disabled. These are all great applications for Linux, as it is a very strong server platform and is gaining ground in the embedded world.

"Greater China" which includes Taiwan, Hong Kong and the mainland, is where most systems are made now. The mainland has embraced Linux and will increase its usage, because it is not addicted to the same desktop software, like the rest of the world. For all the negative press China receives, there are as many positive things happening there as well.

Re: Linux-Based Voice Recognition

On February 13th, 2003 Anonymous says:

Right on

You have a very good grasp of the Chinese

potential, indeed.

Re: Linux-Based Voice Recognition

On February 8th, 2003 Anonymous says:

Oh Boy... Too busy to type, see, listen, hear, and soon after this voice technology has been introduced, to speak? What next? Put a chip in a your brain and just think? Well, its already in the works! check out what the Docs are doing at Univ. of Florida with a chip in his arm.

Re: Linux-Based Voice Recognition

On February 5th, 2003 Anonymous says:

Medicine can't be any worse with Dr handwriting if you can call it that.

I saw up close and personal a live demo of L-H voice recognition system at a PC expo. Maybe the system was tweaked for this person's voice but I did see him read items from people's conference badges. This was more than a year ago! Error rate on the screen was very low. Greater than 96% accurate at the worst! ANDwas done using the worst platform - windows!

Hardware has come order of magnitude in computing power since that time and growing weekly.

Only lagging thing is software. Programmers are the most creative people on the planet! That's no brag. You are them. JUST DO IT. I have multiple arthristis' and I lost my job as programmer because of it. I need the sw if not doctors. I have read transcriptions of office notes that my doctor dictated with me in the room(very quiet too) and the mistakes are laughable and horrible. I have amoung other things - Reiter's disease and came notes came out as writers disaese(misspelling included).

JACK

Re: Linux-Based Voice Recognition

On February 7th, 2003 Anonymous says:

There is an alternative, a java based speech recognition project sponsored by Carnegie Mellon University, Sun Microsystems Laboratories, Mitsubishi Electric Research Labs, and Hewlett-Packard's Cambridge Research Lab.

http://cmusphinx.sourceforge.net/cgi-bin/twiki/view/Sphinx4/WebHome

It might not be fully functional yet, but in one year... and since it runs on java it runs on linux/windows/anything with a java virtual machine.

Awesome!

Re: Linux-Based Voice Recognition

On February 10th, 2003 Anonymous says:

Dear Dr.

(if I may inSilicon Valley style)

We have looked at all the fine work

you have done listed at your website,

impressive, indeed.

We have a major NIH grant based project

to improve EMR and connect it to the

medical instrument telemed style.

If you would like to work with us then please

email directly to

LaGiannina@aol.com.

We have a system in test at Sequoia Hospital

in Silicon Valley. Please check

www.Sensitron.net for detailes.

Giannina

Re: Linux-Based Voice Recognition

On February 5th, 2003 Anonymous says:

What is China's "low-cost method of manufacturing"? Does it have to do with substandard wages, brutal repression of labor activism, dangerous working conditions and not protecting the environment?

Re: Linux-Based Voice Recognition

On February 10th, 2003 Anonymous says:

You are misinformed, China has one of the best developed PC industries in the world, just ask the scottish and US IBMers, DEller' HP/Compaqers' etc...who have had their workload transferred to China, and India....

In the next year these countries will have the best IT industries in the world, leaving the west traaaiiiling behind...

Who taught them , we did, who financed them , we did, who lost jobs to them , we did... is there a tre3nd here ?

Re: Linux-Based Voice Recognition - alternatives

On February 5th, 2003 Anonymous says:

Linux-based VR is years in the future. Currently, input could be speeded up (and wrist injuries decreased) if the Linux community supported:

(1) the Dvorak keyboards; and

(2) a much better auto spell-check and auto word-complete for Open Office.

Re: Linux-Based Voice Recognition & the handicapped

On February 5th, 2003 SusanCragin (not verified) says:

Voice recognition is critical for me and I use it constantly. I must write up to 1,000 words per day and have carpel tunnel syndrome. Lack of Linux VR prevents me from shifting my desktop system, and I work on Windows with DNS 6.0 and a Sennheiser microphone.

In all professions, systems administrators are being asked to accommodate those with disabilities. It is not only humane, it is required by law. Linux will never make its way in the desktop market until it has a viable voice recognition product.

(I have a quadapalegic friend who writes Linux code using Windows and Dragon Naturally Speaking.)

Voice recognition is not that great now. It requires time and patience, makes lots of mistakes, and is not good in a noisy environment. It requires the fastest processor available and at least 512 RAM. It also requires that you stare at the screen as you dictate, because errors are constant, and the program often stalls.

(The salesmen and VR weenies tell you different, but they are wrong.)

When anyone talks about busy professionals running around dictating, they are talking about the future and not the present.

Voice recognition is used a great deal by medical professionals now, mostly to update charts.

By the way, IBM has abandoned development of Linux ViaVoice. I contacted the sales desk recently, and they said they are unloading their few remaining copies. VV-Linux was always a poorly supported product next to VV-Windows.

Re: Linux-Based Voice Recognition & the handicapped

On February 10th, 2003 Anonymous says:

A NIH grant based project is taking over the VV for Linux

from IBM.

If interested to work with us then please make

direct contact with

LaGiannina@aol.com

Re: Linux-Based Voice Recognition

On February 4th, 2003 Anonymous says:

speech recognition on Linux isn't going to happen anytime soon. The defective ViaVoice for Linux has been withdrawn (thank goodness) and the open source small vocabulary recognizers are of "grad student research" quality and not suitable for real world use.

Speech recognition on Linux will not happen until the sound system is improved[1] and there are a sufficient number of desktop users to support a $10 or $15 million business.

until then, speech recognition on Linux is not even a dream, it's just a hallucination.

--- eric

(anonymous because I refuse to create yet another account anywhere)

[1] linux sound systems are now barely functional for sound output. There are still a lot of problems with clicks and pauses which are death to speech recognition accuracy. The sound system still needs improvement to handle high-quality input and simultaneous multiple sound devices such as a built-in sound card and a USB sound card (a very common configuration with speech recognition users).

Re: Linux-Based Voice Recognition

On February 5th, 2003 Anonymous says:

You are right again and that is why we are developing

a highly improved version with patanted ideas in it,

ViaVoice is just an experiment but a worthwhile one at that.

LaGiannina@aol.com

Re: Linux-Based Voice Recognition

On February 5th, 2003 Anonymous says:

where did you find these "the open source small vocabulary recognizers"

please post some links

i only want to recognize it a few simple commands like "bold","red","new mail"..

i think this should not be to complicated to recognize since every mobile phone can do this

Re: Linux-Based Voice Recognition

On February 5th, 2003 Anonymous says:

You are so very right: current voice rec is not good enough yet for professionals, but a number of major new features, which moves the precision of the recognition to a very high level, are being incorporated by some very good small companies.

If interested then let us communicate directly since

we can use all the constructive ideas we can get.

LaGiannina@aol.com

Re: Linux-Based Voice Recognition

On February 3rd, 2003 Anonymous says:

Where I come from, Histo Pathologists use voice recognition to dictate diagnosis. They currently use dragon dictate, I'd like to see a open source solution (are there any good ones?) The only advantage I see for voice recognition so far has been to make redundant only part of the audio typeists job

Chris.

Featured Videos

The X Window System is a magnificent platform for many uses, but using it to run an application over a slow network is nearly impossible. This is an introduction to NX, a technology that makes remote applications fly even over commodity internet.

Linux Journal Gadget Guy, Shawn Powers, reviews the Flip Video Ultra, a small portable video camera, and shows us how easy it is to edit the video with Kino.

Thanks to our sponsor: Silicon Mechanics

From the Magazine

September 2008, #173

Feeling a bit like a Thermian? Never give up, never surrender! Someday, you could go from underdog to top dog. Just take a look at a few of the underdogs we highlight in this issue: Mutt, djbdns, Nginix, Gentoo, Xara and the program voted mostly likely to fail just a few years back—Firefox. If Firefox not radical enough for you, check out Chef Marcel's column for some more alternatives. Having trouble mapping your program data to your relational database? If so, Rueven Lerner shows you some tricks in his At The Forge column.

Need to run GUI applications on your server in the next state? In his Paranoid Penguin column, Mick Bauer shows you how to do it securely. Kyle Rankin keeps hacking and slashing and shows you a few split screen secrets you may not be familiar with. Finally, we all know what happens next February, but only Doc knows what happens afterward.

Read this issue