What is speech-to-text voice recognition?
Speech-to-text voice recognition is the process that converts spoken letters, words, sentences and numbers into text that appear on the computer screen. In simple terms, you talk to the computer and your words appear in a text format.
Voice-recognition software can help people with a variety of disabilities, particularly people who experience physical disabilities and find typing or using a mouse painful, difficult or impossible. Text-to-speech software can also help learners who struggle with written work or have a dyslexia-related difficulty, for example with spelling.
Getting started
In order to get the best from text-to-speech or voice-recognition programs, you will need a computer with a sound card, a good-quality microphone and an appropriate voice-recognition software program.
If you have Office XP running on your computer, there is a ready-to-use speech-recognition feature already installed. However, unlike some of the dedicated text to speech programs, it lacks a number of features. For example, using it hands-free is particularly difficult; so too is the time it takes to enrol a voice and achieve a high level of accuracy.
As everyone speaks differently, the first step in using voice-recognition software is to train or enrol your voice so that the software and computer can make adjustments to recognise and learn your voice style. This process normally takes up to 10 minutes although this will vary between different text to speech applications. However, the more you train your voice the more accurate it becomes. This process is often known as creating a voice model. With the increase of digital voice recorders it is also possible to record your voice, download the recording into a voice recognition program and convert the recording into text.
Benefits
The benefits of using voice recognition can be quite impressive, particularly for people who have no other means of using a computer other than their voice. Once the computer has learned your voice model, it is possible to use voice recognition for word processing, spreadsheets, databases, writing emails, navigating the internet and, if used correctly, to have independent control over all aspects of a computer.
-
Voice Power provides training and advice on voice recognition software. The 'Educational' and 'Special Needs' links provide helpful background information on how speech-to-text can help meet the needs of a wide range of computer users.
A particularly impressive section of the site is the 'Demo' page which provides some extremely interesting and engaging presentations (in Flash format) of voice recognition being used with MS Word and Excel.
-
If you are looking for a more comprehensive understanding of voice recognition software the Ace Centre provides a detailed guide to voice recognition programs.
The site covers a range of voice recognition related topics including a comparison on different programs, advice on the best microphones, developing a voice model, using voice recognition in exams and a number of case studies.
The section 'Top tips for effective use of Dragon NaturallySpeaking' is particularly helpful for getting you up to speed and get to grips with voice recognition. The tutorial can also be downloaded in a Word document format.
-
The Call Centre offer a number of valuable tutorials for voice recognition programs, including Dragon NaturallySpeaking and ViaVoice, probably the best known text to speech products.
It is possible to download the ‘Introducing NaturallySpeaking Book’ from the Call website as one large file or in separate ‘sessions’. The Introducing NaturallySpeaking Book has been specifically designed for both younger and older learners who need to use voice recognition as an alternative to the keyboard and mouse.
‘Session 9’, Dictate School Work and ‘Session 10’ Plan using Speech Recognition for School Work are particularly useful and helpful tutorials.
-
Becta is the Government's key partner in the strategic development and delivery of its information and communications technology (ICT) and e-learning strategy for the schools and the learning and skills sectors.
This link provides a report on some of the issues that face schools, teachers and pupils on implementing voice recognition in the class room. Issues such as equipment specification, staff training and support are covered here and are well worth reading.
-
ScanSoft is the leading supplier of speech recognition software including Dragon Products, IMB and SpeechWorks. The site provides a 'one-stop guide' to a range of voice to text software including recent updates and new developments in voice recognition software.