|
r
|
What is Text-to-Speech?
TTS is the ability of the computer to convert text - the words as they are displayed on screen - into speech which may be played back immediately, or saved as audio files. The files should be in an audio format compatible with the portable personal audio or CD player you will be using. .WAV and .MP3 are the most common, and a competent TTS program will output to either.
The computer's operating system uses one or more TTS engines to recognise the text in a process similar to how optical character recognition (OCR) software scans printed documents and turns them into digital computer files. Indeed, you can use OCR and TTS in combination to turn printed documents into speech, an ideal way of creating talking ebooks.
The TTS engine links to a synthesized voice to do the actual speaking. Some of these voice programmes sound very artificial and are basic in their capabilities. Others are much more sophisticated and incorporate actual recorded human voices in different accents, languages and with specialised vocabularies.
Not all voices and speech engines are compatible with each other, but the correct engines are usually supplied with the voices, which can be obtained on CDs or as Internet downloads.
You will get the best results from specialist TTS software from various third party suppliers, which are described at length in our Digital Audio guides. These are available in different accents, languages, and with special vocabularies featuring jargon and technical terminology. (Autocluster.com is developing facilities to cater for automotive technical terms also.)
We recommend TextAloud as offering the best value in commercial TTS software, particularly when used in combination with various specialist natural voices. There are other programs which you can try - some of them free - but TextAloud we have identified as the best all-round performer. It costs under $30 and you can set up a really advanced system for well under $100with a variety great new voices to meet your individual preferences.
How to test your system for installed text-to-speech software
You may be able to create speech files immediately using software already on your computer together with the unique autocluster and digitaldrivetime inter-linked networks of online automotive and motoring intelligence resources.
If you have Windows XP:
From Control Panel click on the Speech icon, then the Text-to-Speech tab to see what TTS engines and voices have been installed on your system.
You can make similar checks if your computer uses older versions of Windows, or has a Linux, Macintosh or other operating system.
In addition to the speech facilities, which come with your operating system, text-to-speech software may also be incorporated in word processing and other software installed on your computer.
To augment these, there are additional speech engines and a large selection of voices that can be downloaded free, or for very low cost.
What do the different voices sound like?
In Windows XP, you open Speech in Control Panel and then the Text-to-Speech tab.
The voices available already on your system are in the drop-down list. Select one and listen to it by clicking on Preview Voice
You can type in different text to see how it will sound in the voices available on your system and, in some cases, change the speed or pitch of the voice. It is worth experimenting.
How do the autocluster.com and digitaldrivetime.com talk radio services work?
These sites in the autocluster network use autobots to monitor hundreds of prime Web intelligence sources round-the-clock. Every time that you visit your favourite pages, the content will be updated immediately to include the latest information available on automotive and other topics of specific interest to you.
It would be impossible to have a conventional radio service with human announcers and news readers narrating this dynamic content in real time. Anyway, the sound files created would be many and massive - too much to download.
Streaming audio and other conventional Internet radio services also generate big files that require fast broadband connections. And no conventional audio service can be fully customised to your own personal information and entertainment needs, or respond as quickly as text to breaking news.
So conventional digital audio - either broadcast or on the Web - is not a practical way to avoid being tied to your computer or getting an up-to-date audio information service specific to your individual needs.
The alternative is to choose from the vast amount of written information and texts available information that you download or save as written words in plain text. In digital form text can be grabbed from the Internet very quickly and easily, even over slow connections. Then, using your own computer's processing power, you convert the texts to speech to take with you wherever you go.
Unlike most Web sites, the autocluster Web pages are specifically formatted so that the graphics do not get in the way of copying and saving the texts. There are other features to ease copying the text that you want to hear to the Windows Clipboard or directly into text-to-speech software. You can automate easily how your own computer very quickly converts the information that you select into recorded speech.
Within seconds you have MP3 or Wav audio speech files that can be transferred to CDs or to any portable audio device.
Why should I experiment with different voices?
The voices supplied with your operating system and with most of the TTS-enabled word processors, browsers and other applications usually represent older synthesized speech technology. They sound artificial and you may not enjoy listening to them for extended periods.
They may also have limited flexibility to vary the speed and pitch of the voice to your personal preferences.
The best and latest synthesized voices are much more sophisticated than the basic free voices. You can have naturally sounding male and female voices with American and UK English accents, or select from a wide range of other accents and languages. You can tune these voices and customise them - adding the correct pronunciation of particular words, names or technical phrases and jargon.
There are good commercial voices available at very reasonable cost - often at under $30. A wide selection of voices can be downloaded free of charge. Links to these are on the www.digitaldrivetime.com Web site.
How can I hear the TTS audio files?
Just click on them and they will play back through your computer system's speakers.
Or link your computer to your stereo system to get better quality.
Or transfer the files to CD disks or to the cards or internal memory storage of a portable personal audio player.
Your computer will play back all the digital audio file formats you are likely to encounter. But you need to save directly in the TTS software or convert using one of many free audio converter programs into the file format that your player uses. .
Car and home CD players usually require the standard .WAV audio files. Some will also play the much smaller .MP3 audio files, which, even in their most compressed form, offer sufficient quality for speech. Portable audio players with storage built in or on memory cards that slot into them usually either play .MP3 files or a special proprietary forma. You can convert almost any digital audio into compatible files using the software supplied with your hardware, or one of many audio format converters available free.
These portable devices and CDs can store many hours of speech generated by TTS software - far more than their capacity for large higher quality music files.
Follow the instructions, which accompany your speakers, CD burning hardware and software, or your portable audio player.
Speakers simply plug into the speaker jack on the computer's internal sound card. Use the line-out connection if your speakers have their own power supply and to export the audio to tape cassette systems which have a line-in socket.
The speaker-out connection on the computer's sound card is for non-powered speakers and has a signal that is boosted by the computer. This may be too powerful for the input for powered speakers or external recorders, particularly if the recorder has only a microphone input. So there is a risk of damage if you overload the external powered speakers or recording device.
Adjusting the text-to-speech volume output levels will vary, depending on your system and the hardware you are using. Follow the applicable instructions.
In Windows you can either access mixer controls through the loudspeaker icon displayed in the bottom right of your screen, or open Speech in Control Panel. On the Text-to-Speech tab, click Audio Output, and then Volume. Adjust the levels on the volume control mixer.
Using different voices, languages and engines
Text-to-speech voices and engines are closely matched and the combination may not be obvious from the on-screen displays, particularly if you are using a general purpose lexicon or one that is jargon specific, eg to the legal, medical or engineering professions.
A computer may have multiple engines loaded at the same time, but only one of each type can be active at a given time. You may need to consult the documentation with your hardware and software and experiment to find the best engine for your needs.
To change a TTS language in Windows XP you open Speech in Control Panel.
From the Text-to-Speech tab, you can change the TTS language and engine.
Quite dramatic improvements to speech quality can be made by changing the text-to-speech voice rate. From the Text-to-Speech tab, you access the Voice speed slider, which by default is set to normal.
Click Preview Voice to hear the results, with the words being highlighted as they are spoken.
Fixing text-to-speech problems
If you cannot hear any speech, the reason might be as simple as the speakers not being connected, or the volume being muted or not turned up in either the software controls or the speakers own physical external controls.
Check also if the speakers are selected as the current output device. On the Text-to-Speech tab, click Audio Output to make sure that the speakers are selected.
The sound card may have worked loose and not be properly seated or not installed with the correct drivers.
Your TTS software may not be set up properly. Or it may be recording silently to an audio file and not giving audible playback as it does so. The TTS engine or voice files may be corrupted and need to be reinstalled. You can test by using another engine and voice.
Microsoft's Troubleshooting Speech Systems advice in Wndows XP to determine if TTS is working properly is as follows:
Open Speech in Control Panel.
On the Text-to-Speech tab, click Preview Voice.
The text in Preview Voice should be spoken audibly with each word highlighted in turn. If so, TTS and the speakers are working.
If you cannot hear the Preview Voice and see words highlighted as they are spoken, see Possible Text-to-Speech Problems for more troubleshooting solutions.
You can use either the Speech Recognition tab or the Text-to-Speech tab in Speech properties to initialize and customize speech-enabled applications. These settings control general attributes such as input and output devices, the language used, as well as the playback voice and the accuracy of word recognition.
Individual applications may implement speech differently and you should refer to user's manuals. Those attributes shared by speech engines or systems across different software applications are set in Speech properties.
You can choose between several preferential option settings and customize the speech profile according to your needs. However, many options are proprietary and may vary between computer systems. Therefore, some of the buttons and screens may not act the same under all circumstances. Check instructions for your computer, speech software or device hardware.
|
You can probably convert text into speech right now if you have Windows XP!
From Control Panel click on the Speech icon, then the Text-to-Speech tab to see what TTS engines and voices have been installed on your system.
You will get the best results from specialist TTS software from various third party suppliers. These are available in different accents, languages, and special vocabularies, e.g. for medical or legal terminology. We are developing ways to cater for automotive technical terms also.)
More details about adding and using valuable speech extensions to how computers are used can be found at the latest addition to the autocluster.com network - www.digitaldrivetime.com
Create your own talk radio service on any topic
At www.digitaldrivetime.com you can access continually updated information about sports, religions, entertainment, business, trade, air, sea and ground transport, and a wealth of other topics. There are special sections at www.digitaldrivetime.com and other www.autocluster.com sites covering countries, regions, automotive brands, etc.
All this information can quickly and easily be converted to speech, providing you with knowledge and entertainment wherever you are - driving, flying (or waiting at airports or in traffic jams!), exercising, gardening, fishing, sitting on the beach, etc., etc.
Listen to your email messages through your car audio system
The autocluster.com Web sites enable you to take advantage of the Internet anywhere you may be, and even while you are on the move. Using text-to-speech technology (which may already be installed on your computer) you add another dimension to time that might otherwise be wasted.
When away from your computer you can also listen to your emails, or to Powerpoint and other forms of business presentations that enable you to acquire knowledge about the motor business and achieve valuable competitive advantages.
The autocluster.com Web sites are hosted on fast and secure servers in Africa and Germany, so they provide rapid access through both local and global Internet portals wherever you may be and whether you use a fast broadband or a slow dial-up (even a cell phone) connection.
Build an audio ebook library at little or no cost
The autocluster services include downloadable “Talking eBooks” that, like the Web pages, are formatted for easy listening. Or create your own audio books from any existing computerized texts - even directly from .pdf files or Word documents.
Talking ebooks complement the combination of archival content and fast-breaking news available through the autocluster.com services.
In a few minutes you can assemble enough information and entertainment as speech that will turn even the longest journey into a more pleasant and far more useful experience.
How autocluster's talk radio services work
The autocluster sites use autobots to monitor prime Web intelligence sources round-the-clock, so that every time you visit your favourite pages, the content has changed to reflect the latest information available on automotive and other topics of specific interest to you.
It would be impossible to have a conventional radio service with human announcers and news readers narrating this dynamic content in real time. Anyway, the sound files created would be massive - too big to download. Streaming audio and other conventional Internet radio services also generate big files that require fast broadband connections. And no conventional audio service can be customised to your own personal information and entertainment needs.
So the only practical way to get round spending hours staring at the computer screen to access all the knowledge that you want is to collect it as plain text. Then, using your own computer's processing power, you convert the texts to speech to take with you wherever you go.
We make this text-to-speech conversion easy at www.digitaldrivetime.com , www.autocluster.com , www.autocluster.co.za , and the other autocluster Web sites. Our Web page formats separate the graphics from the text and have other features to ease copying the text that you want to hear to the Windows Clipboard or directly into text-to-speech software. You can automate easily how your own computer very quickly converts this information that you chose into recorded speech.
Within seconds you have MP3 or Wav audio speech files that can be transferred to CDs or to any portable audio device.
Easy and pleasant listening
The listening experience using synthesized voices has been transformed in recent years. You can chose naturally sounding male and female voices with American and UK English accents, or select from a wide range of other accents and languages.
Text-to-speech software may already be incorporated in word processing and other software installed on your computer. Other programs and a large selection of voices can be downloaded free, or for very low cost.
Even the most sophisticated text-to-speech software and special voices costs very little. You can set up a really advanced system that meets your individual needs for well under $100, even when using the latest version of such impressive programs as TextAloud.
|