|
http://www.readplease.com/ Software that lets your computer talk.
http://www.bytecool.com/ CoolSpeech is a text-to-speech program for Windows. This award-winning software lets you listen to online news, email messages, clipboard text, keyboard typing, and entire text documents spoken by sophisticated computer voices.
http://tldp.org/HOWTO/Speech-Recognition-HOWTO/software.html Speech Recognition Software
http://www.nextup.com/ TextAloud uses voice synthesis to convert text into spoken audio. Listen on your PC or create MP3 or WMA files for use on portable devices such as iPods, PocketPCs, and CD players.
http://www.gusinc.com/ Communication solutions for autism, stroke, and aphasia.
http://www.browsealoud.com/page.asp?pg_id=70004 Browsealoud reads websites aloud and highlights words as they are read.
http://www.drspeech.com/ Dr. Speech is a comprehensive speech and voice assessment and training software system that is easy to use, portable, and affordable. It is intended for use by professionals in the voice and speech fields.
http://www.zero2000.com/ 2nd Speech Center: an award-winning text-to-speech player that converts any text into spoken words or even MP3/WAVE audio files.
http://www.sstil.com/ Speech and Software Technologies
http://www.naturalreaders.com/ NaturalReader uses natural-sounding voices and an easy-to-use interface to convert any written text into speech.
http://www.microsoft.com/reader/developers/downloads/tts.asp Microsoft Reader for Tablet PC and Microsoft Reader for Windows-based PCs and laptops offer accessibility features that bring eBooks to more communities and provide a richer on-screen reading experience with additional TTS and Verbosity functionality.
http://www.realizesoftware.com/ Realize Software develops interactive speech software that enhances productivity and makes computing easier by adding voice to complement, reduce, or eliminate use of the keyboard and mouse.
http://linux-sound.org/speech.html Speech Synthesis & Analysis Software
http://www.speech.cs.cmu.edu/hephaestus.html These pages provide a distribution mechanism for a number of speech-related software systems developed at, hosted at, or substantially used within the CMU Speech Group. They are part of the group's continuing goal to provide state-of-the-art, stable, free software components that allow anyone to build and use speech technology systems.
http://www.speech.cs.cmu.edu/databases/micarray/ CMU Robust Speech Recognition Group: Microphone Array Database
http://www.speech.cs.cmu.edu/flite/ Flite (festival-lite) is a small, fast run-time synthesis engine developed at CMU and primarily designed for small embedded machines and/or large servers.
http://www.texthelp.com/page.asp Texthelp Systems Ltd is the worldwide leader in literacy software solutions, provided through three core business divisions.
http://www.readplease.com/english/downloads/ ReadPlease product downloads and free voices for ReadPlease.
http://www.terakeet.net/ The future of speech recognition is now, and Terakeet Corporation is paving the way with integrated application development services that enable the use of voice automation and speech recognition technology.
http://cmusphinx.sourceforge.net/html/cmusphinx.php The Sphinx Group at Carnegie Mellon University is committed to releasing the long-running, DARPA-funded Sphinx projects widely, in order to stimulate the creation of speech-using tools and applications and to advance the state of the art both in speech recognition and in related areas, including dialog systems and speech synthesis.
http://festvox.org/ This project is part of the work at Carnegie Mellon University's speech group aimed at advancing the state of speech synthesis.
http://www.speech.cs.cmu.edu/tools/factory.html A fully configured speech decoder uses a set of different models, including acoustic, lexical, and language models. The tools on this page allow you to easily construct lexical and language models consistent with the formats in use in the ARPA speech community (and by others). They are part of the Sphinx Knowledge Tools.
http://www.speech.cs.cmu.edu/Communicator/ The CMU Communicator project explores advanced dialog management architectures for complex problem-solving tasks. It is a project under the Carnegie Mellon Sphinx Group and is funded by the DARPA Communicator program.
http://www.opendialog.org/ Ariadne Spoken Dialogue System
http://www.festvox.org/cmu_arctic/ The CMU_ARCTIC databases were constructed at the Language Technologies Institute at Carnegie Mellon University as phonetically balanced, US English, single-speaker databases designed for unit selection speech synthesis research.
http://emacspeak.sourceforge.net/ Emacspeak Inc (NASDOG: ESPK) announces immediate world-wide availability of Emacspeak 25.0, a powerful audio desktop for leveraging today's evolving semantic WWW. Emacspeak is a speech interface that allows visually impaired users to interact independently and efficiently with the computer.
http://emu.sourceforge.net/ The EMU Speech Database System. EMU is a collection of software tools for the creation, manipulation, and analysis of speech databases. At the core of EMU is a database search engine that allows the researcher to find speech segments based on the sequential and hierarchical structure of the utterances in which they occur. EMU includes an interactive labeller that can display spectrograms and other speech waveforms and that allows the creation of hierarchical, as well as sequential, labels for a speech utterance.
http://www.festvox.org/ This project is part of the work at Carnegie Mellon University's speech group aimed at advancing the state of speech synthesis.
http://cslu.cse.ogi.edu/tts/flinger/ Flinger is a program for synthesizing a singing voice from MIDI file input.
http://ludios.org/programs/tkfestival TkFestival is a frontend to Festival, a speech synthesis program. TkFestival is written in Tcl/Tk and uses expectk to communicate with the festival binary.
http://freetts.sourceforge.net/docs/index.php FreeTTS is a speech synthesis system written entirely in the Java programming language. It is based upon Flite, a small run-time speech synthesis engine developed at Carnegie Mellon University. Flite is derived from the Festival Speech Synthesis System from the University of Edinburgh and the FestVox project from Carnegie Mellon University.
http://imskpe.sourceforge.net/wiki/index.php/Main_Page IMSKPE: a formant synthesis GUI in GTK2. IMSKPE is a graphical user interface for the formant synthesis algorithm from Dennis Klatt.
http://cmp.felk.cvut.cz/~kybic/dipl/ Kalman Filtering and Speech Enhancement. The enhancement of noisy speech is a challenging research field with numerous applications. The presented work focuses on the case of a speech signal corrupted by slowly varying, non-white, additive noise, when only the corrupted signal is available.
http://www.speech.cs.cmu.edu/comp.speech/Section5/Synth/klatt.kpe80.html KPE80: a Klatt synthesiser and parameter editor. Platform: Unix. Description: the KPE80 program provides a graphical interface for the implementation of the Klatt 1980 formant synthesiser.
http://accessibility.kde.org/developer/kttsd/index.php KTTS (KDE Text-to-Speech) is a subsystem within the KDE desktop for conversion of text to audible speech. KTTS is currently under development and aims to become the standard subsystem for all KDE applications to provide speech output.
http://liarliar.sourceforge.net/ LiarLiar: an open-source computerized voice stress analysis (CVSA) tool.
http://tcts.fpms.ac.be/synthesis/mbrdico/ MBRDICO is a talking dictionary that uses MBROLA as a back-end speech synthesizer. Text processing is performed using a complete GNU GPL package for automatic phonetization training (letter/phoneme alignment, decision tree building, stress assignment) and duration/intonation generation. At the moment American English, Arabic, British English, Dutch, French, and Spanish are available.
http://freespeech.sourceforge.net/ The Open Mind Speech project is part of the Open Mind Initiative and aims to develop free (GPL) speech recognition tools and applications, as well as to collect speech data from "e-citizens" using the Internet.
http://www.fon.hum.uva.nl/praat/ Praat: doing phonetics by computer
http://www.speech.cs.cmu.edu/comp.speech/SpeechLinks.html The list of all the hyperlinks from the comp.speech FAQ. This is probably the biggest list of speech technology links available.
http://www.speex.org/ Speex is an Open Source/Free Software patent-free audio compression format designed for speech. The Speex Project aims to lower the barrier of entry for voice applications by providing a free alternative to expensive proprietary speech codecs. Moreover, Speex is well adapted to Internet applications and provides useful features that are not present in most other codecs. Finally, Speex is part of the GNU Project and is available under the Xiph.org variant of the BSD license.
http://cmusphinx.sourceforge.net/html/cmusphinx.php The CMU Sphinx Group Open Source Speech Recognition Engines
http://www.linux-magazin.de/Artikel/ausgabe/2000/05/Sprachsynthese/sprachsynthese.html Speech synthesis under Linux: Tux learns to speak. An averagely equipped computer has a multitude of interfaces through which human-machine communication can take place. A little-known and little-used option is to have the computer address its users directly through the sound card, which is usually present anyway. Yet there are many conceivable applications for speech output, ranging from accessibility aids for the visually impaired to automated information services.
http://voxpak.sourceforge.net/ Voxpak is a GUI for playing, recording, editing, and renaming voice and fax messages. It includes scripts for popping up sticky notes or requesters with caller ID info, and renames voice/fax messages to date+callerid. Written in Python and PyGTK. Includes a small Kaptain version for KDE.
http://www.zachary.com/s/xvoice XVoice: Linux Text To Speech Recognition and Integration
http://www.speech.cs.cmu.edu/comp.speech/ Welcome to the comp.speech Frequently Asked Questions WWW site. This site provides a range of information on speech technology, including speech synthesis, speech recognition, speech coding, and related material.
http://sourceforge.net/projects/rsynth/ rsynth: text-to-speech (formant synthesis). The project aims to provide basic text-to-speech capability on as many platforms and for as many spoken languages as possible by formant synthesis from an International Phonetic Alphabet representation.
http://www.freebsoft.org/speechd-el speechd-el is an Emacs client to speech synthesizers, Braille displays, and other alternative output interfaces. It provides a full speech and Braille output environment for Emacs. It is aimed primarily at visually impaired users who need non-visual communication with Emacs, but it can be used by anybody who needs sophisticated speech or another kind of alternative output from Emacs. speechd-el can make Emacs a completely speech- and BrlTTY-enabled application suitable for visually impaired users or, depending on its configuration, it can speak only in certain situations or when asked, to serve the needs of any Emacs user.
http://www.sp.m.is.nagoya-u.ac.jp/people/banno/spLibs/spwave/ spwave is a speech file editor supporting several sound formats, including WAV, AIFF, MP3, raw, and more. The program is designed for research use, so stability and usability are regarded as important. spwave runs on multiple platforms, including Linux, Windows, and Mac OS.
|