2020. 10. 25. 10:36ㆍ카테고리 없음
The ML-compatible A52Codec.component will have a timestamp of Jul 27, 2012. If you have installed Perian, it'll put its own version of A52Codec.component with the timestamp Jul 23, 2011 (it's based on these timestamps that you can easily know which version is currently installed). Apr 10, 2007 -from engadget thanks guys for this awesome how to The two biggest Apple TV limitations are the lack of codec support (like XviD, DivX, etc.) and not even having the ability to do basic surround sound like Dolby Digital 5.1. These issues were resolved almost immediately after the Apple TV was released, although the hacks.
A52 Codec Editor's Review
A52 is a free codec that allows you to play media files that use the AC3 encoding.As a rule, you can find the AC3 audio encoding within the DVD movies or native multi-channel music that can use this type of encoding. This codec is also required by Perian plug-in, when it's decoding a movie that has in its container an AC3 audio stream.
The codec is implemented as code-audio component. This means you can use it both in QuickTime and Core Audio components.
The support for QuickTime is well implemented. It includes an importer in order to play the *.ac3 files directly from this player.

Pluses: It does a good job, when support for AC3 audio encoding is required.
Drawbacks / flaws:
In conclusion: this is a nice plug-in that helps you a lot if you use AC3 as an audio compressor for your music or if you have movies that uses this codec within their containers.
version reviewed: 1.7.2
TextSpeech Pro lets you read and convert text from most documents to speech and wav files in a unique way. TextSpeech Pro is a professional text to speech software that converts Outlook emails, web pages, and documents (incl. PDF) to speech or audio files in 3 modes (quick, standard and batch). TextSpeech Pro lets you read and convert text from most documents to speech and wav files in a unique way.
Speech recognition software is available for many computing platforms, operating systems, use models, and software licenses. Here is a listing of such, grouped in various useful ways.
- 5Windows
Acoustic models and speech corpus (compilation)[edit]
The following list presents notable speech recognition software engines with a brief synopsis of characteristics.
Application name | Description | Open-source | License | Operating system | Programming language | Supported language, note | Offline or online |
---|---|---|---|---|---|---|---|
CMU Sphinx | HMM | Yes | BSD style | Cross-platform | Java | English | Offline |
HTK | No | HTK specific | Cross-platform | C | English; version 3.5 released December 2015 | ||
Julius | HMM trigrams | Yes | BSD style, non-commercial | Cross-platform | C | Japanese, English; [2] | Offline |
Kaldi | Neural net | Yes | Apache | Cross-platform | C++ | English | |
RWTH ASR | RWTH Aachen University | No | RWTH ASR, non-commercial use only | Linux, macOS | C++ | English |
Macintosh[edit]
Application name | Description | Open-source | License | Price | Note |
---|---|---|---|---|---|
Dragon for Mac (discontinued 2018) | macOS; by Nuance | No | Proprietary | ||
Dragon Dictate (discontinued) | macOS; by Nuance | No | Proprietary | ||
MacSpeech Scribe (discontinued) | Transcription from recorded text; acquired by Nuance | ||||
iListen (discontinued) | PowerPC Macintosh; discontinued by MacSpeech; acquired by Nuance | ||||
Speakable items | Included with macOS | ||||
ViaVoice (discontinued) | IBM Product; acquired by Nuance | ||||
Voice Navigator | Original GUI voice control; 1989 |
Cross-platform web apps based on Chrome[edit]
Text To Speech Pro
The following list presents notable speech recognition software that operate in a Chrome browser as web apps. They make use of HTML5 Web-Speech-API.[1]
Application name | Description | Open-source | License | Price | Note |
---|---|---|---|---|---|
Speechmatics[2] | Cloud based and on-premise automatic speech recognition | No | Proprietary | From £0.06 per minute of audio |
Mobile devices and smartphones[edit]

Many mobile phone handsets, including feature phones and smartphones such as iPhones and BlackBerrys, have basic dial-by-voice features built in. Many third-party apps have implemented natural-language speech recognition support, including:
Application name | Description | Open-source | License | Price | Note |
---|---|---|---|---|---|
Assistant.ai | Assistant for Android, iOS and Windows Phone | No | Proprietary, freeware | Free | Discontinued |
Dragon Dictation | No | Proprietary, freeware | Free | ||
Google Now | Android voice search | No | Proprietary, freeware | Free | |
Google Voice Search | No | Proprietary, freeware | Free | ||
Microsoft Cortana | Microsoft voice search | No | Proprietary, freeware | Free | |
Siri Personal Assistant | Apple's virtual personal assistant | No | Proprietary, freeware | Free | |
Alexa – Amazon Echo | Amazon's personal assistant | No | Proprietary | ||
SILVIA | Android and iOS | No | |||
Vlingo |
Windows[edit]
Windows built-in speech recognition[edit]
The Windows Speech Recognition version 8.0 by Microsoft comes built into Windows Vista, Windows 7, Windows 8 and Windows 10.Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese and only in the corresponding version of Windows; meaning you cannot use the speech recognition engine in one language if you use a version of Windows in another language. Windows 7 Ultimate and Windows 8 Pro allow you to change the system language, and therefore change which speech engine is available. Windows Speech Recognition evolved into Cortana (software), a personal assistant included in Windows 10.
Add-ons for Windows 7 speech recognition[edit]
- Voice Finger – software for Windows Vista and Windows 7 that improves the Windows speech recognition system by adding several extensions to accelerate and improve the mouse and keyboard control.
Windows 7, 8, 10 third-party speech recognition[edit]
- Braina – Dictate into third party software and websites[3], fill web forms and execute vocal commands.[4]
- Dragon NaturallySpeaking from Nuance Communications – Successor to the older DragonDictate product. Focus on dictation. 64-bit Windows support since version 10.1.
- SpeechMagic – Nuance Communications acquired Philips owned. Medical industry focus according to Frost & Sullivan. Standalone or embedded.[5]
- Tazti – Create speech command profiles to play PC games and control applications – programs. Create speech commands to open files, folders, webpages, applications. Windows 7, Windows 8 and Windows 8.1 versions.[6]
Windows XP or 2000 only[edit]
- Microsoft Speech API – Speech recognition functionality included as part of Microsoft Office and on Tablet PCs running Microsoft Windows XP Tablet PC Edition. It can also be downloaded as part of the Speech SDK 5.1 for Windows applications, but since that is aimed at developers building speech applications, the pure SDK form lacks any user interface, and thus is unsuitable for end users.
Built-in software[edit]
- Microsoft Kinect includes built-in software which allows speech recognition of commands.
- Older generations of Nokia phones like Nokia N Series (before using Windows 7 mobile technology) used speech-recognition with family names from contact list and a few commands.
- Siri, originally implemented in the iPhone 4S, Apple's personal assistant for iOS, which uses technology from Nuance Communications.
- Cortana (software), Microsoft's personal assistant built into Windows Phone and Windows 10.
Interactive voice response[edit]
The following are interactive voice response (IVR) systems:
Text Speech Pro Platinum
- Genesys[7]
- HTK – copyrighted by Microsoft, but allows altering software for licensee's internal use
- LumenVox ASR
- Tellme Networks; acquired by Microsoft
Unix-like x86 and x86-64 speech transcription software[edit]
- Janus Recognition Toolkit (JRTk)[8][9]
Discontinued software[edit]
- IBM ViaVoice – Embedded version still maintained by IBM.[10] No longer supported for versions above Windows Vista.[11] Untested above macOS 10.4 or on Macintoshes with an Intel chipset.[12]
- Quack.com; acquired by AOL; the name has now been reused for an iPad search app.
- SpeechWorks from Nuance Communications.
- Yap Speech Cloud – Speech-to-text platform acquired by Amazon.com.
Textspeech Pro
See also[edit]
How To Install A52codec.component Without
References[edit]
- ^'Web Speech API Specification'. dvcs.w3.org. Archived from the original on 2016-06-21.Cite uses deprecated parameter
|dead-url=
(help) - ^Orlowski, Andrew. 'Total recog: British AI makes universal speech breakthrough'. The Register. Situation Publishing. Retrieved 17 May 2018.
- ^'Speech Recognition Software for Windows PC – Braina'. www.brainasoft.com. Archived from the original on 2015-04-07.Cite uses deprecated parameter
|dead-url=
(help) - ^'Dynamic Faceting-List of Most 57 Speech Recognition SWs and Web Services'. Archived from the original on February 13, 2019. Retrieved February 23, 2019.Cite uses deprecated parameter
|dead-url=
(help) - ^'Philips SpeechMagic named European Technology Leader by Frost & Sullivan'. www.frost.com. Archived from the original on 2008-04-15.Cite uses deprecated parameter
|dead-url=
(help) - ^O'Neill, Mark (2013-11-06). 'Control your PC with these 5 speech recognition programs'. PC World. Archived from the original on 2014-01-01. Retrieved 2013-12-30.Cite uses deprecated parameter
|dead-url=
(help) - ^'Interactive Voice Response'. Genesys. Archived from the original on 2016-10-14.Cite uses deprecated parameter
|dead-url=
(help) - ^[1][dead link]
- ^Lavie, A.; Waibel, A.; Levin, L.; Finke, M.; Gates, D.; Gavalda, M.; Zeppenfeld, T.; Zhan, Puming (1 April 1997). 'Janus-III: speech-to-speech translation in multiple languages'. 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE Xplore. 1. pp. 99–102. CiteSeerX10.1.1.36.6967. doi:10.1109/ICASSP.1997.599557. ISBN978-0-8186-7919-3.
- ^'Archived copy'. Archived from the original on 2010-08-08. Retrieved 2010-06-29.Cite uses deprecated parameter
|dead-url=
(help)CS1 maint: archived copy as title (link) - ^'Nuance product support for Microsoft Windows 7'. Nuance Communications, Customer Help. Retrieved 2019-03-16.
- ^'ViaVoice for Mac OS X on Intel Chipset'. Nuance Communications, Customer Help. Retrieved 2019-03-16.