Friday, 7 July 2017

Speech Recognition : Everything You Need To Know!

Speech Recognition : Everything You Need To Know!

What is Speech Recognition?
Speech recognition (SR) is the interdisciplinary sub-field of computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as "automatic speech recognition"(ASR), "computer speech recognition", or just "speech to text" (STT). It incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields.
shutterstock_381624613.jpg
Some SR systems use "training" (also called "enrollment") where an individual speaker reads text or isolated vocabulary into the system. The system analyses the person's specific voice and uses it to fine-tune the recognition of that person's speech, resulting in increased accuracy. Systems that do not use training are called "speaker independent" systems. Systems that use training are called "speaker dependent".
Speech recognition applications include voice user interfaces such as voice dialing (e.g. "Call home"), call routing (e.g. "I would like to make a collect call"), domotic appliance control, search (e.g. find a podcast where particular words were spoken), simple data entry (e.g., entering a credit card number), preparation of structured documents (e.g. a radiology report),speech-to-text processing (e.g., word processors or emails), and aircraft (usually termed Direct Voice Input).

The term voice recognition or speaker identification refers to identifying the speaker, rather than what they are saying. Recognising the speaker can simplify the task of translating speech in systems that have been trained on a specific person's voice or it can be used to authenticate or verify the identity of a speaker as part of a security process.
IMG_20170705_102432.jpg
Google Assistant
IMG_20170705_102637.jpg
Apple's Siri
From the technology perspective, speech recognition has a long history with several waves of major innovations. Most recently, the field has benefited from advances in deep learning and big data. The advances are evidenced not only by the surge of academic papers published in the field, but more importantly by the worldwide industry adoption of a variety of deep learning methods in designing and deploying speech recognition systems. These speech industry players include Google, Microsoft, IBM, Baidu, Apple, Amazon, Nuance, SoundHound, IflyTek, CDAC many of which have publicized the core technology in their speech recognition systems as being based on deep learning. Google, Apple and Microsoft has bought this Technology in there Google Assistant, Siri, and Cortana Systems.

Do You Know?
Quote: In 2000, Lernout & Hauspie acquired Dragon Systems and was an industry leader until an accounting scandal brought an end to the company in 2001. The L&H speech technology was bought by ScanSoft which became Nuance in 2005.  Apple originally licensed software from Nuance to provide speech recognition capability to its digital assistant Siri.
Thanks for Reading!!! 

No comments:

Post a Comment