In computer science, speech recognition (SR) is the translation of spoken words into text.<div><br/></div><div><div>Speech recognition applications include voice user interfaces such as voice dialling (e.g., "Call home"), call routing (e.g., "I would like to make a collect call"), domotic (home automation) appliance control, search (e.g., finding a podcast where particular words were spoken), simple data entry (e.g., entering a credit card number), preparation of structured documents (e.g., a radiology report), speech-to-text processing (e.g., word processors or emails), and aircraft control (usually termed Direct Voice Input).</div></div><div><br/></div><div><a href="http://en.wikipedia.org/wiki/Speech_recognition">http://en.wikipedia.org/wiki/Speech_recognition</a><br/></div><div><br/></div><div>Requested implementation:</div><div><a href="http://cmusphinx.sourceforge.net/wiki/tutorialsphinx4">http://cmusphinx.sourceforge.net/wiki/tutorialsphinx4</a><br/></div><div><br/></div><div>Expected input: URL of an audio file</div><div>Expected output: String containing the recognized text</div>
This algorithm uses the CMU Sphinx open-source library to recognize speech in audio files uploaded to the Data API or in YouTube videos licensed under Creative Commons.
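Since the algorithm accepts two kinds of input URL, a caller may want to check which kind it is passing before invoking it. The following is a minimal sketch of such a check, not the algorithm's actual implementation: the function name `classify_input`, the `YOUTUBE_HOSTS` set, and the example URLs are all assumptions made for illustration, and the source does not say how the algorithm itself tells the two cases apart.

```python
from urllib.parse import urlparse

# Hypothetical host list; the source does not specify how YouTube links are detected.
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def classify_input(url: str) -> str:
    """Return "youtube" for YouTube links and "audio_file" for any other URL."""
    # urlparse(...).hostname is None when the URL has no network location.
    host = (urlparse(url).hostname or "").lower()
    return "youtube" if host in YOUTUBE_HOSTS else "audio_file"


if __name__ == "__main__":
    # Illustrative URLs only; neither points at a real resource.
    print(classify_input("https://www.youtube.com/watch?v=abc123"))
    print(classify_input("https://example.com/audio/clip.wav"))
```

A check like this only inspects the URL's host. It cannot confirm that a video is licensed under Creative Commons or that a file actually contains audio.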