Processing is an open source programming language and environment for people who want to create images, animations, and interactions.
Florian Schulz, Interaction Design Student at FH Potsdam, presented a year ago in the processing forum a speech to text (STT) library, based on the Google API. The source code is available at GitHub, a project page provides additional informations. The library is based on an article of Mike Pultz, named Accessing Google Speech API / Chrome 11, published in March 2011.
I installed the library in my processing environment (version 2.0b8) and run the test examples with success. I did some trials with the french and german Google speech recognition engines. I am impressed by the results.
Additional informations about this topic are provided in the following link list :
- Google Demo page for Chrome 25 and later
- Openmoko voice recognition
- GoogleChrome / webplatform-samples