Tony Robinson (speech recognition)
Tony Robinson is a pioneer in the application of recurrent neural networks to speech recognition,[1][2][3] and was one of the first to discover the practical capabilities of deep neural networks and how they could benefit speech recognition.[4] He first published on the topic while studying for his PhD at Cambridge University in the 1980s.[5] He has since published over a hundred widely cited research papers on automatic speech recognition (ASR).[6]
In 1995, Robinson formed SoftSound Ltd, a speech technology company that was acquired by search pioneer Autonomy with a view to using the technology to make unstructured video and voice data easily searchable. By developing and focusing on recurrent neural networks, Robinson helped build what was then the fastest large-vocabulary speech recognition system available, which also operated in more languages than any other model.[7]
From 2008 to 2010, Robinson was the Director of the Advanced Speech Group at SpinVox, a provider of speech-to-text conversion services for carrier markets, including wireless, VoIP and cable. Its automatic speech recognition (ASR) system was, for a time, used more than one million times per day. SpinVox was subsequently acquired by global speech technology company Nuance.
Robinson also founded Speechmatics, which launched its cloud-based speech recognition services in 2012. In late 2017, Speechmatics announced a significant technological breakthrough in accelerated new-language modelling.[8] Robinson continues to publish papers at the rapidly developing edges of speech recognition technology, especially in the area of statistical language modelling.[9]
References
- Robinson, Tony; Fallside, Frank (July 1991). "A recurrent error propagation network speech recognition system". Computer Speech and Language. 5 (3): 259–274. doi:10.1016/0885-2308(91)90010-N.
- Robinson, Tony; Hochberg, Mike; Renals, Steve (1996). "The Use of Recurrent Neural Networks in Continuous Speech Recognition". Automatic Speech and Speaker Recognition. The Kluwer International Series in Engineering and Computer Science. 355. pp. 233–258. CiteSeerX 10.1.1.364.7237. doi:10.1007/978-1-4613-1367-0_10. ISBN 978-1-4612-8590-8.
- Wakefield, Jane (2008-03-14). "Speech recognition moves to text". BBC News. Retrieved 2020-08-24.
- Robinson, Tony (September 1993). "A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the WERNICKE project". Third European Conference on Speech Communication and Technology. 1: 1941–1944. Retrieved 17 May 2018.
- Robinson, Anthony John (June 1989). "Dynamic Error Propagation Networks". PhD thesis. Retrieved 17 May 2018.
- Robinson, Tony. "Tony Robinson - Profile". ResearchGate. Retrieved 17 May 2018.
- Robinson, Tony; Hochberg, Mike; Renals, Steve (1996). "The Use of Recurrent Neural Networks in Continuous Speech Recognition". Automatic Speech and Speaker Recognition. The Kluwer International Series in Engineering and Computer Science. 355. pp. 233–258. CiteSeerX 10.1.1.364.7237. doi:10.1007/978-1-4613-1367-0_10. ISBN 978-1-4612-8590-8.
- Orlowski, Andrew. "Brit neural net pioneer just revolutionised speech recognition all over again". The Register. Situation Publishing. Retrieved 17 May 2018.
- Chelba, Ciprian; Mikolov, Tomas; Schuster, Mike (2013). "One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling" (Report). Cornell University Library. arXiv:1312.3005.