graves et al., Speech recognition with deep recurrent neural networks

From statwiki
Revision as of 15:49, 5 October 2015 by Mbhynes (talk | contribs)
Jump to navigation Jump to search

Introduction

Hello world! This is section [sec:intro].

Testing pandoc latex to wiki math conversions: [math]\displaystyle{ x = \sum_{i=1}^{n}\int_\alpha f(\alpha)^i\,d\alpha }[/math]

Now let’s try an enumerated list:

  1. [math]\displaystyle{ \alpha }[/math]
  2. [math]\displaystyle{ \beta }[/math]
  3. [math]\displaystyle{ \gamma }[/math]

References

10

H.A. Bourlard and N. Morgan, , Kluwer Academic Publishers, 1994.

Qifeng Zhu, Barry Chen, Nelson Morgan, and Andreas Stolcke, “Tandem connectionist feature extraction for conversational speech recognition,” in International Conference on Machine Learning for Multimodal Interaction, Berlin, Heidelberg, 2005, MLMI’04, pp. 223–231, Springer-Verlag.

A. Mohamed, G.E. Dahl, and G. Hinton, “Acoustic modeling using deep belief networks,” , vol. 20, no. 1, pp. 14 –22, jan. 2012.

G. Hinton, Li Deng, Dong Yu, G.E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T.N. Sainath, and B. Kingsbury, “Deep neural networks for acoustic modeling in speech recognition,” , vol. 29, no. 6, pp. 82 –97, nov. 2012.

A. J. Robinson, “An Application of Recurrent Nets to Phone Probability Estimation,” , vol. 5, no. 2, pp. 298–305, 1994.

Oriol Vinyals, Suman Ravuri, and Daniel Povey, “Revisiting Recurrent Neural Networks for Robust ASR,” in ICASSP, 2012.

A. Maas, Q. Le, T. O’Neil, O. Vinyals, P. Nguyen, and A. Ng, “Recurrent neural networks for noise reduction in robust asr,” in INTERSPEECH, 2012.

A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, “Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks,” in ICML, Pittsburgh, USA, 2006.

A. Graves, , vol. 385, Springer, 2012.

A. Graves, “Sequence transduction with recurrent neural networks,” in ICML Representation Learning Worksop, 2012.

S. Hochreiter and J. Schmidhuber, “Long Short-Term Memory,” , vol. 9, no. 8, pp. 1735–1780, 1997.

A. Graves, S. Fernández, M. Liwicki, H. Bunke, and J. Schmidhuber, “Unconstrained Online Handwriting Recognition with Recurrent Neural Networks,” in NIPS. 2008.

Alex Graves and Juergen Schmidhuber, “Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks,” in NIPS. 2009.

F. Gers, N. Schraudolph, and J. Schmidhuber, “Learning Precise Timing with LSTM Recurrent Networks,” , vol. 3, pp. 115–143, 2002.

M. Schuster and K. K. Paliwal, “Bidirectional Recurrent Neural Networks,” , vol. 45, pp. 2673–2681, 1997.

A. Graves and J. Schmidhuber, “Framewise Phoneme Classification with Bidirectional LSTM and Other Neural Network Architectures,” , vol. 18, no. 5-6, pp. 602–610, June/July 2005.

David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams, , pp. 696–699, MIT Press, 1988.

Geoffrey Zweig and Patrick Nguyen, “SCARF: A segmental CRF speech recognition system,” Tech. Rep., Microsoft Research, 2009.

Andrew W. Senior and Anthony J. Robinson, “Forward-backward retraining of recurrent neural networks,” in NIPS, 1995, pp. 743–749.

Abdel rahman Mohamed, Dong Yu, and Li Deng, “Investigation of full-sequence training of deep belief networks for speech recognition,” in in Interspeech, 2010.

M. Lehr and I. Shafran, “Discriminatively estimated joint acoustic, duration, and language model for speech recognition,” in ICASSP, 2010, pp. 5542 –5545.

Kam-Chuen Jim, C.L. Giles, and B.G. Horne, “An analysis of noise in recurrent neural networks: convergence and generalization,” , vol. 7, no. 6, pp. 1424 –1438, nov 1996.

Geoffrey E. Hinton and Drew van Camp, “Keeping the neural networks simple by minimizing the description length of the weights,” in COLT, 1993, pp. 5–13.

Alex Graves, “Practical variational inference for neural networks,” in NIPS, pp. 2348–2356. 2011.

DARPA-ISTO, , speech disc cd1-1.1 edition, 1990.

Kai fu Lee and Hsiao wuen Hon, “Speaker-independent phone recognition using hidden markov models,” , 1989.

O. Abdel-Hamid, A. Mohamed, Hui Jiang, and G. Penn, “Applying convolutional neural networks concepts to hybrid nn-hmm model for speech recognition,” in ICASSP, march 2012, pp. 4277 –4280.