Reading materials: DNNs for acoustic modeling

Reading materials

Collection of reading materials by topic.

Kaldi tutorial: Kaldi is currently the most popular ASR toolkit in research. We recommend using it for this project. You don't have to complete the tutorial, but reading it can help you understand some basics.
Ravanelli, Mirco, Titouan Parcollet, and Yoshua Bengio. The pytorch-kaldi speech recognition toolkit. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019.
Yu, Dong, and Jinyu Li. Recent progresses in deep learning based acoustic models. IEEE/CAA Journal of automatica sinica 4.3 (2017): 396-409.
Hinton, Geoffrey, et al. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal processing magazine 29.6 (2012): 82-97.
Povey, Daniel, et al. Purely sequence-trained neural networks for ASR based on lattice-free MMI. Interspeech. 2016.

ELEC-E5510 - Speech Recognition D, Lecture, 3.11.2021-17.12.2021