ELEC-E5510 - Speech Recognition D, Lecture, 25.10.2023-8.12.2023
Kurssiasetusten perusteella kurssi on päättynyt 08.12.2023 Etsi kursseja: ELEC-E5510
Reading materials
Suorituksen vaatimukset
Collection of reading materials by topic.
2. Material on project topics
2.15. DNNs for acoustic modeling
- Kaldi tutorial: Kaldi is currently the most popular ASR toolkit in research. We recommend using it for this project. You don't have to complete the tutorial, but reading it can help you understand some basics.
- Ravanelli, Mirco, Titouan Parcollet, and Yoshua Bengio. The pytorch-kaldi speech recognition toolkit. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019.
- Yu, Dong, and Jinyu Li. Recent progresses in deep learning based acoustic models. IEEE/CAA Journal of automatica sinica 4.3 (2017): 396-409.
- Hinton, Geoffrey, et al. Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups. IEEE Signal processing magazine 29.6 (2012): 82-97.
- Povey, Daniel, et al. Purely sequence-trained neural networks for ASR based on lattice-free MMI. Interspeech. 2016.