Lecture 11 (October 25): Chatbots, NLU and ASR
Recording: here (not including last part on speech processing, covered during the group session with no recording)
Mandatory reading:
Jurafsky and Martin, Speech and Language Processing, 3. ed.:
- Ch. 24, "Dialogue systems and chatbots"
- Sections 24.2 & 24.3
- Ch. 25, "Phonetics"
- Sections 25.1-25.5 (excluding the details not discussed in class)
- Chap 26, "Speech Recognition and ASR"
- Sections 26.1 and 26.5 (excluding the part on statistical significance)
Optional reading:
- Henderson, M., Casanueva, I., Mrk?i?, N., Su, P. H., & Vuli?, I. (2019). ConveRT: Efficient and accurate conversational representations from transformers. arXiv preprint arXiv:1911.03688.
- Gao, J., Galley, M., & Li, L. (2018). Neural Approaches to Conversational AI. arXiv preprint arXiv:1809.08267.
- He, Y., Sainath, T. N., Prabhavalkar, R., McGraw, I., Alvarez, R., Zhao, D., ... & Liang, Q. (2019, May). Streaming end-to-end speech recognition for mobile devices. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6381-6385). IEEE.
Lab session
- Finish last part of the lecture on speech processing
- Go through questions on dialogue and speech processing
- Help with obligatory assignment.