Lecturer: Dr. Yossi Keshet
Teaching assistant: Shua Dissen
Rabiner and Schafer: Theory and Applications of Digital Speech Processing, Prentice Hall, 2010.
Huang, Acero, and Hon: Spoken Language Processing, Prentice Hall, 2001.
Rabiner and Juang: Fundamentals of Speech Recognition, Prentice Hall, 1993.
Deller, Hansen, and Proakis: Discrete-time Processing of Speech Signals, 2000.
Quatieri: Discrete-time Speech Signal Processing, Prentice Hall, 2001.
Lecture 1 - Introduction and signal processing. The matlab code explaining what_is_fft.m (a bonus will be given to anyone how traslate the code into Python)
Lecture 2 - Signal processing and features.
Lecture 3 - Dynamic Time Warping (DTW).
Some of the lecture notes are based on the lecture notes of the speech recognition course given in Columbia University (e6870).
Assignment 1 (corrected version) and it's WAV and transcription (TextGrid) files, additionally here you can find many spoken digits examples. -- Due: May 3, 2017