Time: Monday and Wednesday from 11:30 am to 12:50 pm
First day of class: August 28
Place: Old Student Center 206
Final exam:
Fri, December 15, 8:30am - 11:30am DH1217(new!!!)
Grading: Will be based on homeworks (30%), project (30%), and
final(40%).
Instructors: Alex
Waibel
E-mail: ahw@cs.cmu.edu
Office Hours: By Appointment
Teaching Assistant : Yue Pan
Office: NSH 2602j
Phone: 8-5477
E-mail: ypan@cs.cmu.edu
TA office hours: Friday 3-4pm
Course home page: http://www.is.cs.cmu.edu/11-751
Note that 11-751 has been approved for 1 CS coreunit for CSD students.
Course description:
The technology to allow humans to communicate by speech with machines
or by which machines can understand when humans communicate with each other
is rapidly maturing. This course provides an introduction to the theoretical
tools as well as the experimental practice that has made the field what
it is today. We will cover theoretical foundations, essential algorithms,
major approaches, experimental strategies and current state-of-the-art
systems and will introduce the participants to ongoing work in representation,
algorithms and interface design. This course is suitable for graduate students
with some background in computer science and electrical engineering, as
well as for advanced undergraduates.
Text:
Topics to be covered:
| Date | Lecturer | Topic | |
| Mon Aug 28 | Waibel | Course Overview | |
| Wed Aug 30 | Waibel | Introduction to the Speech Recognition Problem | |
| Mon Sep 4 | No Class | Labor Day | |
| Wed Sep 6 | No Class | ISL Speech Workshop | |
| Mon Sep 11 | Waibel | Speech production, Signal Processing | |
| Wed Sep 13 | Rogina | Digital Signal Processing of Speech - Intro | |
| Mon Sep 18 | Pan | Signals Lab - NSH 2602 | |
| Wed Sep 20 | Waibel | Vector Quantization, Template Matching I | |
| Mon Sep 25 | Waibel | Template-based recognition II | |
| Wed Sep 27 | Stern | Digital Signal Processing - Advanced Concepts | |
| Mon Oct 2 | Waibel | Hidden Markov Models I | |
| Wed Oct 4 | Waibel | Hidden Markov Models II | |
| Mon Oct 9 | Pan | Homework Review | |
| Wed Oct 11 | Woszczyna | Hidden Markov Models III | |
| Mon Oct 16 | Rosenfeld | Language Modeling I, (week of ICSLP) | |
| Wed Oct 18 | Rosenfeld | Language Modeling II, (Week of ICSLP) | |
| Mon Oct 23 | No Class | Mid-Semester Break | |
| Wed Oct 25 | Pan | Review, Project Definitions Due | |
| Mon Oct 30 | Schultz | Acoustic Modeling I | |
| Wed Nov 1 | Eskenazi | Phonetics and prosody | |
| Mon Nov 6 | Schultz | Acoustic Modeling II | |
| Wed Nov 8 | Schultz | Speaker Adaptation | |
| Mon Nov 13 | Waibel | Neural Networks I | |
| Wed Nov 15 | Waibel | Neural Networks II | |
| Mon Nov 20 | Gavalda | NLP | |
| Wed Nov 22 | No Class | Thanksgiving Holiday | |
| Mon Nov 27 | Waibel | Speech Translation | |
| Wed Nov 29 | Denecke | Dialog-based interfaces | |
| Mon Dec 6 | Waibel | Multimodal Interfaces | |
| Sat Dec 16 | Project presentations | ISL Lab, NSH2602 |