Speech Recognition and Understanding (11-751/11-551)

Course title: Speech Recognition and Understanding
Course number: 11-751/11-551
Department: Language Technologies Institute (LTI)
Units: 12
Semester: Fall 2000

Time: Monday and Wednesday from 11:30 am to 12:50 pm
First day of class: August 28
Place: Old Student Center 206
Final exam: Fri, December 15, 8:30am - 11:30am DH1217(new!!!)
Grading: Will be based on homeworks (30%), project (30%), and final(40%).

 Instructors: Alex Waibel
 E-mailahw@cs.cmu.edu
 Office Hours: By Appointment
 Teaching Assistant : Yue Pan
 Office: NSH 2602j
 Phone: 8-5477
 E-mail: ypan@cs.cmu.edu
 TA office hours: Friday 3-4pm
 

 Course home page: http://www.is.cs.cmu.edu/11-751


Index


Course Description

Prerequisites: Permission From Instructor (Undergraduates) No prior experience with speech recognition is necessary. This course is primarily for graduate students in LTI, CS, Robotics, ECE, HCI, Psychology, or Computational Linguistics. Others by prior permission of instructor.

Note that 11-751 has been approved for 1 CS coreunit for CSD students.

Course description:
The technology to allow humans to communicate by speech with machines or by which machines can understand when humans communicate with each other is rapidly maturing. This course provides an introduction to the theoretical tools as well as the experimental practice that has made the field what it is today. We will cover theoretical foundations, essential algorithms, major approaches, experimental strategies and current state-of-the-art systems and will introduce the participants to ongoing work in representation, algorithms and interface design. This course is suitable for graduate students with some background in computer science and electrical engineering, as well as for advanced undergraduates.

Text:

Method of evaluation:
Grading will be based on regular homework/lab assignments, a course project and a final exam.

Topics to be covered:



Lecture Schedule / Syllabus (tentative)

Date  Lecturer  Topic 
Mon Aug 28 Waibel Course Overview
Wed Aug 30 Waibel Introduction to the Speech Recognition Problem
Mon Sep 4 No Class Labor Day
Wed Sep 6 No Class ISL Speech Workshop
Mon Sep 11 Waibel Speech production, Signal Processing
Wed Sep 13 Rogina Digital Signal Processing of Speech - Intro
Mon Sep 18 Pan Signals Lab - NSH 2602
Wed Sep 20 Waibel Vector Quantization, Template Matching I
Mon Sep 25  Waibel Template-based recognition II
Wed Sep 27 Stern Digital Signal Processing - Advanced Concepts
Mon Oct 2 Waibel Hidden Markov Models I
Wed Oct 4 Waibel Hidden Markov Models II
Mon Oct 9 Pan Homework Review
Wed Oct 11 Woszczyna Hidden Markov Models III
Mon Oct 16 Rosenfeld Language Modeling I, (week of ICSLP)
Wed Oct 18 Rosenfeld Language Modeling II, (Week of ICSLP)
Mon Oct 23 No Class Mid-Semester Break
Wed Oct 25 Pan Review, Project Definitions Due
Mon Oct 30 Schultz Acoustic Modeling I
Wed Nov 1 Eskenazi Phonetics and prosody
Mon Nov 6 Schultz Acoustic Modeling II
Wed Nov 8 Schultz Speaker Adaptation
Mon Nov 13 Waibel Neural Networks I
Wed Nov 15 Waibel Neural Networks II
Mon Nov 20 Gavalda NLP
Wed Nov 22 No Class Thanksgiving Holiday
Mon Nov 27 Waibel Speech Translation
Wed Nov 29 Denecke Dialog-based interfaces
Mon Dec 6 Waibel Multimodal Interfaces
Sat Dec 16 Project presentations ISL Lab, NSH2602

 


Homeworks and Tests

(Available only from cmu.edu and uka.de domains. )


Term Project

(Available only from cmu.edu and uka.de domains. ) Project Ideas. (Drafts)


Slides

(Available only from cmu.edu and uka.de domains.)
Mantained by ypan@cs.cmu.edu. Last modified: Dec 4, 2000