Digital Speech and Audio Coding
EE-719 / 4 credits
Teacher(s): Magimai Doss Mathew, Motlicek Petr
Language: English
Remark: Next time: Fall 2024
Frequency
Every 2 years
Summary
The goal of this course is to introduce the engineering students state-of-the-art speech and audio coding techniques with an emphasis on the integration of knowledge about sound production and auditory perception through signal processing techniques.
Content
1. Introduction
Human speech production, Music production, Auditory perception, Brief overview on information theory and coding theory.
2. Applied Signal Processing
Brief overview on sampling and quantization, Discrete Fourier transform, Perfect reconstruction, Quadrature mirror filter, Modified discrete cosine transform, Stereo processing, Linear prediction (LP), Auditory filters, Auditory masking, Perceptual auditory models (Johnston's model, MPEG models), Spectral band replication, Temporal noise shaping.
3. Speech Coding
Scalar and Vector quantization, Lossless coding, Waveform and parametric coding, Vocoders, LP coders, Analysis by Synthesis and Code excited LP codec, Adaptive multi-rate (AMR).
4. Audio Coding and Emerging Trends
Perceptual audio coders, MPEG-1, MPEG-2, MPEG-4, Dolby AC, Sony, AMR-WB, Generic coding.
5. Evaluation and Standardization of Audio and Speech coders
Objective evaluation techniques (PESQ, PEAQ), Subjective evaluation techniques (MOS, MUSHRA, BS.1116), Standardization (ITU).
6. Laboratory Exercises
Auditory perception models, Auditory filters, Estimation of masking threshold, Simple perceptual waveform coder, Objective quality evaluation techniques.
Keywords
Speech coding, Audio coding, Speech and music production, Auditory perception.
Learning Prerequisites
Recommended courses
Undergraduate level signal processing, programming in Matlab or similar.
Learning Outcomes
By the end of the course, the student must be able to:
- Formulate
- Analyze
- Contextualise
- Contrast
- Exploit
- Explain
Teaching methods
Lectures, Hands-on lab exercises, Open discussions
Expected student activities
Attend lectures, read literature, carry out lab exercises, question-answering, make presentations on selected topics
Assessment methods
Multiple.
In the programs
- Exam form: Multiple (session free)
- Subject examined: Digital Speech and Audio Coding
- Courses: 30 Hour(s)
- Exercises: 14 Hour(s)
- TP: 12 Hour(s)
- Type: optional