Monday, November 2, 2015

DS7301 SPEECH AND AUDIO SIGNAL PROCESSING

DS7301 SPEECH AND AUDIO SIGNAL PROCESSING

UNIT I MECHANICS OF SPEECH AND AUDIO

Introduction - Review Of Signal Processing Theory-Speech production mechanism – Nature of Speech signal – Discrete time modelling of Speech production – Classification of Speech sounds – Phones – Phonemes – Phonetic and Phonemic alphabets – Articulatory features. Absolute Threshold of Hearing - Critical Bands- Simultaneous Masking, Masking-Asymmetry, and the Spread of MaskingNonsimultaneous Masking - Perceptual Entropy - Basic measuring philosophy -Subjective versus objective perceptual testing - The perceptual audio quality measure (PAQM) - Cognitive effects in judging audio quality. 

UNIT II TIME-FREQUENCY ANALYSIS: FILTER BANKS AND TRANSFORMS

Introduction -Analysis-Synthesis Framework for M-band Filter Banks- Filter Banks for Audio Coding: Design Considerations - Quadrature Mirror and Conjugate Quadrature Filters- Tree-Structured QMF and CQF M-band Banks - Cosine Modulated “Pseudo QMF” M-band Banks - Cosine Modulated Perfect Reconstruction (PR) M-band Banksand the Modified Discrete Cosine Transform (MDCT) Discrete Fourier and Discrete Cosine Transform - Pre-echo Distortion- Pre-echo Control Strategies.

UNIT III AUDIO CODING  AND TRANSFORM CODERS

LosslessAudioCoding-LossyAudioCoding- ISO-MPEG-1A,2A,2A Advaned , 4A udioCoding - Optimum Coding in the Frequency Domain - Perceptual Transform Coder -Brandenburg-Johnston Hybrid Coder - CNET Coders - Adaptive Spectral Entropy Coding -Differential Perceptual Audio Coder - DFT Noise Substitution -DCT with Vector Quantization -MDCT with Vector Quantization. 

UNIT IV TIME AND FREQUENCY DOMAIN METHODS FOR SPEECH PROCESSING

Time domain parameters of Speech signal – Methods for extracting the parameters :Energy, Average Magnitude – Zero crossing Rate – Silence Discrimination using ZCRand energy Short Time Fourier analysis – Formant extraction – Pitch Extraction using time and frequency domain methodsHOMOMORPHIC SPEECH ANALYSIS: Cepstral analysis of Speech – Formant and Pitch Estimation – Homomorphic Vocoders. 

UNIT V LINEAR PREDICTIVE ANALYSIS OF SPEECH

Formulation of Linear Prediction problem in Time Domain – Basic Principle – Auto correlation method – Covariance method – Solution of LPC equations – Cholesky method – Durbin’s Recursive algorithm – lattice formation and solutions – Comparison of different methods – Application of LPC parameters – Pitch detection using LPC parameters – Formant analysis – VELP – CELP. 

REFERENCES: 

1. Digital Audio Signal Processing, Second Edition, Udo Zölzer, A John Wiley& sons Ltd Publicatioons 
2. Applications of Digital Signal Processing to Audio And Acoustics       Mark Kahrs, Karlheinz Brandenburg, KLUWER ACADEMIC PUBLISHERS NEW YORK, BOSTON, DORDRECHT, L ONDON , MOSCOW 
3. Digital Processing of Speech signals – L.R.Rabiner and R.W.Schaffer - Prentice Hall –1978







DIGITAL AUDIO SIGNAL PROCESSING, 2ND EDITION (English) 2 2nd Edition








No comments:

Post a Comment