Monday, November 16, 2015

CP7008 SPEECH PROCESSING AND SYNTHESIS

UNIT I    FUNDAMENTALS OF SPEECH PROCESSING

Introduction – Spoken Language Structure – Phonetics and Phonology – Syllables and Words – Syntax and Semantics – Probability, Statistics and Information Theory – Probability Theory – Estimation Theory – Significance Testing – Information Theory.   


UNIT II    SPEECH SIGNAL REPRESENTATIONS AND CODING

Overview of Digital Signal Processing – Speech Signal Representations – Short-Time Fourier Analysis – Acoustic Model of Speech Production – Linear Predictive Coding – Cepstral Processing – Formant Frequencies – The Role of Pitch – Speech Coding – LPC Coder.

UNIT III    SPEECH RECOGNITION

Hidden Markov Models – Definition – Continuous and Semicontinuous HMMs – Practical Issues – Limitations. Acoustic Modeling – Variability in the Speech Signal – Extracting Features – Phonetic Modeling – Adaptive Techniques – Confidence Measures – Other Techniques.

UNIT IV    TEXT ANALYSIS

Lexicon – Document Structure Detection – Text Normalization – Linguistic Analysis – Homograph Disambiguation – Morphological Analysis – Letter-to-Sound Conversion – Prosody – Generation Schematic – Speaking Style – Symbolic Prosody – Duration Assignment – Pitch Generation.

UNIT V    SPEECH SYNTHESIS

Attributes – Formant Speech Synthesis – Concatenative Speech Synthesis – Prosodic Modification of Speech – Source-filter Models for Prosody Modification – Evaluation of TTS Systems.
   
REFERENCES: 

1. Xuedong Huang, Alex Acero, Hsiao-Wuen Hon, “Spoken Language Processing – A Guide to Theory, Algorithm, and System Development”, Prentice Hall PTR, 2001.
2. Thomas F. Quatieri, “Discrete-Time Speech Signal Processing”, Pearson Education, 2002.
3. Lawrence Rabiner and Biing-Hwang Juang, “Fundamentals of Speech Recognition”, Prentice Hall Signal Processing Series, 1993.
4. Sadaoki Furui, “Digital Speech Processing, Synthesis, and Recognition”, Second Edition, Signal Processing and Communications Series, Marcel Dekker, 2000.
5. Joseph Mariani, “Language and Speech Processing”, Wiley, 2009.



