Prosodic Analysis And Modelling For Malay Emotional Speech Synthesis.

Mumtaz B. Mustafa, Raja N. Ainon, Roziati Zainuddin, Zuraidah M. Don, Gerry Knowles, and Salimah Mokhtar (2010) Prosodic Analysis And Modelling For Malay Emotional Speech Synthesis. Malaysian Journal of Computer Science, 23 (2). pp. 102-110. ISSN 0127-9084


Official URL: http://ejum.fsktm.um.edu.my/ArticleInformation.aspx?ArticleID=950

Affiliations

University of Malaya. Faculty of Computer Science & Information Technology
University of Malaya. Faculty of Language and Linguistics
Lingenium Sdn Bhd, Kuala Lumpur, Malaysia
University of Malaya. Computational Speech Group

Abstract

This paper discusses an emotional prosody generator for a Malay speech synthesis system that re-synthesizes a selected vocal emotion from neutral synthesized speech output and improves naturalness by adopting rule-based prosody conversion techniques. The role of prosodic features in emotional expression, particularly fundamental frequency (F0) and duration, has been widely investigated in several research projects. This project attempts to improve the naturalness of synthesized emotional Malay speech by establishing an effective mechanism for the re-synthesis of emotion. Such a mechanism is created by analyzing the variation in the F0 contour of continuous emotional Malay speech against a fixed time period. The emotional prosody generator for Malay developed in the course of this research uses principles of parametric prosody manipulation to synthesize four basic emotions, namely happiness, anger, sadness and fear. A subjective evaluation by means of listening tests was conducted to validate the ability of the emotion generator to produce the prosody needed to synthesize emotional expression. The evaluation results show an overall recognition rate of between 61% and 85%.
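The idea of parametric prosody manipulation mentioned in the abstract can be illustrated with a minimal sketch. It assumes the neutral speech is available as MBROLA `.pho` text, where each line holds a phoneme, a duration in milliseconds, and optional pairs of position (percent) and pitch (Hz). The names `EMOTION_RULES`, `convert_pho_line` and `convert_pho` are hypothetical, and the scaling factors are placeholders chosen only for illustration; they are not the rules derived in the paper, which are based on an analysis of F0 contours.

```python
# Hypothetical per-emotion scaling factors, for illustration only.
# These values are NOT taken from the paper.
EMOTION_RULES = {
    "happiness": {"f0_scale": 1.2, "duration_scale": 0.9},
    "anger": {"f0_scale": 1.3, "duration_scale": 0.85},
    "sadness": {"f0_scale": 0.9, "duration_scale": 1.2},
    "fear": {"f0_scale": 1.25, "duration_scale": 0.8},
}


def convert_pho_line(line, f0_scale, duration_scale):
    """Rescale one MBROLA .pho line: 'phoneme duration_ms [pos% f0_hz]...'."""
    stripped = line.strip()
    # Leave blank lines and ';' comment lines untouched.
    if not stripped or stripped.startswith(";"):
        return line
    fields = stripped.split()
    phoneme, duration = fields[0], float(fields[1])
    out = [phoneme, str(round(duration * duration_scale))]
    # Remaining fields come in (position, pitch) pairs; only pitch is scaled.
    pitch_fields = fields[2:]
    for i in range(0, len(pitch_fields) - 1, 2):
        out.append(pitch_fields[i])
        out.append(str(round(float(pitch_fields[i + 1]) * f0_scale)))
    return " ".join(out)


def convert_pho(text, emotion):
    """Apply one emotion's global F0 and duration scaling to .pho text."""
    rule = EMOTION_RULES[emotion]
    return "\n".join(
        convert_pho_line(line, rule["f0_scale"], rule["duration_scale"])
        for line in text.splitlines()
    )
```

A global scale factor is the simplest possible rule. A rule set closer to the approach the abstract describes would presumably vary the F0 contour over time rather than shift it uniformly.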

Item Type: Journal
Additional Information: This work is supported by a UMRG grant from the University of Malaya, Malaysia.
Keywords: Emotional speech re-synthesis; Prosody conversion; Rule-based approach; MBROLA.
Subjects: Q Science, Computer Science
ID Code: 11757

