Author, Subjects, Keywords

Cited Author

 

 
   » By Author or Editor
 » Browse Author by Alphabet
 » By Journal
 » By Subjects
 » Malaysian Journals
 » By Type
 » By Year
 » By Latest Additions
 
 
   » By Author
 » Top 20 Authors
 » Top 20 Article
 » Top Journal Cited
 » Top Article Cited
 » Journal Citation Statistics
 » Usage Since Sept 2007


 
 
 

Login | Create Account

A Fuzzy Based Approach for Emotion Recognition in Speech

Aishah Abd Razak, and Mohd Hafizuddin Mohd Yusof , and Ryoichi Komiya, (2004) A Fuzzy Based Approach for Emotion Recognition in Speech. In: Proceedings of the Joint Conference on Informatics and Research on Women in ICT (RWICT) 2004 , 28 - 30 July 2004 , Putra World Trade Center Kuala Lumpur, Malaysia.

Full text not available from this repository.

Affiliations

University Cyberjaya Malaysia, Faculty of Information Technology, Multimedia

Abstract

This paper uses LPC analysis to extract emotion features from speech. From this analysis, 18 features namely pitch. Jitter, energy, duration and 14 LPC coefficients are extracted from each voice samples to represent the emotion features of six basic emotions; happiness, sadness, fear. anger, surprise and disgust. These 18 features extracted from different samples give rise to 18 fuzzy sets. However, when we have limited number of samples and the variance range between fuzzy set is large, the choice of a proper fuzzification function is crucial. In this paper, we have devised a fuzzification function, which depends on the variance of the fuzzy set. by introducing structural parameters in the membership function. This structural parameter, s and t in the membership function helped to model the emotion parameter variation for individual emotion and thus improve the recognition rate.

Item Type:Conference or Workshop Item (Paper)
Keywords:LPC analysis, emotion features, fuzzy set, membership functions, emotion recognition
Subjects:Q Science
ID Code:1118

[1] Ryochi Komiya et,al, "A Proposal of Virtual Reality Telecommunication System", Proceedings WEC'99, July 1999, pp. 93-98.

[2] Scherer, K. R., Vocal affect expression: A review and a model for future research. Psychological Bulletin, 99,1986, pp.143-165.

[3] Ekman, P., Darwin and Facial Expression; A Century of Research in Review, New York: Academic Press, 1973.

[4] Izard, C. E., Human Emotions, New York: Plenum Press, 1977.

[5] Plutchik, R., Emotion: A Psycho evolutionary Synthesis, New York: Harper and Row,1980.

[6] Cornelius, R. R., The Science of Emotion: Research and Tradition in the Psychology 0f Emotion, Upper Saddle River, N.J.: Prentice-Hall, 1996.

[7] Banse, R. and Scherer, K.R., "Acoustic profiles in vocal motion expression". Journal of Personality and Social Psychology, 70,1996, pp. 614-636.

[8] Tosa, N. and Nakatsu, R., "Life-like communication agent-emotion sensing character "MIC" and feeling session character "MUSE". Proceedings of IEEE Conference on Multimedia, 1996, pp. 12-19.

[9] Rabiner, L.R. and Schafer, R.W., Digital Processing of Speech Signals, Prentice-Hall, Eaglewood Cliffs, NJ, F, 1978.

[10] Aishah A.R., Mohamad Izani, Z.A., Komiya, R., "Pitch Variation Analysis on Malay and English Voice Samples", APCC. 2003.

[11] Aishah A.R., Mohamad Izani, Z.A., Komiya, R., "A Preliminary Speech Analysis for Emotion Recognition", to appear in IEEE Student Conference on Research and Development (SCORED2003), 2003.

[12] Morishima, S., Harashima, H., "A Media Conversion from Speech to Facial Image for Intelligent Man-Machine Interface," IEEE J. on Sel. Areas in Comm., 1991.

[13] Valery A. P.n, "Emotion in speech:Recognition and application to call centers". Proceedings of the 1999 Conference On Artificial Neural Networks in Engineering (Annie '99), 1999.

[14] Nakatsu, R.,NichoIson ,J., Tosa, N.,"Emotion Recognition and Its Application to Computer Agents with Spontaneous Interactive Capabilities," International Congress of Phonetic Science.

[15] M. Hanmandlu, M.H.M. Yusof and Vamsi K. Madasu, "Fuzzy based approach to the recognition of Multi-Font numerals", 2nd National Conference on Document Analysis and Recognition (NCDAR-2003), Mandya, India, 2003.

[16] Childers, D.G., Speech Processing and Synthesis Toolboxes. John Wiley & Sons, NY, 1999.

[17] Ning, T. and Whiting, S., "Power .spectrum estimation is a orthogonal transformation," Proc. IEEE Conf Acoust., Speech, Signal Process.; 1990, pp.2523-2526.

Repository Staff Only: item control page