Author, Subjects, Keywords

Cited Author

 

 
   » By Author or Editor
 » Browse Author by Alphabet
 » By Journal
 » By Subjects
 » Malaysian Journals
 » By Type
 » By Year
 » By Latest Additions
 
 
   » By Author
 » Top 20 Authors
 » Top 20 Article
 » Top Journal Cited
 » Top Article Cited
 » Journal Citation Statistics
 » Usage Since Sept 2007


 
 
 

Login | Create Account

Time-Domain Features And Probabilistic Neural Network For The Detection Of Vocal Fold Pathology.

Hariharan, M. , and Paulraj, M.P. , and Sazali Yaacob , (2010) Time-Domain Features And Probabilistic Neural Network For The Detection Of Vocal Fold Pathology. Malaysian Journal of Computer Science, 23 (1). pp. 60-67. ISSN 0127-9084

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
112Kb

Official URL: http://ejum.fsktm.um.edu.my/ArticleInformation.aspx?ArticleID=878

Affiliations

Universiti Malaysia Perlis. School of Mechatronic Engineering

Abstract

Due to the nature of job, unhealthy social habits and voice abuse, people are subjected to the risk of voice problems. It is well known that most of vocal fold pathologies cause changes in the acoustic voice signal. Therefore, the voice signal can be a useful tool to diagnose them. Acoustic voice analysis can be used to characterize the pathological voices. This paper presents the detection of vocal fold pathology with the aid of the speech signal recorded from the patients. The speech samples from Massachusetts Eye and Ear Infirmary (MEEI) database are used to evaluate the scheme. Time-domain features based on energy variation are proposed and extracted from the speech to form a feature vector. In order to test the effectiveness and reliability of the proposed time-domain features, a Probabilistic Neural Network (PNN) is employed. The experimental results show that the proposed features gives very promising classification accuracy and can be effectively used to detect the vocal fold pathology clinically.

Item Type:Journal
Keywords:Acoustic Analysis, Vocal Fold Pathology, Time-Domain Features, Probabilistic Neural Network
Subjects:Q Science, Computer Science
ID Code:11754

1. Carlos Hernandez-Espinosa, Mercedes Fernandez-Redondo, Santiago Aguilera-Navarro, Pedro-gomez-Vilda,Godino-Llorente J.I, Gomez-Vilda P, “Diagnosis of Vocal and Voice Disorders by the Speech Signal”, Proceedings of IEEE-INNS-ENNS International Joint Conference on Neural Networks,2000, vol.4, pp.253-258.

2. Boyanov B. and Stefan Hadjitodorov, “Acoustic Analysis of Pathological Voices. A voice analysis system for the screening of laryngeal diseases”, Proceedings of IEEE International Conference on Engineering in Medicine and Biology Society, 1997,vol.16, pp.74-82.

3. Kausya H, Ogawa,S., Mashima,K., “Normalized noise energy as an acoustic measure to evaluate pathologic voice”, Journal of the Acoustical Society of America, 1986, vol.80,pp.1329-1334.

4. Martinez Cesar E, Rufiner hugo L., “Acoustic Analysis of Speech for Detection of Laryngeal Pathologies”, Proc. of 22nd Annual IEEE International Conference on Engineering in Medicine and Biology Society, 2000, vol.3, pp.2369-2372.

5. Dimitar D. Deliyski, “Acoustic Model and Evaluation of Pathological Voice Production”, Third International Conference on Speech Communication and Technology, EUROSPEECH’93, 1993, pp1969-72.

6. Godino-Llorente J.I, Gomez-Vilda P, and Blanco-Velasco M., “Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-term Cepstral Parameters”, IEEE Transactions on Biomedical Engineering, 2006, vol.53, pp.1943-1953.

7. Ritchings, T., McGillion, M., and Moore, C., “Pathological voice quality assessment using artificial neural networks”, Medical Engineering and Physics,2002, vol.24, no.8, pp.561-564.

8. Childers,D.G, Keun Sung Bae, “Detection of Laryngeal Function Using Speech and Electroglottograph Data”, IEEE Transactions on Biomedical Engineering, 1992, vol,39,pp.19-25.

9. Massachusetts Eye and Ear Infirmary, “Voice Disorders Database”, Version 1.03(CDROM), Kay Elemetrics Corporation, 1994, Lincoln Park, NJ, USA.

10. Boyanov B, Ivanov Tm Cholet G, “Robust hybrid pitch detector”, Electronics letters, 1980, vol.29, pp. 1924-1926.

11. Feijoo S, Hernandez C, “Short-tem stability measures for the evaluation of voice quality”, Journal of Speech and Hearing Research, 1990, vol. 33, pp.324-334.

12. Kasuya H, Endo and Sliu S, “Novel acoustic measurements of jitter and shimmer characteristics from pathological voice”, Proceedings of EUROSPEECH’93,1993, pp.1973-1976.

13. Ludlow C, Bassich C, Connor N, Coulter D, Lee Y, “The validity of using phonatory jitter and shimmer to detect laryngeal pathology”, Laryngeal function in phonation and respiration, Brown & Co., Boston, 1987, pp.492-508.

14. Yunik M and Boyanov B, “Method for evaluation of the noise-to harmonic-component ratios in pathological and normal voices”, Acoustica, 1990, vol. 70, pp. 89-91.

15. Eiji Yumoto, Wilbur J, Gould, “Harmonics to noise ratio as an index of the degree of hoarseness”, The Journal of the Acoustical Society of America, 1990, vol.87, no. 3, pp.1278-1289.

16. DeKrom G, “A cepstrum based technique for determining a harmonics to noise ratio in speech signals”, Journal of Speech and Hearing Research, 1993, vol. 36, pp. 254-266.

17. Tim Ritchings, Mark A. McGillion, Christopher J. Moore, “Objective assessment of pathological voice quality using multi-layer perceptrons”, Proc. of the First Joint EMBS/BMES Conference, 1999, vol. 2, pp.925.

18. Tim Ritchings, Mark A. McGillion, Christopher J. Moore,GV Conroy, “Objective assessment of pathological voice quality”, IEEE International Conference on Systems, Man, and Cybernetics, 1999, vol.6,pp. 340-345.

19. J. I. Godino-Llorente and P. Gomez-Vilda, “Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors,” IEEE Transactions on Biomedical Engineering, 2004,vol. 51, no. 2, pp. 380–384.

20. Mirjam Wester, “Automatic classification of voice quality: comparing regression models and hidden Markov models”, Proceedings of VOICEDATA98, symposium on databases in voice quality research and education, 1998, pp. 92-97.

21. K. Umapathi, S. Krishnan, V. Parsa, and D. G. Jamieson, “Discrimination of pathological voices using a time-frequency approach,” IEEE Transactions on Biomedical Engineering, 2005, vol. 52, no. 3, pp. 421–430.

22. T. Ananthakrishna T, Kumara Shama, and U.C. Niranjan, “k-means nearest neighbor classifier for voice pathology”, IEEE INDIA Annual Conference, 2004, pp.352-354.

23. Kumara Shama, Ananthakrishna, and Niranjan U. Cholayya, “Study of harmonics-to-noise ration and critical band energy spectrum of speech as acoustic indicators of laryngeal and voice pathology”, EURASIP Journal on Advances in Signal Processing, Hindawi Publishing Corporation, 2007, vol. 2007, pp.1-9.

24. S.N Sivanandam and Paulraj M P, “An Introduction to Artificial Neural Networks”,2003, Vikhas Publication, India.

25. Specht, D, “Probabilistic neural networks”, Neural Networks,1990, vol.3. no.1, pp. 109-118.

26. Laurene Fausett, “Fundamentals of Neural Network”, 1994,Prentice Hall, New Jersey, USA.

27. Kohavi, R. “A study of cross validation and bootstrap for accuracy estimation and model selection” In Proceedings of the 14th International Conference on Artificial Intelligence, 1995, pp. 1137-1143.

Repository Staff Only: item control page