Author, Subjects, Keywords

Cited Author

 

 
   » By Author or Editor
 » Browse Author by Alphabet
 » By Journal
 » By Subjects
 » Malaysian Journals
 » By Type
 » By Year
 » By Latest Additions
 
 
   » By Author
 » Top 20 Authors
 » Top 20 Article
 » Top Journal Cited
 » Top Article Cited
 » Journal Citation Statistics
 » Usage Since Sept 2007


 
 
 

Login | Create Account

Malay Speaker Dependent Digits Recognition with Improved Backpropagation

Ummu Salamah Mohamad, and Ramlan Mahmod, and Siti Mariyam Shamsuddin, (2004) Malay Speaker Dependent Digits Recognition with Improved Backpropagation. Jurnal Teknologi Maklumat dan Multimedia, 1 . pp. 1-14. ISSN 18230113

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
344Kb

Affiliations

Universiti Pertanian Malaysia, Faculty of Computer Science & Information Technology
Universiti Pertanian Malaysia. Faculty of Computer Science & Information Technology
Universiti Teknologi Malaysia. Faculty of Computer Science & Information System

Abstract

ABSTRACT: This paper presents a study of a Malay speaker dependent recognition using improved Neural Network (NN). The performances are evaluated for recognition of the isolated Malay digits of "0" through "9". The Error Backpropagation (BP) and an improved error signal of the BP are used in this study. Experiments are carried out by comparing the recognition rates and convergence time of the standard BP and improved BP, as well as the effects of normalisation techniques on Malay speaker dependent data. The utterances are represented using the Linear Prediction Coding (LPC) method. The results show that the improved BP outperforms the standard BP in terms of its convergence with better recognition rates for unnormalised data. For the effects of normalisation data, the unit simple method gives better result compared to unit range and unit variance with improved BP gives faster convergence and higher recognition rates.

ABSTRAK: Makalah ini membincangkan kajian pengecaman jurucakap Melayu tidak bebas dengan menggunakan rangkaian neural pembaikan. Prestasi dinilai untuk pengecaman digit terpisah Melayu '0' hingga '9'. Rambatan balik dan isyaratralat pembaikan telah digunakan dalam kajian ini. Ujikaji telah dijalankan dengan membandingkan kadar pengecaman dan masa penumpuan rambatan balik piawai, rambatan balik pembaikan dan kesan teknik-teknik penormalan pada datajurucakap tidak bebas Melayu. Ujaran-ujaran diwakili dengan pengkodan ramalan lelurus. Keputusan menunjukkan bahawa, rambatan balik pembaikan mengatasi rambatan balik piawai dari sudut penumpuan dengan kadar pengecaman yang lebih baik untuk data yang tidak dinormalkan. Bagi kesan penormalan data, kaedah mudah memberikan keputusan baik berbanding dengan unit julat dan unit varians dengan rambatan balik pembaikan memberikan penumpuan lebih cepat dan kadar pengecaman yang lebih tinggi

Item Type:Journal
Keywords:Malay language, Spoken Malay, Neural networks techniques; audio data
Subjects:Q Science, Computer Science
P Language and Literature
ID Code:8158

Azoff, E. M. 1994. Neural Network Time Series Forecasting of Financial Market. Chichester: John Wiley & Sons.

Baert, L. 1995. Digital Audio and Compact Disc Technology, 3rd Edition. Oxford: Focal Press.

Fausett, L. 1994. Fundamentals of Neural Networks: Architectures Algorithms and Applications. London: Prentice Hall.

Kalman, B. and Kwasny, S. C. 1991. A Superior Error Function for Training Neural Network. International Joint Conference of Neural Network 2: 42-52.

Kevans, L. and Rodman, R. D. 1997. Voice Recognition. London: Artech House.

Markowitz, J.A. 1996. Using Speech Recognition. London: Prentice Hall. Morgan, D. P. 1991. Neural Networks and Speech Processing. Boston: Kluwer Academic Publishers.

Nakagawa, S. 1995. Speech, Hearing and Neural Networks Models. Amsterdam: lOS Press.

Rabiner, L. R. 1993. Fundamentals of Speech Recognition. London: Prentice Hall. Rabiner, L. R. 1995. Impact of Voice Processing on Modern Telecommunications. Speech Communication 17(3-4): 217-226.

Shamsuddin, S. M., Sulaiman M. N. and Darus, M. 2001. An Improved Error Signal of BP Model for Classification Problems. International Journal of Computer Mathematics 76: 297-305.

Torkkola, K. 1994. Stochastic Models and Artificial Neural Networks for Automatic Speech Recognition, In Eric Keller (Eds.) Fundamentals of Speech Synthesis and By Speech Recognition: Basic Concepts, State of the Art and Future Challenges. New York JohnWiley.

Repository Staff Only: item control page