Author, Subjects, Keywords

Cited Author

 

 
   » By Author or Editor
 » Browse Author by Alphabet
 » By Journal
 » By Subjects
 » By Affiliations
 » By Type
 » By Year
 » By Latest Additions
 
 
   » By Author
 » Top 20 Authors
 » Top 20 Article
 » Top 20 Journal Cited
 » Top 20 Cited
 » Top 20 Author Cited
 » Usage Since Sept 2007


 
 
 

Login | Create Account

Digit Recognition Using Neural Networks

Tan, Chin Luh and Jantan, Adznan (2004) Digit Recognition Using Neural Networks. Malaysian Journal of Computer Science, 17 (2). pp. 40-54. ISSN 0127-9084

Full text not available from this repository.

Official URL: http://mjcs.fsktm.um.edu.my/detail.asp?AID=308

Affiliations

Universiti Putra Malaysia

Abstract

This paper investigates the use of feed-forward multi-layer perceptrons trained by back-propagation in speech recognition. Besides this, the paper also proposes an automatic technique for both training and recognition. The use of neural networks for speaker independent isolated word recognition on small vocabularies is studied and an automated system from the training stage to the recognition stage without the need of manual cropping for speech signals is developed to evaluate the performance of the automatic speech recognition (ASR) system. Linear predictive coding (LPC) has been applied to represent speech signal in frames in early stage. Features from the selected frames are used to train multilayer perceptrons (MLP) using back-propagation. The same routine is applied to the speech signal during the recognition stage and unknown test patterns are classified to the nearest patterns. In short, the selected frames represent the local features of the speech signal and all of them contribute to the global similarity for the whole speech signal. The analysis, design and development of the automation system are done in MATLAB, in which an isolated word speaker independent digits recogniser is developed.

Item Type:Journal
Keywords:Digits recognition, Feed-forward back-propagation, Linear predictive coding, Neural networks, Speech recognition
Subjects:Q Science
ID Code:487

L. R. Rabiner, B. H. Juang, Fundamental of Speech Recognition. Prentice Hall, New Jersey, 1993.

P. G. J. Lisboa, Neural Networks Current Application. Chapman & Hall, 1992.

B. A. St. George, E. C. Wooten, L, Sellami, “Speech Coding and Phoneme Classification Using MATLAB and NeuralWorks”, in Education Conference, North-Holland University, 1997.

M. Nakamura, K. Tsuda, J. Aoe, “A New Approach to Phoneme Recognition by Phoneme Filter Neural Networks”. Information Sciences Elsevier, Vol. 90, 1996, pp. 109-119.

J. Frankel, K. Richmond, S. King, P. Taylor, “An Automatic Speech Recognition System Using Neural Networks and Linear Dynamic Models to Recover and Model Articulatory Traces”, in Proceeding ICSLP, University of Edinburgh, 2000.

J. T. Jiang, A. Alwan, P. A. Keating, E. T. Auer L. E. Jr, Bernstein, “On the Relationship between Face Movements, Tongue Movements, and Speech Acoustics”. EURASIP Journal on Applied Signal Processing, Vol. 11, 2002, pp. 1174-1188.

X. Z. Zhang, C.C. Broun, R. M. Mersereau, M. A. Clements, “Automatic Speechreading with Applications to Human-Computer Interfaces”. EURASIP Journal on Applied Signal Processing, Vol. 11, 2002, pp. 1228- 1247.

H. S. Li, J. Liu, R. S. Liu, “High Performance Mandarin Digit Speech Recognition”. Journal of Tsinghua University (Science and Technology), 2000.

H. S. Li, M. J. Yang, R. S. Liu, “Mandarin Digital Speech Recognition Adaptive Algorithm”. Journal of Circuits and Systems, Vol. 4, No. 2, 1999.

H. Demuth, M. Beale, Neural Network Toolbox. The Math Works, Inc., Natick, MA, 2000.

S. Peeling, R. Moore, “Experiments in Isolated Digit Recognition Using the Multi-Layer Perceptron”. Technical Report 4073, Royal Speech and Radar Establishment, Malvern, Worcester, Great Britain, 1987.

B. Kammerer, W. Kupper, “Experiments for Isolated-Word Recognition with Single and Multi-Layer Perceptrons”. Abstracts of 1st Annual INNS Meeting, Boston, 1988.

D. Burr, “Experiments on Neural Net Recognition of Spoken and Written Text”, in IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 36, 1988, pp. 1162-1168.

L. R. Rabiner, M. R. Sambur, “An Algorithm for Determining the Endpoints for Isolated Utterances”. The Bell System Technical Journal, Vol. 54, No. 2, 1975, pp. 297-315.

Y. Shiraki, M. Honda, “LPC Speech Coding Based on Variable-Length Segment Quantization”. IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 36, No. 9, 1988.

T. F. Quatieri, Discrete-Time Speech Signal Processing Principles and Practice. Prentice Hall, USA, 2001.

S. Haykin, Neural Networks, A Comprehensive Foundation. Prentice Hall, New Jersey, 1999.

G. D. Magoulas, M. N. Vrahatis, G. S. Androulakis, “Improving the Convergence of the Backpropagation Algorithm Using Learning Rate Adaptation Methods”. Neural Computation, Vol. 11, 1999, pp. 1769-1796.

N. K. Kasabov, Foundations of Neural Network, Fuzzy Systems, and Knowledge Engineering. The MIT Press Cambridge, London, 1996.

T. F. Li, “Speech Recognition of Mandarin Monosyllables”. The Journal of the Pattern Recognition Society, 2003.

Repository Staff Only: item control page