• Sonuç bulunamadı

when low noise was added to the speech signal (30 dB SNR), but the result decreased to 86.67%

N/A
N/A
Protected

Academic year: 2021

Share "when low noise was added to the speech signal (30 dB SNR), but the result decreased to 86.67%"

Copied!
2
0
0

Yükleniyor.... (view fulltext now)

Tam metin

(1)

ABSTRACT

In this thesis Isolated Words Speech Recognition system was designed. Linear Predictive Coding (LPC), Mel Frequency Cepstral Coefficients (MFCC), and Spectrogram methods were used as feature extraction methods. Artificial Neural Network (ANN) was used as a technique to classify the spoken words to different patterns so the system can recognize unknown spoken words according to these patterns. The developed system has a Graphical User Interface (GUI) that contains many buttons to allow the user to choose the necessary method, train the network, choose trained or not trained spoken words and recognize them. The system allows the user to add noise (in different Signal to Noise Ratio (SNR) values) to the speech signal. 30 spoken words were recorded by the author voice using Audionic AH-112 headphone set. Different methods were used to extract the features of words then the features of 30 words were used to train the neural network. Testing the system was divided into three steps. Step one is the testing of the system with the trained words, step two is the testing of the system with not trained words, and step three is the testing of the system with the trained words with noise added. The system was tested using various numbers of hidden layers. The obtained results show that the number of the hidden layers has no effect on the Recognition Rate (R.R) of the system when trained words were tested. The best R.R obtained for LPC method was 73.3% for not trained words. Using MFCC method the R.R was 83.33% for not trained words. For Spectrogram method R.R was 73.33% for not trained words. Because every method produces a different number of output data from feature extraction process, the number of neurons in the input layer of the neural network was different for each method. The neurons in the input layer of the neural network were 420 neurons-when LPC method was used, 613 neurons-when MFCC method was used, and 4235 neurons-when Spectrogram method was used. The best R.R obtained from testing of the system with trained words was 100% for all the three methods. For MFCC method the best R.R obtained was 100% when low noise was added to the speech signal (30 dB SNR) and 96.67% when high noise was added to the speech signal (5 dB SNR). For LPC method the best R.R obtained was 70% when noise was added to the speech signal. For Spectrogram method the R.R was 100%

when low noise was added to the speech signal (30 dB SNR), but the result decreased to 86.67%

when high noise was added to the speech signal (5 dB SNR). Finally the simulation results demonstrate that the best method used for feature extraction was MFCC comparing with LPC

i

(2)

and Spectrogram methods. MFCC method has low number of output data produced comparing with Spectrogram method.

Key Words: Speech Recognition system, LPC, MFCC, Spectrogram, Neural Networks.

ii

Referanslar

Benzer Belgeler

This is one of the few studies which report the trend in the rates of different complications of resident phaco surgery according to number of operations performed, and it

The higher the learning rate (max. of 1.0) the faster the network is trained. However, the network has a better chance of being trained to a local minimum solution. A local minimum is

• The Rashidun army was the primary military body of the Muslims during the Muslim conquests of the 7th century, serving alongside the Rashidun navy.. • The three most

Doğum ağırlığı, sütten kesme ağırlığı, anne sütü, ergin inek bedeni ile ilgili masrafların da dahil olduğu hayvan başına diğer hayvanlardan farklılığın dolar

AIM: To check the different shape of the glow curves of each material and to assess the number of peaks present.. Irradiation (0.5 Gy for synthetic materials, 15 Gy for

Naqvi, Noorul Hasan (2001) wrote a book in Urdu entitled "Mohammadan College Se Muslim University Tak (From Mohammadan College to Muslim University)".^^ In this book he

The T-test results show significant differences between successful and unsuccessful students in the frequency of using the six categories of strategies except

The adsorbent in the glass tube is called the stationary phase, while the solution containing mixture of the compounds poured into the column for separation is called