Son Aramalar

Sonuç bulunamadı

Etiketler

Sonuç bulunamadı

Document

Sonuç bulunamadı

Ana Sayfa Okul Konu

Giriş

CONTENTS ABSTRACT

Share "CONTENTS ABSTRACT"

N/A

N/A

Protected

Akademik yıl: 2021

Info

Protected

Academic year: 2021

Share "CONTENTS ABSTRACT"

Copied!

5

0

0

5

0

0

Yükleniyor.... (view fulltext now)

Şimdi indir ( 5 sayfa )

Tam metin

(1)

CONTENTS

ABSTRACT

... i

ACKNOWLEDGEMENTS

... ii

CONTENTS

... iii

LIST OF TABLES

... v

iii

LIST OF FIGURES

... ix

CHAPTER 1: INTRODUCTION ... 1

CHAPTER 2: SPEAKER RECOGNITION CONCEPTS

... 4

2.1

Overview

... 4

2.1.1 Problem Statement

... 8

2.2 Biometrics

... 10

2.3 Relevant Studies

... 12

2.4

Summary

... 13 CHAPTER 3: SPEECH ... 14

3.1

Overview

... 14

3.2 Nature of Speech

... 14

3.3 Speech Processing

... 14

3.3.1 Speech Signal Acquisition

... 15

3.3.2 Speech Production

... 15

3.4 Designing Effective Speech

... 16

3.5 When to Use Speech

... 17

3.6 Challenges

... 18

3.6.1 Transience: What did you say?

... 19

3.6.2 Invisibility: What can I say?

... 20

3.6.3 Asymmetry

... 20

3.6.4 Speech Synthesis Quality

... 20

3.6.5 Speech Recognition Performance

... 21

3.6.6 Recognition: Flexibility vs. Accuracy

... 21

(2)

3.8 Technical Characteristics and Analysis of the Speech Signal

... 22

3.8.1 Bandwidth

... 23

3.8.2 Oscillogram (Waveform)

... 23

3.8.3 Fundamental Frequency (Pitch)

... 24

3.8.4 Spectrum

... 24

3.8.5 Spectrogram

... 26

3.8.6 Cepstrum

... 28

3.9

Summary

... 29

CHAPTER 4: SPEAKER IDENTIFICATION SYSTEM

... 30

4.1

Overview

... 30

4.2 Speaker Recognition

... 30

4.3 Speaker Identification

... 31

4.3.1 VQ Based Speaker Identification

... 32

4.3.2 Real Time Speaker Identification

... 32

4.3.3 Speaker Pruning

... 34

4.4 Principles of Speaker Identification

... 35

4.5 Verification versus Identification

... 36

4.6 Steps in Speaker Recognition

... 38

4.6.1 Extraction Feature

... 39

4.6.2 Classification

... 39

4.6.2.1 Text Independent Recognition

... 40

4.6.2.2 Text Dependant Recognition

... 40

4.7

Summary

... 43

CHAPTER 5: SPEECH FEATURE EXTRACTION AND VECTOR QUANTIZATION ... 44

5.1

Overview

... 44

5.2 Speech Feature Extraction

... 44

5.2.1 Linear Predictive Coding (LPC)

... 44

5.2.2 Linear Predictive Cepstral Coefficient (LPCC)

... 46

5.2.3 Mel-Frequency Cepstrum Coefficients (MFCC)

... 47

(3)

5.2.3.2 Framing and Windowing

... 49

5.2.3.3 Hamming Window

... 51

5.2.3.4 Fast Fourier Transform (FFT)

... 52

5.2.3.5 Mel Frequency Warping

... 52

5.2.3.6 Discrete Cosine Transform

... 55

5.2.3.7 Cepstrum

... 57

5.3 Cepstral Analysis

... 57

5.4 Summary of Feature Extraction Technniques

... 61

5.5

Summary

... 63

CHAPTER 6: FEATURE MATCHING

... 64

6.1 Overview

... 64

6.2 Speech Feature Matching

... 64

6.3 Quantization

... 65

6.4 Vector Quantization

... 66

6.4.1 Distortion Measure

... 68

6.4.2 Clustering the Training Vectors

... 70

6.5 K-Means Clustering

... 71

6.5.1 Clustering Overview

... 72

6.5.2 Non-Hierarchical Clustering

... 73

6.5.3 K-means Method

... 73

6.5.4 K-means Implementation

... 74

6.6 Summary

... 76

CHAPTER 7: MATLAB BASED SPEAKER RECOGNITION

... 77

7.1 Overview

... 77

7.2 The Speaker Recognition Program

... 77

7.2.1 Option 1: Load a New Sound File From Disk

... 78

7.2.2 Option 2: Play a Sound File From Disk

... 81

7.2.3 Option 3: Display a Sound Waveform From Disk

... 82

7.2.4 Option 4: Display a Sound Waveform From the Database

... 83

(4)

7.2.6 Option 6: Speaker Recognition

... 84

7.2.7 Option 7: Display Sound Power Spectrum

... 86

7.2.8 Option 8: Display Sound With and Without Windowing

... 87

7.2.9 Option 9: Sound Database Information

... 90

7.2.10 Option 10: Display Information of a Sound File in the Database

... 92

7.2.11 Option 11: Delete Sound Database

... 92

7.2.12 Option 12: Help

... 93

7.2.13 Option 13: Exit

... 93

7.3 Steps in Speaker Recognition

... 94

7.3.1

Feature Extraction

... 94

7.3.1.1 Frame Blocking

... 94

7.3.1.2 Windowing

... 95

7.3.1.3 Fast Fourier Transform (FFT)

... 95

7.3.1.4 Mel Frequency Wrapping

... 96

7.3.1.5 Cepstrum

... 96

7.4 Speech Feature Matching

... 96

7.4.1 Vector Quantization

... 96

7.4.2 Distance Measure

... 97

7.4 Results

... 98

7.4.1

Modifying the Centroids (Code Book Size)

... 100

7.4.2 Speaker recognition in the Presence of Noise

... 101

7.5

Summary

... 106

CHAPTER 8: CONCLUSIONS

... 107

REFERENCES

... 109

APPENDIX A (The Main Program)

... 113

speaker.m

... 113

APPENDIX B

... 123

pspectrum.m

... 123

APPENDIX C

... 125

(5)

APPENDIX D

... 127

Cmatrix.m

... 127

APPENDIX E

... 129

Disteu.m

... 129

zero.m

... 129

APPENDIX F

... 130

mel.m

... 130

APPENDIX G

... 131

vqglbg.m

... 131

noise.m

... 131

Referanslar

Şimdi indir ( PDF - 5 sayfa - 84.72 KB )

Benzer Belgeler

NAKİL İŞLEMİNİN GÖKKUŞAĞI ALABALIĞI, Oncorhynchus mykiss (WALBAUM, 1792) YAVRULARININ MALONDİALDEHİT DÜZEYİNE ETKİSİ

Bu çalışmada, nakil uygulanmamış balıkların nakilden hemen sonra (t-0), nakilden 6 saat sonra (t-6) kan, ve kas, karaciğer, böbrek dokularına ait MDA değerlerinin kontrol

Palmprint Recognition with Statistical, Wavelet and Local Feature Extraction Methods

Figure 4.6 shows the performance of feature decision level by using majority voting for left hand palmprint images and the result is 75.95%. Figure 4.7: Feature Decision Level by

CHAPTER FIVE:SCREEN OUTPUTS CHAPTER FOUR:REF'ERENCES CHAPTER THREE:USER MANUAL CHAPTER TWO:DESIGN OF THE SYSTEM CHAPTER ONE:INTRODUCTION ABSTRACT ACKNOWLEDGEMENT CONTENTS

1.0 CHASE OCESS 1.1 CPPLIER ADD PROCESS 1.2 UPPLIER UPDATE PROCESS 1.3 UPPLIER DELETE PROCESS 1.4 PURCHASE GOOD PROCESS 1.5 GOOD ADD PROCESS l.6 ı UPDATE GOOD PROCESS J .7 GOOD

5. SPEECH FEATURE EXTRACTION AND VECTOR QUANTIZATION

MFCCs are computed by taking the windowed frame of the speech signal, putting it through a Fast Fourier Transform (FFT) to obtain certain parameters and

photodecomposition of samples can take place, the sample is placed after the monochromators in scanning instruments while positioning of the sample before the

Age Classification using Facial Feature Extraction

Regarding to this issue shape variation analysis is done by categorizing the differences between ratios during formative years, in order to classify the facial

Vector quantization

Since the encoder output is only a representation of code vectors, in order to reconstruct the signal although with losses, the representation formed by the index numbers must enter

Benzer Belgeler

Plasmonic nanoparticles by laser dewetting of thin metallic films

Plasmonic nanoparticles by laser dewetting of thin metallic films

81

0

0

The comparison of the precision of different dental radiographic methods in mandibular peri-implantary measurements: An in vitro study

The comparison of the precision of different dental radiographic methods in mandibular peri-implantary measurements: An in vitro study

10

0

0

Elmalı yöresinde yayla yetiştiriciliği yapılan domates (Solanum lycopersicum L.) seralarının beslenme durumları nın belirlenmesi

Elmalı yöresinde yayla yetiştiriciliği yapılan domates (Solanum lycopersicum L.) seralarının beslenme durumları nın belirlenmesi

70

0

0

Kurumsallaşma çerçevesinde Arktika’da deniz alanları sınırlandırılması

Kurumsallaşma çerçevesinde Arktika’da deniz alanları sınırlandırılması

103

0

0

Blok zinciri teknolojisi ile veri korunumu sağlanmış oylama uygulaması

Blok zinciri teknolojisi ile veri korunumu sağlanmış oylama uygulaması

5

0

0

Cultural Bases for Self-Evaluation: Seeing Oneself Positively in Different Cultural Contexts

Cultural Bases for Self-Evaluation: Seeing Oneself Positively in Different Cultural Contexts

19

0

0

The cultural burden of architecture

The cultural burden of architecture

9

0

0

Financial performance of Turkish defense firms: a comparative analysis

Financial performance of Turkish defense firms: a comparative analysis

72

0

0