Study of Speaker Recognition Systems

Panda, Ashish Kumar and Sahoo, Amit Kumar (2011) Study of Speaker Recognition Systems. BTech thesis.

Abstract

Speaker recognition is the computing task of validating a user’s claimed identity using characteristics extracted from their voice. It is one of the most useful and popular biometric recognition techniques, especially in areas where security is a major concern. It can be used for authentication, surveillance, forensic speaker recognition and a number of related activities.

Speaker recognition can be classified into identification and verification. Speaker identification is the process of determining which registered speaker provides a given utterance. Speaker verification, on the other hand, is the process of accepting or rejecting the identity claim of a speaker.
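The difference between the two tasks can be illustrated with a minimal sketch. It assumes each speaker is already represented by a single fixed-length feature vector and that closeness is measured by Euclidean distance; the speaker names, vectors, threshold and function names are hypothetical and are not taken from the thesis.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def identify(features, enrolled):
    """Identification: return the registered speaker whose template is closest."""
    return min(enrolled, key=lambda name: euclidean(features, enrolled[name]))

def verify(features, claimed, enrolled, threshold):
    """Verification: accept the identity claim only if the distance to the
    claimed speaker's template is within the threshold."""
    return euclidean(features, enrolled[claimed]) <= threshold

# Hypothetical enrolled templates (toy 3-dimensional feature vectors).
enrolled = {
    "alice": [1.0, 2.0, 3.0],
    "bob": [4.0, 0.0, 1.0],
}
utterance = [1.1, 2.1, 2.9]  # features of the unknown utterance (assumed)

print(identify(utterance, enrolled))
print(verify(utterance, "alice", enrolled, threshold=0.5))
print(verify(utterance, "bob", enrolled, threshold=0.5))
```

Identification is a 1-of-N decision over all registered speakers, whereas verification is a yes/no decision against one claimed identity and therefore needs a threshold.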

The process of speaker recognition consists of two modules: feature extraction and feature matching. Feature extraction is the process of extracting a small amount of data from the voice signal that can later be used to represent each speaker. Feature matching identifies the unknown speaker by comparing the features extracted from his/her voice input with those from a set of known speakers.

Our proposed work consists of truncating a recorded voice signal, framing it, passing it through a window function, calculating the short-term FFT, extracting its features and matching them with a stored template. Cepstral coefficient calculation and Mel Frequency Cepstral Coefficients (MFCC) are applied for feature extraction. The VQLBG (Vector Quantization via Linde-Buzo-Gray), DTW (Dynamic Time Warping) and GMM (Gaussian Mixture Modelling) algorithms are used for template generation and feature matching.
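Two of the steps named above, framing with a window function and DTW-based matching, can be sketched as follows. This is a rough illustration under stated assumptions, not the implementation from the thesis: the frame length, hop size, Hamming window choice and toy feature sequences are all assumed, and the FFT and MFCC stages that would normally sit between windowing and matching are omitted.

```python
import math

def frame_signal(signal, frame_len, hop):
    """Split a signal into overlapping frames; trailing samples that do not
    fill a whole frame are dropped."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]

def hamming(n):
    """Hamming window of length n (n >= 2)."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * k / (n - 1)) for k in range(n)]

def apply_window(frame, window):
    """Multiply a frame sample-by-sample with the window."""
    return [s * w for s, w in zip(frame, window)]

def local_cost(u, v):
    """Euclidean distance between two per-frame feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))

def dtw_distance(a, b):
    """Dynamic Time Warping distance between two sequences of feature vectors."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = local_cost(a[i - 1], b[j - 1])
            # Extend the cheapest of the three allowed predecessor paths.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

# Toy signal: 400 samples of a sine wave (assumed, stands in for recorded speech).
signal = [math.sin(0.1 * t) for t in range(400)]
frames = frame_signal(signal, frame_len=100, hop=50)
window = hamming(100)
windowed = [apply_window(f, window) for f in frames]

# Toy 1-dimensional feature sequences standing in for per-frame MFCC vectors.
a = [[0.0], [1.0], [2.0]]
b = [[0.0], [1.0], [1.0], [2.0]]   # same shape as a, spoken more slowly
c = [[5.0], [5.0], [5.0]]          # a different pattern

print(len(frames), dtw_distance(a, b), dtw_distance(a, c))
```

DTW lets sequences of different lengths be compared, which is why `a` and `b` match with zero cost even though `b` repeats a frame; a stored template would be compared with the test utterance in the same way.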

Item Type:Thesis (BTech)
Uncontrolled Keywords:Short Term Fourier Transform, Discrete Cosine Transform, Mel Frequency Cepstral Coefficients, Vector Quantization using Linde, Buzo and Gray, Dynamic Time Warping, Gaussian Mixture Modeling
Subjects:Engineering and Technology > Electronics and Communication Engineering > Signal Processing
Divisions: Engineering and Technology > Department of Electronics and Communication Engineering
ID Code:2450
Deposited By:Sahoo Amit Pappu
Deposited On:17 May 2011 14:54
Last Modified:17 May 2011 14:54
Supervisor(s):Meher, S
