Benchmark for Speaker Identification using Mel Frequency Cepstral Coefficients on Vowels Preceding Nasal Continuants in Kannada

Arjun, M S

Home
→
1.Research Reports
→
PG Dissertations
→
View Item

Institutional Repository

All India Institute of Speech and Hearing

dc.contributor.author	Arjun, M S
dc.date.accessioned	2024-09-13T05:30:14Z
dc.date.available	2024-09-13T05:30:14Z
dc.date.issued	2015
dc.identifier.uri	http://203.129.241.86:8080/xmlui/handle/123456789/5032
dc.description.abstract	Identification of speakers in forensic context is generally about comparing voices. In forensic speaker identification, the serious problem is to identify an unfamiliar speaker whose voice has been recorded at some stage in the committing of a crime. Vowels, nasals and fricatives (in decreasing order) are usually suggested for voice recognition because they are somewhat easy to identify in speech signals and their spectra contain features that reliably differentiate speakers based on semi-automatic methods. In this context, the aim of the present study was to obtain the percentage of speaker identification using vowels preceding nasal continuants in Kannada speaking individuals using semi-automatic method. The participants chosen for the study were twenty Kannada speaking adult males in the age range of 21-32 years constituted as Group I. This was further sub grouped (participants reduced) as Group II constituting ten speakers. The material was meaningful mono-, bi-, and/or multisyllabic Kannada words containing long vowels /a:/, /i:/ and /u:/ preceding nasal continuants /m/ and /n/ embedded in Kannada sentences. The participants read the material four times each under two conditions (a) live recording and (b) mobile network recording which were stored into the computer memory. The target words were truncated using the PRAAT software. Each vowel preceding nasal was subjected for Mel Frequency Cepstral Coefficients (MFCCs) using Speech Science lab Workbench for Semi-automatic speaker recognition (vocabulary dependent) software. The same was found across the three conditions when the participants reduced from twenty to ten in number. The study was compared under three conditions: (a) Live vs live recording, (b) Mobile network vs mobile network recording and (c) Live vs mobile network recoding. The results of the present study indicated quite high percent of correct speaker identification using MFCCs in Live vs Live and Mobile network vs Mobile network conditions compared to Live vs mobile network condition. Thus, the present study provided some proof to look at the efficiency of semi-automatic method using MFCC which helps in speaker identification. The obtained outcome would serve as potential measure in the forensic scenario for identification of speakers using vowels preceding nasal continuants in Kannada.	en_US
dc.language.iso	en	en_US
dc.publisher	All India Institute of Speech and Hearing	en_US
dc.title	Benchmark for Speaker Identification using Mel Frequency Cepstral Coefficients on Vowels Preceding Nasal Continuants in Kannada	en_US
dc.type	PG Dissertations	en_US
dc.degree	PGDFST	en_US
dc.dissno	PGDFST-3	en_US
dc.guide	Rajasudhakar R	en_US
dc.npages	88	en_US
dc.place	Mysuru	en_US
dc.terms	Speaker identification, Mel frequency, Cepstral coefficients, Vowels, Nasal, Kannada	en_US