AiiSH-iR

Benchmark for Speaker Identification using Mel Frequency Cepstral Coefficients on Vowels Preceding Nasal Continuants in Kannada

Show simple item record

dc.contributor.author Arjun, M S
dc.date.accessioned 2024-09-13T05:30:14Z
dc.date.available 2024-09-13T05:30:14Z
dc.date.issued 2015
dc.identifier.uri http://203.129.241.86:8080/xmlui/handle/123456789/5032
dc.description.abstract Identification of speakers in forensic context is generally about comparing voices. In forensic speaker identification, the serious problem is to identify an unfamiliar speaker whose voice has been recorded at some stage in the committing of a crime. Vowels, nasals and fricatives (in decreasing order) are usually suggested for voice recognition because they are somewhat easy to identify in speech signals and their spectra contain features that reliably differentiate speakers based on semi-automatic methods. In this context, the aim of the present study was to obtain the percentage of speaker identification using vowels preceding nasal continuants in Kannada speaking individuals using semi-automatic method. The participants chosen for the study were twenty Kannada speaking adult males in the age range of 21-32 years constituted as Group I. This was further sub grouped (participants reduced) as Group II constituting ten speakers. The material was meaningful mono-, bi-, and/or multisyllabic Kannada words containing long vowels /a:/, /i:/ and /u:/ preceding nasal continuants /m/ and /n/ embedded in Kannada sentences. The participants read the material four times each under two conditions (a) live recording and (b) mobile network recording which were stored into the computer memory. The target words were truncated using the PRAAT software. Each vowel preceding nasal was subjected for Mel Frequency Cepstral Coefficients (MFCCs) using Speech Science lab Workbench for Semi-automatic speaker recognition (vocabulary dependent) software. The same was found across the three conditions when the participants reduced from twenty to ten in number. The study was compared under three conditions: (a) Live vs live recording, (b) Mobile network vs mobile network recording and (c) Live vs mobile network recoding. The results of the present study indicated quite high percent of correct speaker identification using MFCCs in Live vs Live and Mobile network vs Mobile network conditions compared to Live vs mobile network condition. Thus, the present study provided some proof to look at the efficiency of semi-automatic method using MFCC which helps in speaker identification. The obtained outcome would serve as potential measure in the forensic scenario for identification of speakers using vowels preceding nasal continuants in Kannada. en_US
dc.language.iso en en_US
dc.publisher All India Institute of Speech and Hearing en_US
dc.title Benchmark for Speaker Identification using Mel Frequency Cepstral Coefficients on Vowels Preceding Nasal Continuants in Kannada en_US
dc.type PG Dissertations en_US
dc.degree PGDFST en_US
dc.dissno PGDFST-3 en_US
dc.guide Rajasudhakar R en_US
dc.npages 88 en_US
dc.place Mysuru en_US
dc.terms Speaker identification, Mel frequency, Cepstral coefficients, Vowels, Nasal, Kannada en_US


File(s) in this item

This item appears in the following Collection

Show simple item record

Browse

My Account