Evaluation of the modified group delay feature for isolated word recognition

File Size Format
32210.pdf 795Kb Adobe PDF
Title Evaluation of the modified group delay feature for isolated word recognition
Author Alsteris, Leigh; Paliwal, Kuldip Kumar
Publication Title The 8th International Symposium on Signal Processing and Its Applications (ISSPA-2005)
Editor B. Boashash
Year Published 2005
Place of publication Wollongong, NSW, Australia
Publisher IEEE
Abstract The results of our recent human perception experiments indicate that the short-time phase spectrum can significantly contribute to speech intelligibility over small window durations (i.e., 20–40 ms). This motivates us to investigate the use of the short-time phase spectrum to derive features for automatic speech recognition, which generally uses small window durations of 20–40 ms for spectral analysis. In this paper, we specifically investigate the frequency-derivative of the short-time phase spectrum (i.e., group delay function, GDF) from which to extract features. We demonstrate, with some simple examples, the volatility of the GDF to noise, pitch epochs and windowing effects. We summarise the work by Yegnanarayana and Murthy on the modified GDF (MGDF), which serves to remedy the problems of the GDF. We then implement Murthy and Gadde's MGDF-based features (MODGDF) to determine if they provide an improvement over the popular MFCC representation either by themselves or in combination with MFCCs on an isolated word recognition task.
Peer Reviewed Yes
Published Yes
Publisher URI http://ieeexplore.ieee.org/servlet/opac?punumber=10550
Alternative URI http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1581038
Copyright Statement Copyright 2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
ISBN 0780392442
Conference name The 8th International Symposium on Signal Processing and Its Applications (ISSPA-2005)
Location Sydney, Australia
Date From 2005-08-28
Date To 2005-08-31
URI http://hdl.handle.net/10072/2577
Date Accessioned 2006-02-22
Date Available 2009-09-21T05:50:03Z
Language en_AU
Research Centre Institute for Integrated and Intelligent Systems
Faculty Faculty of Engineering and Information Technology
Subject PRE2009-Speech Recognition
Publication Type Conference Publications (Full Written Paper - Refereed)
Publication Type Code e1

