Multi-frame GMM-based block quantisation for distributed speech recognition under noisy conditions

File Size Format
39597.pdf 103Kb Adobe PDF View
Title Multi-frame GMM-based block quantisation for distributed speech recognition under noisy conditions
Author So, Stephen; Paliwal, Kuldip Kumar
Publication Title Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
Editor F. Castanie
Year Published 2006
Publisher IEEE Signal Processing Society
Abstract In this paper, we report on the recognition accuracy of the multiframe GMM-based block quantiser for the coding of MFCC features in a distributed speech recognition framework under varying noise conditions. All experiments were performed using the ETSI Aurora-2 connected-digits recognition task. For comparison, we have also investigated other quantisation schemes such as the memoryless GMM-based block quantiser, the unconstrained vector quantiser, and non-uniform scalar quantisers. The results show that the rate-distortion efficiency of the quantiser is a factor in determining the level of recognition accuracy at low to medium levels of additive noise. For high levels of additive noise, the influence of rate-distortion efficiency diminishes and the recognition accuracy becomes dependent on the recognition features.
Peer Reviewed Yes
Published Yes
Publisher URI http://ieeexplore.ieee.org/servlet/opac?punumber=11024
Alternative URI http://dx.doi.org/10.1109/ICASSP.2006.1659989
Copyright Statement Copyright 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
ISBN 1-4244-0469-X
Conference name IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 2006
Location Toulouse, France
Date From 2006-05-14
Date To 2006-05-19
URI http://hdl.handle.net/10072/12322
Date Accessioned 2007-02-13
Language en_AU
Research Centre Institute for Integrated and Intelligent Systems
Faculty Faculty of Environmental Sciences
Subject PRE2009-Signal Processing; PRE2009-Speech Recognition
Publication Type Conference Publications (Full Written Paper - Refereed)
Publication Type Code e1

Show simple item record

Griffith University copyright notice