Multi-frame GMM-based block quantisation of line spectral frequencies

There are no files associated with this record.

Title Multi-frame GMM-based block quantisation of line spectral frequencies
Author So, Stephen; Paliwal, Kuldip Kumar
Journal Name Speech Communication
Year Published 2005
Place of publication Amsterdam, The Netherlands
Publisher Elsevier
Abstract In this paper, we investigate the use of the Gaussian mixture model-based block quantiser for coding line spectral frequencies that uses multiple frames and mean squared error as the quantiser selection criterion. As a viable alternative to vector quantisers, the GMM-based block quantiser encompasses both low computational and memory requirements as well as bitrate scalability. Jointly quantising multiple frames allows the exploitation of correlation across successive frames which leads to more efficient block quantisation. The efficiency gained from joint quantisation permits the use of the mean squared error distortion criterion for cluster quantiser selection, rather than the computationally expensive spectral distortion. The distortion performance gains come at the cost of an increase in computational complexity and memory. Experiments on narrowband speech from the TIMIT database demonstrate that the multi-frame GMM-based block quantiser can achieve a spectral distortion of 1 dB at 22 bits/frame, or 21 bits/frame with some added complexity.
Peer Reviewed Yes
Published Yes
Publisher URI http://www.elsevier.com/wps/find/journaldescription.cws_home/505597/description#description
Alternative URI http://dx.doi.org/10.1016/j.specom.2005.02.007
Copyright Statement Copyright 2005 Elsevier : Reproduced in accordance with the copyright policy of the publisher : This journal is available online - use hypertext links.
Volume 47
Page from 265
Page to 276
ISSN 0167-6393
Date Accessioned 2006-02-22
Date Available 2009-09-21T05:49:18Z
Language en_AU
Research Centre Institute for Integrated and Intelligent Systems
Faculty Faculty of Engineering and Information Technology
Subject PRE2009-Signal Processing
URI http://hdl.handle.net/10072/4279
Publication Type Journal Articles (Refereed Article)
Publication Type Code c1

Brief Record

Griffith University copyright notice