Multi-frame GMM-based block quantisation for distributed speech recognition under noisy conditions
| File | Size | Format | |
|---|---|---|---|
| 39597.pdf | 103Kb | Adobe PDF | View |
| Title | Multi-frame GMM-based block quantisation for distributed speech recognition under noisy conditions |
|---|---|
| Author | So, Stephen; Paliwal, Kuldip Kumar |
| Publication Title | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing |
| Editor | F. Castanie |
| Year Published | 2006 |
| Publisher | IEEE Signal Processing Society |
| Abstract | In this paper, we report on the recognition accuracy of the multiframe GMM-based block quantiser for the coding of MFCC features in a distributed speech recognition framework under varying noise conditions. All experiments were performed using the ETSI Aurora-2 connected-digits recognition task. For comparison, we have also investigated other quantisation schemes such as the memoryless GMM-based block quantiser, the unconstrained vector quantiser, and non-uniform scalar quantisers. The results show that the rate-distortion efficiency of the quantiser is a factor in determining the level of recognition accuracy at low to medium levels of additive noise. For high levels of additive noise, the influence of rate-distortion efficiency diminishes and the recognition accuracy becomes dependent on the recognition features. |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://ieeexplore.ieee.org/servlet/opac?punumber=11024 |
| Alternative URI | http://dx.doi.org/10.1109/ICASSP.2006.1659989 |
| Copyright Statement | Copyright 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
| ISBN | 1-4244-0469-X |
| Conference name | IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP 2006 |
| Location | Toulouse, France |
| Date From | 2006-05-14 |
| Date To | 2006-05-19 |
| URI | http://hdl.handle.net/10072/12322 |
| Date Accessioned | 2007-02-13 |
| Date Available | 2009-09-21T05:50:08Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Environmental Sciences |
| Subject | PRE2009-Signal Processing; PRE2009-Speech Recognition |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/12322
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top