Influence of autocorrelation lag ranges on robust speech recognition
| File | Size | Format | |
|---|---|---|---|
| 32182_1.pdf | 236Kb | Adobe PDF | View |
| Title | Influence of autocorrelation lag ranges on robust speech recognition |
|---|---|
| Author | Shannon, Ben James; Paliwal, Kuldip Kumar |
| Publication Title | ICASSP 2005 |
| Editor | Kenneth Barner |
| Year Published | 2005 |
| Place of publication | Piscataway, N.J., USA |
| Publisher | IEEE |
| Abstract | It is generally believed that the lower-lag autocorrelation coefficients carry information about the spectral envelop and the higher-lag autocorrelation coefficients are more related to pitch information. In this paper, we use lower-lag and higher-lag ranges of the autocorrelation function separately for deriving speech recognition features, and investigate their role in terms of speech recognition performance. The state-of-the-art MFCC features use the whole autocorrelation function in their computation and are used here as a benchmark in our experiments. Our recognition results from the Aurora II corpus show that the higher-lag autocorrelation coefficients perform as well as the whole autocorrelation function for clean speech, and provide better performance for noisy speech, while lower-lag autocorrelation coefficients are not as effective in this aspect. |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://ieeexplore.ieee.org/servlet/opac?punumber=9711 |
| Alternative URI | http://dx.doi.org/10.1109/ICASSP.2005.1415171 |
| Copyright Statement | Copyright 2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
| Conference name | IEEE International Conference on Acoustics, Speech, and Signal Processing |
| Location | Philadelphia, USA |
| Date From | 2005-03-18 |
| Date To | 2005-03-23 |
| URI | http://hdl.handle.net/10072/2575 |
| Date Accessioned | 2006-02-22 |
| Date Available | 2009-09-21T05:48:20Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Engineering and Information Technology |
| Subject | PRE2009-Speech Recognition |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/2575
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top