Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition
| File | Size | Format |
|---|---|---|
| 32206.pdf | 527Kb | Adobe PDF |
| Title | Spectral estimation using higher-lag autocorrelation coefficients with applications to speech recognition |
|---|---|
| Author | Shannon, Ben James; Paliwal, Kuldip Kumar |
| Publication Title | The 8th International Symposium on Signal Processing and Its Applications (ISSPA-2005) |
| Editor | B. Boashash |
| Year Published | 2005 |
| Place of publication | Wollongong, NSW, Australia |
| Publisher | IEEE |
| Abstract | In this paper, we introduce a noise robust spectral estimation technique for speech signals that is derived from a windowed one-sided higher-lag autocorrelation sequence. We also introduce a new high dynamic range window design method, and utilise both techniques in a modified Mel Frequency Cepstral Coefficient (MFCC) algorithm to produce noise robust speech recognition features. We call the new features Autocorrelation Mel Frequency Cepstral Coefficients (AMFCCs). We compare the recognition performance of AMFCCs to MFCCs for a range of stationary and non-stationary noises on the Aurora II database. We show that the AMFCC features perform as well as MFCCs in clean conditions and have higher noise robustness in noisy conditions. |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://ieeexplore.ieee.org/servlet/opac?punumber=10550 |
| Alternative URI | http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=1581009 |
| Copyright Statement | Copyright 2005 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
| ISBN | 0780392442 |
| Conference name | The 8th International Symposium on Signal Processing and Its Applications |
| Location | Sydney, Australia |
| Date From | 2005-08-28 |
| Date To | 2005-08-31 |
| URI | http://hdl.handle.net/10072/2576 |
| Date Accessioned | 2006-02-22 |
| Date Available | 2009-09-21T05:51:37Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Engineering and Information Technology |
| Subject | PRE2009-Speech Recognition |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/2576
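
The abstract describes a spectral estimate derived from a windowed one-sided higher-lag autocorrelation sequence. The following is a minimal Python sketch of that general idea, not the authors' implementation. This record does not give the lag cutoff or the window design, so the function name `higher_lag_autocorr_spectrum`, the `min_lag` and `n_fft` parameters, and the test signal are all illustrative assumptions, and a plain Hamming window stands in for the paper's high dynamic range window. One commonly cited motivation, stated here as background rather than as the paper's argument, is that the autocorrelation of white noise is concentrated near lag zero, so discarding the lowest lags removes much of its contribution.

```python
import numpy as np


def higher_lag_autocorr_spectrum(frame, min_lag, n_fft=512):
    """Magnitude spectrum from the windowed one-sided higher-lag autocorrelation.

    Hypothetical helper: min_lag and n_fft are illustrative parameters,
    not values taken from the paper.
    """
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Biased autocorrelation estimate; keep the one-sided part (lags 0..n-1).
    full = np.correlate(frame, frame, mode="full")
    acf = full[n - 1:] / n
    # Discard the lower lags and keep only lags >= min_lag.
    higher = acf[min_lag:]
    # Assumption: a Hamming window as a stand-in for the paper's
    # high dynamic range window, whose design is not given in this record.
    window = np.hamming(len(higher))
    # Magnitude of the DFT of the windowed higher-lag sequence.
    return np.abs(np.fft.rfft(higher * window, n=n_fft))


# Illustrative input: a 1 kHz sinusoid in white noise, sampled at 8 kHz.
fs = 8000
n_fft = 512
rng = np.random.default_rng(0)
t = np.arange(256) / fs
frame = np.sin(2 * np.pi * 1000.0 * t) + 0.5 * rng.standard_normal(t.size)

spectrum = higher_lag_autocorr_spectrum(frame, min_lag=16, n_fft=n_fft)
peak_hz = np.argmax(spectrum) * fs / n_fft
print(f"Spectral peak near {peak_hz:.1f} Hz")
```

In the AMFCC pipeline described in the abstract, an estimate of this kind would replace the usual spectrum computation inside the MFCC algorithm; the Mel filterbank and cepstral steps are not sketched here.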
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.