Fast converging iterative Kalman filtering for speech enhancement using long and overlapped tapered windows with large side lobe attenuation
| File | Size | Format | |
|---|---|---|---|
| 65039_1.pdf | 637Kb | Adobe PDF | View |
| Title | Fast converging iterative Kalman filtering for speech enhancement using long and overlapped tapered windows with large side lobe attenuation |
|---|---|
| Author | So, Stephen; Paliwal, Kuldip Kumar |
| Publication Title | Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010) |
| Editor | Satoshi Nakamura |
| Year Published | 2010 |
| Publisher | International Speech Communication Association (ISCA) |
| Abstract | In this paper, we propose an iterative Kalman filtering scheme that has faster convergence and introduces less residual noise, when compared with the iterative scheme of Gibson, et al. This is achieved via the use of long and overlapped frames as well as using a tapered window with a large side lobe attenuation for linear prediction analysis. We show that the Dolph-Chebychev window with a -200 dB side lobe attenuation tends to enhance the dynamic range of the formant structure of speech corrupted with white noise, reduce prediction error variance bias, as well as provide for some spectral smoothing, while the long overlapped frames provide for reliable autocorrelation estimates and temporal smoothing. Speech enhancement experiments on the NOIZEUS corpus show that the proposed method outperformed conventional iterative and non-iterative Kalman filters as well as other enhancement methods such as MMSE-STSA and PSC. |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://www.isca-speech.org/iscaweb/ |
| Alternative URI | http://www.interspeech2010.org |
| Copyright Statement | Copyright 2010 ISCA and the Authors. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher. For information about this conference please refer to the conference's website or contact the authors. |
| ISBN | 1990-9772 |
| Conference name | Interspeech 2010 |
| Location | Makuhari, Japan |
| Date From | 2010-09-26 |
| Date To | 2010-09-30 |
| URI | http://hdl.handle.net/10072/36158 |
| Date Accessioned | 2010-10-19 |
| Date Available | 2011-02-14T09:13:53Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Science, Environment, Engineering and Technology |
| Subject | Signal Processing |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/36158
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top