Modulation Domain Spectral Subtraction for Speech Enhancement
| File | Size | Format |
|---|---|---|
| 61446_1.pdf | 5352 KB | Adobe PDF |
| Title | Modulation Domain Spectral Subtraction for Speech Enhancement |
|---|---|
| Author | Paliwal, Kuldip Kumar; Schwerin, Belinda Marie; Wojcicki, Kamil |
| Publication Title | Proceedings of Interspeech 2009 |
| Editor | Maria Uther, Roger Moore, Stephen Cox |
| Year Published | 2009 |
| Place of publication | Australia |
| Publisher | Causal Productions |
| Abstract | In this paper we investigate the modulation domain as an alternative to the acoustic domain for speech enhancement. More specifically, we wish to determine how competitive the modulation domain is for spectral subtraction as compared to the acoustic domain. For this purpose, we extend the traditional analysis-modification-synthesis framework to include modulation domain processing. We then compensate the noisy modulation spectrum for additive noise distortion by applying the spectral subtraction algorithm in the modulation domain. Using subjective listening tests and objective speech quality evaluation we show that the proposed method results in improved speech quality. Furthermore, applying spectral subtraction in the modulation domain does not introduce the musical noise artifacts that are typically present after acoustic domain spectral subtraction. The proposed method also achieves better background noise reduction than the MMSE method. |
| Index Terms | speech enhancement, spectral subtraction, modulation domain, analysis-modification-synthesis (AMS) |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://maxwell.me.gu.edu.au/spl/publications/papers/is09_kkp_mod.pdf |
| Copyright Statement | Copyright 2009 ISCA and the Authors. The attached file is reproduced here in accordance with the copyright policy of the publisher. For information about this conference please refer to the conference’s website or contact the authors. |
| ISSN | 1990-9772 |
| Conference name | Annual Conference of the International Speech Communication Association |
| Location | Brighton, United Kingdom |
| Date From | 2009-09-06 |
| Date To | 2009-09-10 |
| URI | http://hdl.handle.net/10072/30777 |
| Date Accessioned | 2010-03-10 |
| Date Available | 2013-05-16T04:43:30Z |
| Language | en_US |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Science, Environment, Engineering and Technology |
| Subject | Signal Processing |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/30777
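
The abstract describes extending the analysis-modification-synthesis (AMS) framework with a modulation-domain stage and applying spectral subtraction there. The Python sketch below illustrates that general idea only; it is not the authors' implementation. Everything specific in it is an assumption made for illustration and is not taken from this record: the function names, the Hann windows, the frame lengths and shifts, the power-domain subtraction rule with a spectral floor, the reuse of the noisy phase in both domains, and the noise estimate, which is taken from the first few modulation frames on the assumption that the recording begins with noise only.

```python
import numpy as np

def stft(x, frame_len, hop):
    """Hann-windowed short-time Fourier transform of a real 1-D signal.
    Returns an array of shape (frames, frame_len // 2 + 1)."""
    win = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] * win
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(spec, frame_len, hop, length):
    """Weighted overlap-add inverse of stft(), normalised by the summed
    squared window. Samples not covered by any frame stay at zero."""
    win = np.hanning(frame_len)
    frames = np.fft.irfft(spec, n=frame_len, axis=1) * win
    out = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        out[i * hop:i * hop + frame_len] += frame
        norm[i * hop:i * hop + frame_len] += win ** 2
    return out / np.maximum(norm, 1e-8)

def modulation_spectral_subtraction(noisy, a_len=256, a_hop=64,
                                    m_len=32, m_hop=4,
                                    noise_frames=4, floor=0.002):
    """Hypothetical sketch: spectral subtraction applied to the modulation
    spectrum of each acoustic frequency bin. All defaults are assumed."""
    # Acoustic-domain analysis: magnitude and phase per frame and bin.
    X = stft(noisy, a_len, a_hop)
    mag, phase = np.abs(X), np.angle(X)
    n_frames, n_bins = mag.shape
    enhanced_mag = np.zeros_like(mag)

    for k in range(n_bins):
        # Modulation-domain analysis: second STFT along the time
        # trajectory of the acoustic magnitude in bin k.
        M = stft(mag[:, k], m_len, m_hop)
        m_mag, m_phase = np.abs(M), np.angle(M)

        # Assumed noise estimate: mean modulation magnitude over the
        # leading frames, which are taken to contain noise only.
        noise = m_mag[:noise_frames].mean(axis=0)

        # Power spectral subtraction with a spectral floor.
        clean = np.sqrt(np.maximum(m_mag ** 2 - noise ** 2,
                                   floor * noise ** 2))

        # Modulation-domain synthesis with the noisy modulation phase;
        # negative magnitudes are clipped to zero.
        traj = istft(clean * np.exp(1j * m_phase), m_len, m_hop, n_frames)
        enhanced_mag[:, k] = np.maximum(traj, 0.0)

    # Acoustic-domain synthesis with the noisy acoustic phase.
    return istft(enhanced_mag * np.exp(1j * phase), a_len, a_hop, len(noisy))
```

Calling `modulation_spectral_subtraction(noisy)` on a 1-D NumPy array returns an array of the same length. The default frame sizes are expressed in samples (acoustic stage) and in acoustic frames (modulation stage), and would need to be chosen for the sampling rate in use.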