Single-channel speech enhancement using spectral subtraction in the short-time modulation domain
There are no files associated with this record.
| Title | Single-channel speech enhancement using spectral subtraction in the short-time modulation domain |
|---|---|
| Author | Paliwal, Kuldip Kumar; Wojcicki, Kamil; Schwerin, Belinda Marie |
| Journal Name | Speech Communication |
| Year Published | 2010 |
| Place of publication | Netherlands |
| Publisher | Elsevier BV |
| Abstract | In this paper we investigate the modulation domain as an alternative to the acoustic domain for speech enhancement. More specifically, we wish to determine how competitive the modulation domain is for spectral subtraction as compared to the acoustic domain. For this purpose, we extend the traditional analysis-modification-synthesis framework to include modulation domain processing. We then compensate the noisy modulation spectrum for additive noise distortion by applying the spectral subtraction algorithm in the modulation domain. Using an objective speech quality measure as well as formal subjective listening tests, we show that the proposed method results in improved speech quality. Furthermore, the proposed method achieves better noise suppression than the MMSE method. In this study, the effect of modulation frame duration on speech quality of the proposed enhancement method is also investigated. The results indicate that modulation frame durations of 180–280 ms, provide a good compromise between different types of spectral distortions, namely musical noise and temporal slurring. Thus given a proper selection of modulation frame duration, the proposed modulation spectral subtraction does not suffer from musical noise artifacts typically associated with acoustic spectral subtraction. In order to achieve further improvements in speech quality, we also propose and investigate fusion of modulation spectral subtraction with the MMSE method. The fusion is performed in the short-time spectral domain by combining the magnitude spectra of the above speech enhancement algorithms. Subjective and objective evaluation of the speech enhancement fusion shows consistent speech quality improvements across input SNRs. |
| Peer Reviewed | Yes |
| Published | Yes |
| Alternative URI | http://dx.doi.org/10.1016/j.specom.2010.02.004 |
| Volume | 52 |
| Issue Number | 5 |
| Page from | 450 |
| Page to | 475 |
| ISSN | 0167-6393 |
| Date Accessioned | 2011-02-25 |
| Date Available | 2011-03-15T08:04:00Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Science, Environment, Engineering and Technology |
| Subject | Signal Processing |
| URI | http://hdl.handle.net/10072/37260 |
| Publication Type | Journal Articles (Refereed Article) |
| Publication Type Code | c1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/37260
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top