Speech Enhancement Based on Spectral Estimation from Higher-lag Autocorrelation
There are no files associated with this record.
| Title | Speech Enhancement Based on Spectral Estimation from Higher-lag Autocorrelation |
|---|---|
| Author | Shannon, Ben James; Paliwal, Kuldip Kumar; Nadeu, Climent |
| Publication Title | Proceedings of the NInth International Conference on Spoken Language Processing |
| Editor | Richard M. Stern |
| Year Published | 2006 |
| Place of publication | Bonn Germany |
| Publisher | ISCA c/o Institute for Communication, University of Bonn |
| Abstract | In this paper, we propose a unique approach to enhance speech signals that have been corrupted by non-stationary noises. This approach is not based on a spectral subtraction algorithm, but on an algorithm that separates the speech signal and noise signal contributions in the autocorrelation domain. We call this technique the AR-HASE speech enhancement algorithm. In this initial study, we evaluate the performance of the new algorithm using the average PESQ score computed from 10 male utterances and 10 female utterances taken from the TIMIT database as a measure of speech quality. We test the algorithm using one broadband stationary noise and two non-stationary noises. We will show that the AR-HASE enhancement algorithm produces near transparent quality for clean speech, gives poor enhancement performance for broadband stationary noises, and gives significantly enhanced quality for the two nonstationary noises. |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://interspeech2006.org/ |
| Alternative URI | http://www.isca-speech.org/archive/interspeech_2006/ |
| Conference name | Interspeech 2006 |
| Location | Pittsburgh |
| Date From | 2006-09-17 |
| Date To | 2006-09-21 |
| URI | http://hdl.handle.net/10072/13318 |
| Date Accessioned | 2007-03-18 |
| Date Available | 2009-09-21T05:47:35Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Science, Environment, Engineering and Technology |
| Subject | PRE2009-Speech Recognition |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/13318
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top