Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech
| File | Size | Format | |
|---|---|---|---|
| 48082_1.pdf | 216Kb | Adobe PDF | View |
| Title | Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech |
|---|---|
| Author | Wojcicki, Kamil; Paliwal, Kuldip Kumar |
| Publication Title | ICASSP 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing |
| Editor | K.J. Ray Lu and Todd Reed |
| Year Published | 2007 |
| Abstract | The short-time Fourier transform (STFT) of a speech signal has two components: the short-time magnitude spectrum and the short-time phase spectrum. It is traditionally believed that the short-timemagnitude spectrum plays the dominant role for speech perception at small window durations (20–40ms). However, recent perceptual studies have shown that the short-time phase spectrum can contribute as much to speech intelligibility as the short-time magnitude spectrum. It was observed that the use of the rectangular (non-tapered) analysis window for the computation of the short-time phase spectrum is more advantageous than the use of the Hamming (tapered) analysis window. This paper investigates the effect that the dynamic range of an analysis window has on the intelligibility of speech for phaseonly and magnitude-only stimuli. For this purpose, the Chebyshev analysis window with adjustable equi-ripple side-lobes is employed. Two types of magnitude-only stimuli are investigated: random phase and zero phase. It is shown that the intelligibility of the magnitudeonly stimuli constructed with zero phase is independent of the dynamic range of the analysis window, while the random phase stimuli are intelligible only for analysis windows with high dynamic range. This study also shows that for low dynamic range analysis windows, the short-time phase spectrum at small window durations (20–40ms) contributes as much as to speech intelligibility as the short-time magnitude spectrum. |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://ieeexplore.ieee.org/servlet/opac?punumber=4216989 |
| Alternative URI | http://dx.doi.org/10.1109/ICASSP.2007.367016 |
| Copyright Statement | Copyright 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
| ISBN | 1424407281 |
| Conference name | IEEE International Conference on Acoustics, Speech, and Signal Processing |
| Location | Honolulu, USA |
| Date From | 2007-04-15 |
| Date To | 2007-04-20 |
| URI | http://hdl.handle.net/10072/17416 |
| Date Accessioned | 2008-01-24 |
| Date Available | 2009-09-21T05:48:38Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Science, Environment, Engineering and Technology |
| Subject | PRE2009-Speech Recognition |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/17416
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top