Importance of window shape for phase-only reconstruction of speech
| File | Size | Format | |
|---|---|---|---|
| 27935_1.pdf | 469Kb | Adobe PDF | View |
| Title | Importance of window shape for phase-only reconstruction of speech |
|---|---|
| Author | Alsteris, Leigh; Paliwal, Kuldip Kumar |
| Publication Title | Acoustics, Speech, and Signal Processing (ICASSP), 2004 IEEE International Conference |
| Editor | Douglas O'Shaughnessy (General Chair) |
| Year Published | 2004 |
| Place of publication | Piscataway, N.J. |
| Publisher | IEEE |
| Abstract | The authors recently conducted a human perception experiment [6] to measure the intelligibility of speech stimuli synthesised either from short-time magnitude spectra or short-time phase spectra. The results of the experiment indicate that even for small window durations (of relevance for automatic speech recognition applications), the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis-modificationsynthesis parameters are properly selected. This intelligibility is significantly more than that reported by Liu et al. [3], who carried out a similar experiment with the same analysis-modificationsynthesis framework. The significant improvement in intelligibility over Liu's results may be attributed to the differences in the parameter settings adopted. In this paper, we review our previous experiment and conduct an additional experiment to determine the contribution that each parameter setting provides towards the intelligibility of stimuli reconstructed from short-time phase spectra. The parameter selection that contributes most to the intelligibility of the phase-only stimuli is that of a rectangular analysis window, as opposed to a Hamming window (which is generally used in speech analysis). |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://ieeexplore.ieee.org/servlet/opac?punumber=9248 |
| Alternative URI | http://dx.doi.org/10.1109/ICASSP.2004.1326050 |
| Copyright Statement | Copyright 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. |
| ISBN | 0-7803-8484-9 |
| Conference name | 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing |
| Location | Montreal, Canada |
| Date From | 2004-05-17 |
| Date To | 2004-05-21 |
| URI | http://hdl.handle.net/10072/2119 |
| Date Accessioned | 2005-03-31 |
| Date Available | 2009-09-18T07:40:49Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Engineering and Information Technology |
| Subject | PRE2009-Speech Recognition |
| Publication Type | Conference Publications (Full Written Paper - Refereed) |
| Publication Type Code | e1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/2119
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top