Importance of window shape for phase-only reconstruction of speech

File Size Format
27935_1.pdf 469Kb Adobe PDF
Title Importance of window shape for phase-only reconstruction of speech
Author Alsteris, Leigh; Paliwal, Kuldip Kumar
Publication Title Acoustics, Speech, and Signal Processing (ICASSP), 2004 IEEE International Conference
Editor Douglas O'Shaughnessy (General Chair)
Year Published 2004
Place of publication Piscataway, N.J.
Publisher IEEE
Abstract The authors recently conducted a human perception experiment [6] to measure the intelligibility of speech stimuli synthesised either from short-time magnitude spectra or short-time phase spectra. The results of the experiment indicate that even for small window durations (of relevance for automatic speech recognition applications), the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis-modification-synthesis parameters are properly selected. This intelligibility is significantly more than that reported by Liu et al. [3], who carried out a similar experiment with the same analysis-modification-synthesis framework. The significant improvement in intelligibility over Liu's results may be attributed to the differences in the parameter settings adopted. In this paper, we review our previous experiment and conduct an additional experiment to determine the contribution that each parameter setting provides towards the intelligibility of stimuli reconstructed from short-time phase spectra. The parameter selection that contributes most to the intelligibility of the phase-only stimuli is that of a rectangular analysis window, as opposed to a Hamming window (which is generally used in speech analysis).
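The phase-only reconstruction described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `phase_only_reconstruction`, the frame length, the hop size, and the use of plain overlap-add synthesis are all assumptions made for illustration, since the record does not state the parameter settings. The sketch windows each frame, keeps only the phase of its spectrum, sets the magnitude to one, and overlap-adds the inverse transforms.

```python
import numpy as np


def phase_only_reconstruction(x, frame_len=256, hop=32, window="rectangular"):
    """Rebuild a signal from short-time phase spectra with unit magnitude.

    All parameter defaults are illustrative assumptions, not values
    taken from the paper.
    """
    if window == "rectangular":
        w = np.ones(frame_len)
    elif window == "hamming":
        w = np.hamming(frame_len)
    else:
        raise ValueError(f"unknown window: {window}")

    x = np.asarray(x, dtype=float)
    y = np.zeros(len(x))
    for start in range(0, len(x) - frame_len + 1, hop):
        # Analysis: window the frame and take its spectrum.
        frame = x[start:start + frame_len] * w
        spectrum = np.fft.rfft(frame)
        # Modification: discard the magnitude, keep only the phase.
        unit_mag = np.exp(1j * np.angle(spectrum))
        # Synthesis: inverse transform and overlap-add.
        y[start:start + frame_len] += np.fft.irfft(unit_mag, n=frame_len)
    return y


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    signal = rng.standard_normal(4000)  # stand-in for a speech waveform
    y_rect = phase_only_reconstruction(signal, window="rectangular")
    y_hamm = phase_only_reconstruction(signal, window="hamming")
    print(y_rect.shape, y_hamm.shape)
```

Switching the `window` argument between `"rectangular"` and `"hamming"` changes only the analysis window, which is the parameter the abstract identifies as contributing most to the intelligibility of phase-only stimuli. Comparing the two outputs by listening would require real speech input rather than the random stand-in signal used here.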
Peer Reviewed Yes
Published Yes
Publisher URI http://ieeexplore.ieee.org/servlet/opac?punumber=9248
Alternative URI http://dx.doi.org/10.1109/ICASSP.2004.1326050
Copyright Statement Copyright 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
ISBN 0-7803-8484-9
Conference name 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing
Location Montreal, Canada
Date From 2004-05-17
Date To 2004-05-21
URI http://hdl.handle.net/10072/2119
Date Accessioned 2005-03-31
Date Available 2009-09-18T07:40:49Z
Language en_AU
Research Centre Institute for Integrated and Intelligent Systems
Faculty Faculty of Engineering and Information Technology
Subject PRE2009-Speech Recognition
Publication Type Conference Publications (Full Written Paper - Refereed)
Publication Type Code e1