dc.contributor.convenor | Douglas O'Shaughnessy (General Chair) | |
dc.contributor.author | Alsteris, LD | |
dc.contributor.author | Paliwal, KK | |
dc.contributor.editor | Douglas O'Shaughnessy (General Chair) | |
dc.date.accessioned | 2017-05-03T13:01:00Z | |
dc.date.available | 2017-05-03T13:01:00Z | |
dc.date.issued | 2004 | |
dc.date.modified | 2009-09-18T07:40:49Z | |
dc.identifier.isbn | 0-7803-8484-9 | |
dc.identifier.issn | 1520-6149 | |
dc.identifier.doi | 10.1109/ICASSP.2004.1326050 | |
dc.identifier.uri | http://hdl.handle.net/10072/2119 | |
dc.description.abstract | The authors recently conducted a human perception experiment [6] to measure the intelligibility of speech stimuli synthesised either from short-time magnitude spectra or short-time phase spectra. The results of the experiment indicate that even for small window durations (of relevance for automatic speech recognition applications), the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis-modificationsynthesis parameters are properly selected. This intelligibility is significantly more than that reported by Liu et al. [3], who carried out a similar experiment with the same analysis-modificationsynthesis framework. The significant improvement in intelligibility over Liu's results may be attributed to the differences in the parameter settings adopted. In this paper, we review our previous experiment and conduct an additional experiment to determine the contribution that each parameter setting provides towards the intelligibility of stimuli reconstructed from short-time phase spectra. The parameter selection that contributes most to the intelligibility of the phase-only stimuli is that of a rectangular analysis window, as opposed to a Hamming window (which is generally used in speech analysis). | |
dc.description.peerreviewed | Yes | |
dc.description.publicationstatus | Yes | |
dc.format.extent | 23400 bytes | |
dc.format.extent | 480868 bytes | |
dc.format.mimetype | text/plain | |
dc.format.mimetype | application/pdf | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | IEEE | |
dc.publisher.place | Piscataway, N.J. | |
dc.publisher.uri | http://ieeexplore.ieee.org/servlet/opac?punumber=9248 | |
dc.relation.ispartof | 0 | |
dc.relation.ispartofconferencename | IEEE International Conference on Acoustics, Speech, and Signal Processing | |
dc.relation.ispartofconferencetitle | 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS | |
dc.relation.ispartofdatefrom | 2004-05-17 | |
dc.relation.ispartofdateto | 2004-05-21 | |
dc.relation.ispartoflocation | Montreal, CANADA | |
dc.relation.ispartofpagefrom | 573 | |
dc.relation.ispartofpageto | 576 | |
dc.relation.ispartofvolume | 1 | |
dc.subject.fieldofresearchcode | 280206 | |
dc.title | Importance of window shape for phase-only reconstruction of speech | |
dc.type | Conference output | |
dc.type.description | E1 - Conferences | |
dc.type.code | E - Conference Publications | |
gro.faculty | Griffith Sciences, Griffith School of Engineering | |
gro.rights.copyright | © 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. | |
gro.date.issued | 2004 | |
gro.hasfulltext | Full Text | |
gro.griffith.author | Paliwal, Kuldip K. | |