Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra
There are no files associated with this record.
| Title | Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra |
|---|---|
| Author | Alsteris, Leigh; Paliwal, Kuldip Kumar |
| Journal Name | Computer Speech and Language |
| Year Published | 2007 |
| Place of publication | United Kingdom |
| Publisher | Academic Press |
| Abstract | In this paper, we consider the topic of iterative, one dimensional, signal reconstruction (specifically speech signals) from the magnitude spectrum and the phase spectrum. While this topic has been extensively researched and documented, we wish to recast some well-established results for the benefit of new researchers and those who desire a short, yet comprehensive, review of the subject. The three main points of the review are: (i) a signal can be reconstructed to within a scale factor from its phase spectrum, (ii) a signal cannot be reconstructed to within a scale factor from its magnitude spectrum, and (iii) a signal can be reconstructed to within a scale factor from its magnitude spectrum when the phase-sign (i.e., one bit of phase spectrum information) is known. Through a number of illustrative examples, we first demonstrate how the algorithms work when the spectral information is determined over the entire duration of the signal. We then demonstrate that the algorithms are equally valid for reconstruction of a signal from the spectra obtained from short-time segments. In addition, we present the results of some further experimentation in which we have attempted to reconstruct a speech signal from only partial phase spectrum information (in the absence of all magnitude spectrum information). We make the following observations: (i) intelligible signal reconstruction (albeit noisy) is possible from knowledge of only the phase spectrum sign information, (ii) an intelligible signal cannot be reconstructed from knowledge of only the phase spectrum frequency-derivative or only the phase spectrum time-derivative, and (iii) an intelligible signal can be reconstructed from the combined knowledge of both the phase spectrum frequency-derivative and time-derivative. |
| Peer Reviewed | Yes |
| Published | Yes |
| Publisher URI | http://www.elsevier.com/wps/find/journaldescription.cws_home/622808/description#description |
| Alternative URI | http://dx.doi.org/10.1016/j.csl.2006.03.001 |
| Volume | 21 |
| Page from | 174 |
| Page to | 186 |
| ISSN | 0885-2308 |
| Date Accessioned | 2008-01-22 |
| Date Available | 2009-09-21T05:50:17Z |
| Language | en_AU |
| Research Centre | Institute for Integrated and Intelligent Systems |
| Faculty | Faculty of Science, Environment, Engineering and Technology |
| Subject | PRE2009-Speech Recognition |
| URI | http://hdl.handle.net/10072/17414 |
| Publication Type | Journal Articles (Refereed Article) |
| Publication Type Code | c1 |
Please use this identifier to cite this record: http://hdl.handle.net/10072/17414
Griffith University copyright notice
Copyright in individual works within the repository belongs to their authors or publishers. You may make a print or digital copy of a work for your personal non-commercial use. All other rights are reserved, except for fair dealings or other user rights granted by the copyright laws of your country.
Back to top