On optimal modelling of speech spectral transitions

Athaudage, C and Lech, M 2003, 'On optimal modelling of speech spectral transitions', in 4th International Conference on Information, Communications and Signal Processing, Singapore, 15-18 December 2003.


Document type: Conference Paper
Collection: Conference Papers

Title On optimal modelling of speech spectral transitions
Author(s) Athaudage, C
Lech, M
Year 2003
Conference name ICICS - PCM
Conference location Singapore
Conference dates 15-18 December 2003
Proceedings title 4th International Conference on Information, Communications and Signal Processing
Publisher IEEE
Place of publication Singapore
Abstract In this paper, we propose an optimal spectral transition modelling technique for speech. The proposed technique optimizes the spectral interpolation trajectory by minimizing the mean-square-error of spectral parameters on a frame-by-frame basis. The performance of the proposed techniques is compared with that of two spectral interpolation techniques, namely the linear interpolation and the Gaussian interpolation, reported in literature. Line spectral frequencies are used as the short-term spectral parameter representation of the speech signal. The regions between maximally stable (stationary) frames in the spectral parameter sequence are identified as the regions of spectral transitions. Numerical results show that both linear and Gaussian interpolation techniques have similar modelling performance in terms of average spectral distortion. The proposed optimal technique shows an improved modelling accuracy in terms of average spectral distortion (up to 1 dB improvement), in comparison to that of the linear and Gaussian techniques. The proposed technique can be useful for speech processing applications such as coding and recognition.
Subjects Information Systems not elsewhere classified
DOI - identifier 10.1109/ICICS.2003.1292680
Copyright notice © 2003 IEEE
Versions
Version Filter Type
Altmetric details:
Access Statistics: 146 Abstract Views  -  Detailed Statistics
Created: Mon, 09 Aug 2010, 09:40:26 EST by Catalyst Administrator
© 2014 RMIT Research Repository • Powered by Fez SoftwareContact us