Research Output
A new phase-based feature representation for robust speech recognition
  The aim of this paper is to introduce a novel phase-based feature representation for robust speech recognition. This method consists of four main parts: autoregressive (AR) model extraction, group delay function (GDF) computation, compression, and scale information augmentation. Coupling GDF with an AR model results in a high-resolution estimate of the power spectrum with low frequency leakage. The compression step includes two stages similar to MFCC without taking a logarithm of the output energies. The fourth part augments the phase-based feature vector with scale information which is based on the Hilbert transform relations and complements the phase spectrum information. In the presence of additive and convolutional noises, the proposed method has led to 15% and 12% reductions in the averaged error rates, respectively (SNR ranging from 0 to 20 dB), compared to the standard MFCCs.

  • Date:

    21 October 2013

  • Publication Status:

    Published

  • Publisher

    IEEE

  • DOI:

    10.1109/icassp.2013.6639051

  • Funders:

    Historic Funder (pre-Worktribe)

Citation

Loweimi, E., Ahadi, S. M., & Drugman, T. (2013). A new phase-based feature representation for robust speech recognition. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. https://doi.org/10.1109/icassp.2013.6639051

Authors

Monthly Views:

Available Documents