Research Output
Statistical normalisation of phase-based feature representation for robust speech recognition
  In earlier work we have proposed a source-filter decomposition of speech through phase-based processing. The decomposition leads to novel speech features that are extracted from the filter component of the phase spectrum. This paper analyses this spectrum and the proposed representation by evaluating statistical properties at various points along the parametrisation pipeline. We show that speech phase spectrum has a bell-shaped distribution which is in contrast to the uniform assumption that is usually made. It is demonstrated that the uniform density (which implies that the corresponding sequence is least-informative) is an artefact of the phase wrapping and not an original characteristic of this spectrum. In addition, we extend the idea of statistical normalisation usually applied for the magnitudebased features into the phase domain. Based on the statistical structure of the phase-based features, which is shown to be super-gaussian in the clean condition, three normalisation schemes, namely, Gaussianisation, Laplacianisation and table-based histogram equalisation have been applied for improving the robustness. Speech recognition experiments using Aurora-2 show that applying an optimal normalisation scheme at the right stage of the feature extraction process can produce average relative WER reductions of up to 18.6% across the 0-20 dB SNR conditions.

  • Date:

    19 June 2017

  • Publication Status:

    Published

  • Publisher

    IEEE

  • DOI:

    10.1109/icassp.2017.7953170

  • Funders:

    Engineering and Physical Sciences Research Council

Citation

Loweimi, E., Barker, J., & Hain, T. (2017). Statistical normalisation of phase-based feature representation for robust speech recognition. In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). https://doi.org/10.1109/icassp.2017.7953170

Authors

Keywords

phase spectrum, robust speech recognition, phase distribution, statistical normalisation

Monthly Views:

Available Documents