Automatic Human Utility Evaluation of ASR Systems: Does WER Really Predict Performance?
We propose an alternative evaluation metric to Word Error Rate (WER) for the decision audit task on meeting recordings, which exemplifies how to evaluate speech recognition within a legitimate application context. Using machine learning trained on an initial seed of human-subject experimental data, our alternative metric handily outperforms WER, which correlates very poorly with human subjects' success in finding decisions given ASR transcripts spanning a range of WERs.

Citation

Favre, B., Cheung, K., Kazemian, S., Lee, A., Liu, Y., Munteanu, C., … Zeller, F. (2013). Automatic Human Utility Evaluation of ASR Systems: Does WER Really Predict Performance? In Proc. Interspeech 2013 (pp. 3463–3467). https://doi.org/10.21437/Interspeech.2013-610
