Using the Standard P.862 to Compare the Quality of Low-Bitrate Vocoders

Authors

  • A. V. Korobeynikov Kalashnikov ISTU
  • M. A. Boyarshinov Kalashnikov ISTU
  • A. I. Nistyuk Kalashnikov ISTU
  • V. N. Emelianov Kalashnikov ISTU

DOI:

https://doi.org/10.22213/2410-9304-2018-4-109-113

Keywords:

speech compression, low-bitrate vocoders, quality assessment, standard P.862

Abstract

The objective methods for assessing the speech signal quality are considered: 1) Perceptual Evaluation of Speech (PESQ, ITU-T Rec. P.862), 2) Listening Quality Objective (LQO, ITU-T Rec. P.800.1). A brief description and work schemes of the PESQ technique and formulas for converting Raw MOS estimates to MOS-LQO and back are given. Low-bitrate vocoders were chosen for testing: 1) MELPe, 2) Speex, 3) Codec2. Testing of vocoders was performed on bit speeds from 700 to 4800 bps. For testing we used audio files of articulation tables with 20 records (wav, 8000 KHz, 16 bit, mono). As a result of testing, tables and graphs for Raw MOS and MOS-LQO estimates of the selected vocoders were built. When analyzing the experiments results, the conclusion is made about the effectiveness of objective methods of speech quality assessment. MELPe was identified as a promising vocoder for further development, providing at bit rates of 1200 and 2400 bps MOS quality assessment respectively 2.9...3.2 and 3.0...3.3. Speex vocoder showed comparable with MELPe evaluation results at a higher bitrate (4800 bps). Codec2 vocoder showed lower evaluation results than MELPe.

Author Biographies

A. V. Korobeynikov, Kalashnikov ISTU

PhD in Engineering, Associate Professor

M. A. Boyarshinov, Kalashnikov ISTU

PhD in Engineering, Associate Professor

A. I. Nistyuk, Kalashnikov ISTU

DSc in Engineering, Professor

V. N. Emelianov, Kalashnikov ISTU

PhD in Engineering

References

Тестирование цифровых микросхем и программирование стендового оборудования «Formula 2k» для измерения параметров / А. Н. Копысов, Р. А. Хатбуллин, В. В. Хворенков, Ф. М. Ермаков, К. А. Зырянов // Интеллектуальные системы в производстве. 2017. Т. 15, № 4. C. 29-34. DOI 10.22213/ 2410-9304-2017-4-29-34.

ГОСТ 50840-95. Передача речи по трактам связи. Методы оценки качества, разборчивости и узнаваемости.

ITU-T Rec. P.862: Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs. Available at: http://www.itu.int/ rec/T-REC-P.862 (accessed 01.11.2018).

Там же.

ITU-T Rec. P.800: Methods for subjective determination of transmission quality. Available at: http://www.itu.int/rec/T-REC-P.800 (accessed 01.11.2018).

Audio File Format Specifications. WAVE or RIFF WAVE sound file. Available at: http://www-mmsp.ece.mcgill.ca/Documents/AudioFormats/WAVE/ WAVE.html (accessed 01.11.2018).

MELPe - Enhanced Mixed-Excitation Linear Predictive Vocoder. Available at: http://melpe.org/ (accessed 01.11.2018).

Standard: NATO - STANAG 4591. The 600 bit/s, 1200 bit/s and 2400 bit/s NATO interoperable narrow band voice coder. Available at: https://standards.globalspec.com/std/1664099/natostanag-4591 (accessed 01.11.2018).

Speex: A Free Codec For Free Speech. Available at: https://www.speex.org/ (accessed 01.11.2018).

Standard: ISO/IEC 14496-3. Information technology - Coding of audio-visual objects. Part 3: Audio amendment 4: New levels for AAC profiles technical corrigendum 1. Available at: https://standards.globalspec.com/ std/9907734/iso-iec-14496-3) (accessed 01.11.2018).

Codec2. Available at: http://www.rowetel.com/ ?page_id=452 (accessed 01.11.2018).

Published

25.02.2019

How to Cite

Korobeynikov А. В., Boyarshinov М. А., Nistyuk А. И., & Emelianov В. Н. (2019). Using the Standard P.862 to Compare the Quality of Low-Bitrate Vocoders. Intellekt. Sist. Proizv., 16(4), 109–113. https://doi.org/10.22213/2410-9304-2018-4-109-113

Issue

Section

Articles