Using the Standard P.862 to Compare the Quality of Low-Bitrate Vocoders
DOI:
https://doi.org/10.22213/2410-9304-2018-4-109-113
Keywords:
speech compression, low-bitrate vocoders, quality assessment, standard P.862
Abstract
Two objective methods for assessing speech signal quality are considered: 1) Perceptual Evaluation of Speech Quality (PESQ, ITU-T Rec. P.862), 2) Listening Quality Objective (LQO, ITU-T Rec. P.800.1). A brief description of the PESQ technique and its processing schemes is given, together with the formulas for converting Raw MOS scores to MOS-LQO and back. Three low-bitrate vocoders were chosen for testing: 1) MELPe, 2) Speex, 3) Codec2. The vocoders were tested at bit rates from 700 to 4800 bps. The test material consisted of audio files of articulation tables with 20 recordings (WAV, 8000 Hz, 16 bit, mono). From the test results, tables and graphs of the Raw MOS and MOS-LQO scores of the selected vocoders were built. Based on an analysis of the experimental results, a conclusion is drawn about the effectiveness of objective methods of speech quality assessment. MELPe was identified as a promising vocoder for further development, achieving MOS scores of 2.9 to 3.2 at 1200 bps and 3.0 to 3.3 at 2400 bps. The Speex vocoder showed results comparable to MELPe only at a higher bit rate (4800 bps). The Codec2 vocoder showed lower scores than MELPe.
References
Testing of digital microcircuits and programming of the "Formula 2k" bench equipment for parameter measurement / A. N. Kopysov, R. A. Khatbullin, V. V. Khvorenkov, F. M. Ermakov, K. A. Zyryanov // Intelligent Systems in Manufacturing. 2017. Vol. 15, no. 4. pp. 29-34. DOI 10.22213/2410-9304-2017-4-29-34.
GOST 50840-95. Speech transmission over communication paths. Methods for assessing quality, intelligibility and recognizability.
ITU-T Rec. P.862: Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs. Available at: http://www.itu.int/rec/T-REC-P.862 (accessed 01.11.2018).
Ibid.
ITU-T Rec. P.800: Methods for subjective determination of transmission quality. Available at: http://www.itu.int/rec/T-REC-P.800 (accessed 01.11.2018).
Audio File Format Specifications. WAVE or RIFF WAVE sound file. Available at: http://www-mmsp.ece.mcgill.ca/Documents/AudioFormats/WAVE/WAVE.html (accessed 01.11.2018).
MELPe - Enhanced Mixed-Excitation Linear Predictive Vocoder. Available at: http://melpe.org/ (accessed 01.11.2018).
Standard: NATO - STANAG 4591. The 600 bit/s, 1200 bit/s and 2400 bit/s NATO interoperable narrow band voice coder. Available at: https://standards.globalspec.com/std/1664099/natostanag-4591 (accessed 01.11.2018).
Speex: A Free Codec For Free Speech. Available at: https://www.speex.org/ (accessed 01.11.2018).
Standard: ISO/IEC 14496-3. Information technology - Coding of audio-visual objects. Part 3: Audio amendment 4: New levels for AAC profiles technical corrigendum 1. Available at: https://standards.globalspec.com/std/9907734/iso-iec-14496-3 (accessed 01.11.2018).
Codec2. Available at: http://www.rowetel.com/?page_id=452 (accessed 01.11.2018).