Mobile QR Code QR CODE
Title Wideband Speech Codec Algorithm based on Compressed Sensing and Fractional Calculus
Authors (Xiuhuan Wang)
DOI https://doi.org/10.5573/IEIESPC.2024.13.2.176
Page pp.176-185
ISSN 2287-5255
Keywords Calculus; Broadband voice; Encoding and decoding; Compressed sensing; Wavelet transform; Speech synthesis
Abstract Broadband speech encoding and decoding methods are important for achieving high-quality speech communication and audio applications. However, encoding and decoding algorithms often face problems such as large data-transmission volume and high computational complexity. To overcome these problems, a wideband speech codec algorithm is proposed based on compressed sensing and fractional calculus. Compressed sensing theory was used to sparsely represent wideband speech signals. The concept and method of fractional calculus are introduced to analyze and process wideband speech signals. Algebraic codebooks were used to adapt the structure and bit allocation of speech based on its different states and actual encoding and decoding rates. Embedded encoding and decoding of wideband speech can be achieved by adding and generating digital book pulses layer by layer. The results show that the proposed algorithm has a minimum encoding and decoding rate of 6.9 bits/s and a speech quality score of over 4.0. It also has low latency and high speed for speech encoding and decoding and provides high-quality speech evaluation. It has clear advantages in speech quality and data transmission efficiency. This study could provide new ideas and methods for further research and application of broadband speech coding and decoding.