Abstract
It was recently shown that the combination of source prediction, two-times oversampling, and noise shaping, can be used to obtain a robust (multiple-description) audio coding frame- work for networks with packet loss probabilities less than 10%. Specifically, it was shown that audio signals could be encoded into two descriptions (packets), which were separately sent over a communication channel. Each description yields a desired performance by itself, and when they are combined, the performance is improved. This paper extends the previ- ous work to an arbitrary number of descriptions (packets) by using fractional oversampling and a new decoding principle. We demonstrate that, due to source aliasing, existing MSE optimized reconstruction rules from noisy sampled data, performs poorly from a perceptual point of view. A simple reconstruction rule is proposed, that improves the PEAQ objective difference grades (ODG) by more than 2 points. The proposed audio coder enables low- delay high-quality audio streaming on networks with late packet arrivals or packet losses. With a coding delay of 2.5 ms, and a total bitrate of 300 kbps, it is demonstrated that mean PEAQ ODGs around -0.65 can be obtained for 48 kHz (mono) music (pop & rock), and packet loss probabilities of 20%.
Original language | English |
---|---|
Title of host publication | Proceedings - DCC 2021 : 2021 Data Compression Conference |
Editors | Ali Bilgin, Michael W. Marcellin, Joan Serra-Sagrista, James A. Storer |
Number of pages | 10 |
Publisher | IEEE Signal Processing Society |
Publication date | 2021 |
Pages | 273-282 |
Article number | 9418676 |
ISBN (Print) | 978-1-6654-4785-0 |
ISBN (Electronic) | 978-1-6654-0333-7 |
DOIs | |
Publication status | Published - 2021 |
Event | 2021 Data Compression Conference (DCC) - Snowbird, United States Duration: 23 Mar 2021 → 26 Mar 2021 |
Conference
Conference | 2021 Data Compression Conference (DCC) |
---|---|
Country/Territory | United States |
City | Snowbird |
Period | 23/03/2021 → 26/03/2021 |
Series | Data Compression Conference. Proceedings |
---|---|
ISSN | 1068-0314 |
Keywords
- Multiple descriptions
- audio coding
- fractional sampling
- low delay
- noise shaping
- source predictions
Fingerprint
Dive into the research topics of 'Low Delay Robust Audio Coding by Noise Shaping, Fractional Sampling, and Source Prediction'. Together they form a unique fingerprint.Datasets
-
Open source MATLAB implementation of MD_DSQ coder
Østergaard, J. (Creator), VBN, 1 Mar 2021
DOI: 10.5278/b27e5ae6-f5e8-411d-9fcf-a6617143a14d
Dataset
File