Abstract
In this paper, a new model-based algorithm for optimizing the MPEG-Advanced Audio Coder (AAC) in MS-stereo mode is presented. This algorithm is an extension to stereo signals of prior work on a statistical model of quantization noise. Traditionally, MS-stereo coding approaches replace the Left (L) and Right (R) channels by the Middle (M) and Sides (S) channels, each channel being independently processed, almost like a monophonic signal. In contrast, our method proposes a global approach for coding both channels in the same process.Amodel for the quantization error allows us to tune the quantizers on channelsMand S with respect to a distortion constraint on the reconstructed channels L and R as they will appear in the Dec.oder. This approach leads to a more efficient perceptual noise-shaping and avoids using complex psychoacoustic models built on theMand S channels. Furthermore, it provides a straightforward scheme to choose between LR and MS modes in each subband for each frame. Subjective listening tests prove that the coding efficiency at a medium bitrate (96 kbits/s for both channels) is significantly better with our algorithm than with the standard algorithm, without increase of complexity.
| Original language | English |
|---|---|
| Pages (from-to) | 1373-1382 |
| Number of pages | 10 |
| Journal | IEEE Transactions on Audio, Speech and Language Processing |
| Volume | 16 |
| Issue number | 8 |
| DOIs | |
| Publication status | Published - 1 Nov 2008 |
| Externally published | Yes |
Keywords
- Bitrate constraint
- Distortion constraint
- Mpeg-advanced audio coder (aac)
- Ms-stereo
- Optimization algorithm
- Perceptual audio coding
- Quantization
- Scalefactor
- Statistical model