Difference between revisions of "Joint stereo"

From Hydrogenaudio Knowledgebase
Jump to: navigation, search
(Total rewrite + merge of mid-side stereo and intensity stereo)
(+References section. Pruned external links.)
(12 intermediate revisions by 6 users not shown)
Line 1: Line 1:
{{merge-from|Mid-side stereo}}
+
'''Joint stereo''' is a property of an audio data stream and means that the stream supports more than one method of stereo coding, such as SS ("simple" or "L/R" stereo or DualMono), MS ("mid-side" stereo), or IS ("intensity" stereo).  A joint stereo stream may still only employ a single coding method, but for the sake of efficiency or quality may switch between methods on a frame or even sub-frame basis.
  
'''Joint stereo''' coding methods try to increase the coding efficiency when encoding stereo signals by exploiting commonalities between the left and right channel signal. In the context of [[MP3]] encoding, there are actually 2 totally different method of Joint Stereo: ''Mid-side Stereo'' and ''Intensity Stereo''.
+
For example, a high-[[bitrate]] "joint stereo" [[MP3]] file may contain a mixture of SS and MS frames, or it may contain all SS frames or all MS frames. A non-"joint stereo" MP3 will never contain a mixture of frame types.
  
 +
"Joint stereo coding methods" in prose generally refers to whatever alternatives to simple (L/R) stereo coding are supported by a particular format, even though simple stereo is also an option.
  
 +
==Stereo coding methods or "modes"==
 +
 +
===Left-Right (L/R) or "Simple" Stereo (SS)===
 +
Simple stereo is the most straightforward method of coding a stereo signal: each channel is treated as a completely separate entity. This can be inefficient and may adversely impact quality (as compared to other modes) when both channels contain nearly identical signals (i.e., are mono or nearly so).
 +
 +
===Mid-side Stereo (MS)===
 +
Mid-side stereo coding calculates a "mid"-channel by addition of left and right channel, and a "side"-channel, i.e.:
  
==Mid-side Stereo==
 
'''Mid-side stereo''' coding calculates a "mid"-channel by addition of left and right channel, and a "side"-channel, i.e.:
 
  
 
<center><math>Left = L \qquad Right = R\,</math></center>
 
<center><math>Left = L \qquad Right = R\,</math></center>
Line 16: Line 22:
 
<center> <math>Left=Middle+Side \qquad Right=Middle-Side</math></center>
 
<center> <math>Left=Middle+Side \qquad Right=Middle-Side</math></center>
  
Whenever a signal is concentrated in the middle of the stereo image (i.e. more mono-like), mid-side stereo can achieve a significant saving in [[bitrate]], since one can use less bitrate to encode the side-channel. Even more important is the fact that by applying the inverse matrix in the decoder the quantization noise becomes correlated and falls in the middle of the stereo image where it is masked by the signal.
 
  
Unlike [[Joint stereo#intensity stereo|intensity stereo]] which destroys phase information, mid-side coding keeps the phase information pretty much intact. Correctly implemented mid-side stereo does very little or no damage to the stereo-image and increases compression efficiency either by reducing size or increasing overall quality.
+
Whenever a signal is concentrated in the middle of the stereo image (i.e. more mono-like), mid-side stereo can achieve a significant saving in bitrate, since one can use fewer bits to encode the side-channel. Even more important is the fact that by applying the inverse matrix in the decoder, the quantization noise becomes correlated and falls in the middle of the stereo image, where it is masked by the signal.
  
 +
Unlike [[Joint stereo#intensity stereo|intensity stereo]] which destroys phase information, mid-side coding keeps the phase information pretty much intact. Correctly implemented mid-side stereo does very little or no damage to the stereo image and increases compression efficiency either by reducing size or increasing overall quality.
  
==Intensity Stereo==
+
===Intensity Stereo===
  
'''Intensity stereo''' coding is a method that achieves a saving in bitrate by replacing the left and the right signal by a single representing signal plus directional information. This replacement is psychoacoustically justified in the higher [[frequency]] range since the human auditory system is insensitive to the signal phase at frequencies above approximately 2kHz.
+
Intensity stereo coding is a method that achieves a saving in bitrate by replacing the left and the right signal by a single representing signal plus directional information. This replacement is psychoacoustically justified in the higher [[frequency]] range since the human auditory system is insensitive to the signal phase at frequencies above approximately 2 kHz.<ref>http://www.hydrogenaudio.org/forums/index.php?showtopic=1491&view=findpost&p=14091</ref>
  
 
Intensity stereo is by definition a [[lossy]] coding method thus it is primarily useful at low bitrates. For coding at higher bitrates only mid-side stereo should be used.
 
Intensity stereo is by definition a [[lossy]] coding method thus it is primarily useful at low bitrates. For coding at higher bitrates only mid-side stereo should be used.
 
  
 
==Additional information==
 
==Additional information==
Some more details about joint stereo & mid-side coding:
 
 
* Bugs and/or not-optimized encoders may implement mid-side coding incorrectly, making mid-side coding sounds worse than simple stereo, while in reality (see the formulas above) there should be no difference in quality between mid-side stereo and simple stereo.
 
 
* Modern/optimized encoders will use mid-side coding or simple stereo coding as necessary, depending on the correlation between the Left and Right channels.
 
  
 +
Some early MP3 encoders didn't make ideal decisions about what mode to use from frame to frame in joint stereo files, or how much bandwidth to allocate to encoding the side channel. This led to a widespread but mistaken belief that an abundance of M/S frames, or the use of joint stereo in general, always negatively impacts channel separation and other measures of audio quality. This is not an issue with modern encoders. Modern, optimized encoders will switch between mid-side coding or simple stereo coding as necessary, depending on the correlation between the left and right channels, and will allocate channel bandwidth appropriately to ensure the best mode is used for each frame.
  
 
==External Links==
 
==External Links==
* [http://harmsy.freeuk.com/mostync/ Joint stereo myths and realities] ''(very important read!)''
+
* [http://en.wikipedia.org/wiki/Joint_stereo joint stereo at Wikipedia]
 
+
* [http://www.audiocoding.com/ Audiocoding] -- written by Menno Bakker
+
 
+
* [http://www.High-Quality.ch.vu/ High Quality Audio guides] -- written by user<br />'''Note: Despite this page's recommendation for encoding in MPC, please be aware that in recent listening tests, other encodings are also proven to be at least as good as MPC, if not better.
+
 
+
* [http://en.wikipedia.org/wiki/Joint_stereo joint stereo at wikipedia]
+
  
 +
==References==
 +
<references/>
  
 
[[Category:Technical]]
 
[[Category:Technical]]

Revision as of 21:58, 23 March 2014

Joint stereo is a property of an audio data stream and means that the stream supports more than one method of stereo coding, such as SS ("simple" or "L/R" stereo or DualMono), MS ("mid-side" stereo), or IS ("intensity" stereo). A joint stereo stream may still only employ a single coding method, but for the sake of efficiency or quality may switch between methods on a frame or even sub-frame basis.

For example, a high-bitrate "joint stereo" MP3 file may contain a mixture of SS and MS frames, or it may contain all SS frames or all MS frames. A non-"joint stereo" MP3 will never contain a mixture of frame types.

"Joint stereo coding methods" in prose generally refers to whatever alternatives to simple (L/R) stereo coding are supported by a particular format, even though simple stereo is also an option.

Stereo coding methods or "modes"

Left-Right (L/R) or "Simple" Stereo (SS)

Simple stereo is the most straightforward method of coding a stereo signal: each channel is treated as a completely separate entity. This can be inefficient and may adversely impact quality (as compared to other modes) when both channels contain nearly identical signals (i.e., are mono or nearly so).

Mid-side Stereo (MS)

Mid-side stereo coding calculates a "mid"-channel by addition of left and right channel, and a "side"-channel, i.e.:


Left = L \qquad Right = R\,


Middle=\frac{L+R}{2} \qquad Side=\frac{L-R}{2}


Left=Middle+Side \qquad Right=Middle-Side


Whenever a signal is concentrated in the middle of the stereo image (i.e. more mono-like), mid-side stereo can achieve a significant saving in bitrate, since one can use fewer bits to encode the side-channel. Even more important is the fact that by applying the inverse matrix in the decoder, the quantization noise becomes correlated and falls in the middle of the stereo image, where it is masked by the signal.

Unlike intensity stereo which destroys phase information, mid-side coding keeps the phase information pretty much intact. Correctly implemented mid-side stereo does very little or no damage to the stereo image and increases compression efficiency either by reducing size or increasing overall quality.

Intensity Stereo

Intensity stereo coding is a method that achieves a saving in bitrate by replacing the left and the right signal by a single representing signal plus directional information. This replacement is psychoacoustically justified in the higher frequency range since the human auditory system is insensitive to the signal phase at frequencies above approximately 2 kHz.[1]

Intensity stereo is by definition a lossy coding method thus it is primarily useful at low bitrates. For coding at higher bitrates only mid-side stereo should be used.

Additional information

Some early MP3 encoders didn't make ideal decisions about what mode to use from frame to frame in joint stereo files, or how much bandwidth to allocate to encoding the side channel. This led to a widespread but mistaken belief that an abundance of M/S frames, or the use of joint stereo in general, always negatively impacts channel separation and other measures of audio quality. This is not an issue with modern encoders. Modern, optimized encoders will switch between mid-side coding or simple stereo coding as necessary, depending on the correlation between the left and right channels, and will allocate channel bandwidth appropriately to ensure the best mode is used for each frame.

External Links

References

  1. http://www.hydrogenaudio.org/forums/index.php?showtopic=1491&view=findpost&p=14091