Difference between revisions of "Vorbis"

From Hydrogenaudio Knowledgebase
Jump to: navigation, search
(Tagger and VorbisComment 1.1.1)
(Typoes and Dots and DAPs)
Line 16: Line 16:
  
 
==Pros==
 
==Pros==
* Ogg Vorbis specification is in the public domain. It is free for commercial or noncommercial use, under both (LGPL and BSD liscenes).
+
* Ogg Vorbis specification is in the public domain; it is free for commercial or noncommercial use, under both (LGPL and BSD liscenes)
 
* Good all-round performance (>48 kbps - a leading codec at [http://www.rjamorim.com/test/multiformat128/results.html 128 kbps])
 
* Good all-round performance (>48 kbps - a leading codec at [http://www.rjamorim.com/test/multiformat128/results.html 128 kbps])
 
* Well written [http://www.xiph.org/ogg/vorbis/docs.html specs]
 
* Well written [http://www.xiph.org/ogg/vorbis/docs.html specs]
* Several portable [http://wiki.hydrogenaudio.org/index.php?title=Ogg_Vorbis#Hardware hardware players]
+
* Supported by portable [[Ogg Vorbis#Supporting Digital Audio Players|DAPs]]
 
* Suitable for internet-streaming (via [http://www.icecast.org/ Icecast] and other methods)
 
* Suitable for internet-streaming (via [http://www.icecast.org/ Icecast] and other methods)
 
* Fully [[gapless]] playback
 
* Fully [[gapless]] playback
 
* High potential for further tuning
 
* High potential for further tuning
* Structured to allow the design for a hybrid filterbank.
+
* Structured to allow the design for a hybrid filterbank
  
  
Line 29: Line 29:
 
* Limited official development (third-party developement is always encouraged)
 
* Limited official development (third-party developement is always encouraged)
 
* Current implementations are more computationally intensive to decode than MP3
 
* Current implementations are more computationally intensive to decode than MP3
* Multichannel input mappings for 5.1, Ambisonic-B, and other configs have no channel coupling and aren't tuned (expect sub-optimal results until code is improved).
+
* Multichannel input mappings for 5.1, Ambisonic-B, and other configs have no channel coupling and aren't tuned (expect sub-optimal results until code is improved)
  
  
Line 38: Line 38:
 
* Psychoacoustics masking is exploited via an ([[ATH]] model)
 
* Psychoacoustics masking is exploited via an ([[ATH]] model)
 
* Masking curves are computed from an ''emperically'' adjusted set of [http://www.zainea.com/masking2.htm Ehmer Curves]
 
* Masking curves are computed from an ''emperically'' adjusted set of [http://www.zainea.com/masking2.htm Ehmer Curves]
* Modified Discrete Cosine Transform ([[MDCT]]) is used for noise analysis.
+
* Modified Discrete Cosine Transform ([[MDCT]]) is used for noise analysis
* Fast Fourier Transform ([[FFT]]) is used for tonal analysis.
+
* Fast Fourier Transform ([[FFT]]) is used for tonal analysis
* Global masking curve is a mixture between calculated FFT+MDCT curves and ATH curves overlayed
+
* Global masking curve is a mixture between calculated FFT+MDCT curves and ATH curves overlayed
* Floor 1 or the noise-floor (envelope) is calculated using the global masking curve & piecewise linear approximation divided by spectrum to generate the residue (fine detail). The Levinson-Durbin [http://www.data-compression.com/speech.html#ana LPC model] in Floor 0 is no longer used, however the code still exists.
+
* Floor 1 or the noise-floor (envelope) is calculated using the global masking curve & piecewise linear approximation divided by spectrum to generate the residue (fine detail). The Levinson-Durbin [http://www.data-compression.com/speech.html#ana LPC model] in Floor 0 is no longer used, however the code still exists
* [[Noise normalization]] is applied to compensate for energy lost in certain frequency bands due to quantization.
+
* [[Noise normalization]] is applied to compensate for energy lost in certain frequency bands due to quantization
* The channels are [[channel coupling|coupled]] ''strictly'' by residue using ([http://us.xiph.org/ogg/vorbis/doc/stereo.html point/phase stereo] and lossless).
+
* The channels are [[channel coupling|coupled]] ''strictly'' by residue using ([http://us.xiph.org/ogg/vorbis/doc/stereo.html point/phase stereo] and lossless)
* Multistage [[Vector quantization]] is used for coding the noise-floor and residue backend using ''trained'' codebooks.  
+
* Multistage [[Vector quantization]] is used for coding the noise-floor and residue backend using ''trained'' codebooks.
* [[Huffman coding]] is used to minimize vector codeword redundancy.
+
* [[Huffman coding]] is used to minimize vector codeword redundancy
  
  
Line 58: Line 58:
 
===Decoders===
 
===Decoders===
 
* [[Ogg123]] for Unix systems (GPL), a very simple to use command-line player. (Win32/Posix)
 
* [[Ogg123]] for Unix systems (GPL), a very simple to use command-line player. (Win32/Posix)
* [http://www.rarewares.org/ogg.html OggDec] for Windows, by John33, a very featureful command line decoder. (Win32)
+
* [http://www.rarewares.org/ogg.html OggDec] for Windows, by John33, a very featureful command line decoder (Win32)
 
* [http://qtcomponents.sourceforge.net/ QuickTime Component] (Allows playback in [[QuickTime]]/[[iTunes]]) (Win32)
 
* [http://qtcomponents.sourceforge.net/ QuickTime Component] (Allows playback in [[QuickTime]]/[[iTunes]]) (Win32)
 
* [http://corevorbis.corecodec.org/ CoreVorbis] (DirectShow) (Win32)
 
* [http://corevorbis.corecodec.org/ CoreVorbis] (DirectShow) (Win32)
Line 64: Line 64:
  
 
===ReplayGain===
 
===ReplayGain===
* [http://www.rarewares.org/ogg.html Vorbisgain] to apply [[ReplayGain]] on Vorbis files.
+
* [http://www.rarewares.org/ogg.html Vorbisgain] to apply [[ReplayGain]] on Vorbis files
  
 
===Cutter===
 
===Cutter===
The following utilities are used to cut Vorbis files without decoding/reencoding. This list is derived from [http://www.hydrogenaudio.org/forums/index.php?showtopic=44830&hl= this HA thread].
+
The following utilities are used to cut Vorbis files without decoding/reencoding. This list is derived from [http://www.hydrogenaudio.org/forums/index.php?showtopic=44830&hl= this HA thread]
 
* [http://www.free-codecs.com/download/Ogg_Cutter.htm Ogg Cutter]
 
* [http://www.free-codecs.com/download/Ogg_Cutter.htm Ogg Cutter]
 
* [http://www.free-codecs.com/download/musiCutter.htm musiCutter]
 
* [http://www.free-codecs.com/download/musiCutter.htm musiCutter]
Line 78: Line 78:
  
  
=Supported Digital Audio Players=
+
=Supporting Digital Audio Players=
 
The following list contains some players that support Vorbis playback.
 
The following list contains some players that support Vorbis playback.
 
* [[Apple iPod]] with [[Rockbox]] firmware -- check out this [http://www.hydrogenaudio.org/forums/index.php?s=32eeac65958144db631c8a739b41983c&showtopic=40992 still evolving HA thread]
 
* [[Apple iPod]] with [[Rockbox]] firmware -- check out this [http://www.hydrogenaudio.org/forums/index.php?s=32eeac65958144db631c8a739b41983c&showtopic=40992 still evolving HA thread]
Line 98: Line 98:
  
 
* [[Ogg]] (Container)
 
* [[Ogg]] (Container)
* [http://www.vorbis.com Vorbis official website] (updated)
+
* [http://www.vorbis.com Vorbis official website] (updated continually)
* [http://en.wikipedia.org/wiki/Vorbis Vorbis at Wikipedia.]  
+
* [http://en.wikipedia.org/wiki/Vorbis Vorbis at Wikipedia]  
 
* [http://www.audiocoding.com/modules/wiki/?page=Ogg+Vorbis Ogg Vorbis at AudioCoding]
 
* [http://www.audiocoding.com/modules/wiki/?page=Ogg+Vorbis Ogg Vorbis at AudioCoding]
* [http://www.rarewares.org/ogg.html Ogg Vorbis binaries at Rarewares. ]
+
* [http://www.rarewares.org/ogg.html Ogg Vorbis binaries at Rarewares]
 
* [http://www.rjamorim.com/test/ Listening tests comparing Vorbis against MP3, AAC, WMA, etc.]
 
* [http://www.rjamorim.com/test/ Listening tests comparing Vorbis against MP3, AAC, WMA, etc.]
* [[Recommended Ogg Vorbis|Recommended settings for encoding with Vorbis]] and its related [http://www.hydrogenaudio.org/forums/index.php?showtopic=15049 HA thread].
+
* [[Recommended Ogg Vorbis|Recommended settings for encoding with Vorbis]] and its related [http://www.hydrogenaudio.org/forums/index.php?showtopic=15049 HA thread]
 
* [http://wiki.hydrogenaudio.org/index.php?title=EAC_and_Ogg_Vorbis Configuring EAC and Vorbis as an external command-line encoder]  
 
* [http://wiki.hydrogenaudio.org/index.php?title=EAC_and_Ogg_Vorbis Configuring EAC and Vorbis as an external command-line encoder]  
* [http://www.geocities.jp/aoyoume/aotuv/ Aoyumi's homepage of tuned versions of Vorbis encoder and current beta binaries.]
+
* [http://www.geocities.jp/aoyoume/aotuv/ Aoyumi's homepage of tuned versions of Vorbis encoder and current beta binaries]
* [http://homepage3.nifty.com/blacksword/index_e.htm The Ogg Vorbis Acceleration Project]: Archer/[[Lancer]] homepage for optimized versions of aoTuV Vorbis encoder and other SSE optmizations.
+
* [http://homepage3.nifty.com/blacksword/index_e.htm The Ogg Vorbis Acceleration Project]: Archer/[[Lancer]] homepage for optimized versions of aoTuV Vorbis encoder and other SSE optmizations  
* [http://www.hydrogenaudio.org/forums/index.php?showtopic=20132&st=0 Noise Normalization and HF Boost problem solution that ultimatly lead to the aoTuV tunings]  
+
* [http://www.hydrogenaudio.org/forums/index.php?showtopic=20132&st=0 Noise Normalization and HF Boost problem solution that ultimatly lead to the aoTuV tunings (HA Thread)]  
 
* [http://www.free-comp-shop.com/vorbis.pdf Keith Wright rendition of understanding the MDCT in Vorbis by defining it's basic trig properties (good read) (PDF)]  
 
* [http://www.free-comp-shop.com/vorbis.pdf Keith Wright rendition of understanding the MDCT in Vorbis by defining it's basic trig properties (good read) (PDF)]  
* [http://www.mp3-tech.org/programmer/docs/embedded_vorbis_thesis.pdf Ogg Vorbis decoder for an embedded system. Master Thesis (PDF)]  
+
* [http://www.mp3-tech.org/programmer/docs/embedded_vorbis_thesis.pdf Ogg Vorbis decoder for an embedded system (Master Thesis in PDF)]  
 
* [http://www.xiph.org Xiph.org Foundation]
 
* [http://www.xiph.org Xiph.org Foundation]
 
* [http://wiki.xiph.org/index.php/Bounties Xiph.org Vorbis bounties]
 
* [http://wiki.xiph.org/index.php/Bounties Xiph.org Vorbis bounties]

Revision as of 07:46, 22 May 2006

Fish logo.png


Introduction

Ogg Vorbis (www.vorbis.com) is a fully open, non-proprietary, patent-free (subject to speculation), and royalty-free, general-purpose compressed audio format for mid to high quality (8khz-48.0kHz, 16+ bit, multichannel) audio and music at fixed and variable bitrates from 16 to >256 kbps/channel. This places vorbis in the same competitive class as audio representations such as MPEG-4 (AAC), and similar to, but higher performance than MP3, TwinVQ (VQF), WMA and PAC. Vorbis is the first of a planned family of Ogg multimedia coding formats being developed as part of Xiph.org's ogg multimedia project.

Informal listening test suggests Vorbis to be comparable to MPEG-4 AAC at most bitrates and MPC at 128 kbps. Transparency is generally reached at about 150-170 kbps (-q 5) (with some exceptions). The encoder is reasonably young and unoptimized, so further improvements can always be expected.

Unfortunately, Xiph.org has failed to improve Vorbis at a steady rate since its initial 1.0 release in July 2002 (due to other developement projects and time constraints). Since then development has been led by other coders such as Garf and Aoyumi. Aoyumi's aoTuV series of encoders was incorporated into the September 2004 release of 1.1, which brought about the first quality improvements across the board for 2 years. Currently Aoyumi is working on aoTuV Beta 4 and future releases. The latest version is aoTuV Beta 4.51, released in December 2005. Unfortunately, the improvements of aoTuV Beta 4 line has not been incorporated yet into the 'official' Vorbis line.

Vorbis has had success with many recent video game titles employing Vorbis as opposed to MP3 (with Epic Games' Unreal Tournament 2003 and Unreal Tournament 2004, the PC port of Microsoft's Halo and Uru being notable examples). Ogg Vorbis is also an official part of the OpenAL API extension library, used in many popular computer games. On April 10, 2006, RAD Game Tools integrated Ogg Vorbis support to their Miles Sound System (MSS), which has been used in over 3,200 games worldwide. This ensures that future games utilizing MSS will have the capability to play Ogg Vorbis files. Check out xiph wiki for a full list of games confirmed to use Ogg Vorbis.

Before encoding files using Ogg Vorbis, check out the Recommended Ogg Vorbis to determine what encoder to use and what settings are recommended by Hydrogenaudio.


Pros

  • Ogg Vorbis specification is in the public domain; it is free for commercial or noncommercial use, under both (LGPL and BSD liscenes)
  • Good all-round performance (>48 kbps - a leading codec at 128 kbps)
  • Well written specs
  • Supported by portable DAPs
  • Suitable for internet-streaming (via Icecast and other methods)
  • Fully gapless playback
  • High potential for further tuning
  • Structured to allow the design for a hybrid filterbank


Cons

  • Limited official development (third-party developement is always encouraged)
  • Current implementations are more computationally intensive to decode than MP3
  • Multichannel input mappings for 5.1, Ambisonic-B, and other configs have no channel coupling and aren't tuned (expect sub-optimal results until code is improved)


Technical details

  • Multiple block sizes for window switching including overlap (powers of two only) (128/1024, 256/2048, 512/4096)
  • Customly designed window function is applied similiar to the sine window. (good sidelobe rejection)
w_k = \sin{(\frac{\pi}{2} \cdot sin^2[(\pi\div2n \cdot (k+0.5))]}
  • Psychoacoustics masking is exploited via an (ATH model)
  • Masking curves are computed from an emperically adjusted set of Ehmer Curves
  • Modified Discrete Cosine Transform (MDCT) is used for noise analysis
  • Fast Fourier Transform (FFT) is used for tonal analysis
  • Global masking curve is a mixture between calculated FFT+MDCT curves and ATH curves overlayed
  • Floor 1 or the noise-floor (envelope) is calculated using the global masking curve & piecewise linear approximation divided by spectrum to generate the residue (fine detail). The Levinson-Durbin LPC model in Floor 0 is no longer used, however the code still exists
  • Noise normalization is applied to compensate for energy lost in certain frequency bands due to quantization
  • The channels are coupled strictly by residue using (point/phase stereo and lossless)
  • Multistage Vector quantization is used for coding the noise-floor and residue backend using trained codebooks.
  • Huffman coding is used to minimize vector codeword redundancy


Software

Encoders

  • Oggenc official command-line encoder (Win32/Posix)
  • OggdropXPd advanced drag-and-drop encoder by John33 (Win32)
  • foo_vorbisenc vorbis encoder library for Foobar2000 (Win32)
  • Lancer SSE-optimized vorbis encoder utility and libraries by BlackSword (Win32)

Decoders

ReplayGain

Cutter

The following utilities are used to cut Vorbis files without decoding/reencoding. This list is derived from this HA thread

Tagger

Most tagger supporting Ogg Vorbis are listed in the download page.


Supporting Digital Audio Players

The following list contains some players that support Vorbis playback.

A longer list can be found at xiph's wiki, but this list may be outdated. There may be players out there that support Ogg Vorbis, although they are not marketed as such.


External links

The following links contain information surrounding the Ogg Vorbis codec that can be found on Hydrogenaudio and elsewhere throughout the web.