1<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"> 2<html> 3<head> 4 5<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-15"/> 6<title>Ogg Vorbis Documentation</title> 7 8<style type="text/css"> 9body { 10 margin: 0 18px 0 18px; 11 padding-bottom: 30px; 12 font-family: Verdana, Arial, Helvetica, sans-serif; 13 color: #333333; 14 font-size: .8em; 15} 16 17a { 18 color: #3366cc; 19} 20 21img { 22 border: 0; 23} 24 25#xiphlogo { 26 margin: 30px 0 16px 0; 27} 28 29#content p { 30 line-height: 1.4; 31} 32 33h1, h1 a, h2, h2 a, h3, h3 a { 34 font-weight: bold; 35 color: #ff9900; 36 margin: 1.3em 0 8px 0; 37} 38 39h1 { 40 font-size: 1.3em; 41} 42 43h2 { 44 font-size: 1.2em; 45} 46 47h3 { 48 font-size: 1.1em; 49} 50 51li { 52 line-height: 1.4; 53} 54 55#copyright { 56 margin-top: 30px; 57 line-height: 1.5em; 58 text-align: center; 59 font-size: .8em; 60 color: #888888; 61 clear: both; 62} 63</style> 64 65</head> 66 67<body> 68 69<div id="xiphlogo"> 70 <a href="http://www.xiph.org/"><img src="fish_xiph_org.png" alt="Fish Logo and Xiph.org"/></a> 71</div> 72 73<h1>Ogg Vorbis encoding format documentation</h1> 74 75<p><img src="wait.png" alt="wait"/>As of writing, not all the below document 76links are live. They will be populated as we complete the documents.</p> 77 78<h2>Documents</h2> 79 80<ul> 81<li><a href="packet.html">Vorbis packet structure</a></li> 82<li><a href="envelope.html">Temporal envelope shaping and blocksize</a></li> 83<li><a href="mdct.html">Time domain segmentation and MDCT transform</a></li> 84<li><a href="resolution.html">The resolution floor</a></li> 85<li><a href="residuals.html">MDCT-domain fine structure</a></li> 86</ul> 87 88<ul> 89<li><a href="probmodel.html">The Vorbis probability model</a></li> 90<li><a href="bitpack.html">The Vorbis bitpacker</a></li> 91</ul> 92 93<ul> 94<li><a href="oggstream.html">Ogg bitstream overview</a></li> 95<li><a href="framing.html">Ogg logical bitstream and framing spec</a></li> 96<li><a href="vorbis-stream.html">Vorbis packet->Ogg bitstream mapping</a></li> 97</ul> 98 99<ul> 100<li><a href="programming.html">Programming with libvorbis</a></li> 101</ul> 102 103<h2>Description</h2> 104 105<p>Ogg Vorbis is a general purpose compressed audio format 106for high quality (44.1-48.0kHz, 16+ bit, polyphonic) audio and music 107at moderate fixed and variable bitrates (40-80 kb/s/channel). This 108places Vorbis in the same class as audio representations including 109MPEG-1 audio layer 3, MPEG-4 audio (AAC and TwinVQ), and PAC.</p> 110 111<p>Vorbis is the first of a planned family of Ogg multimedia coding 112formats being developed as part of the Xiph.org Foundation's Ogg multimedia 113project. See <a href="http://www.xiph.org/">http://www.xiph.org/</a> 114for more information.</p> 115 116<h2>Vorbis technical documents</h2> 117 118<p>A Vorbis encoder takes in overlapping (but contiguous) short-time 119segments of audio data. The encoder analyzes the content of the audio 120to determine an optimal compact representation; this phase of encoding 121is known as <em>analysis</em>. For each short-time block of sound, 122the encoder then packs an efficient representation of the signal, as 123determined by analysis, into a raw packet much smaller than the size 124required by the original signal; this phase is <em>coding</em>. 125Lastly, in a streaming environment, the raw packets are then 126structured into a continuous stream of octets; this last phase is 127<em>streaming</em>. Note that the stream of octets is referred to both 128as a 'byte-' and 'bit-'stream; the latter usage is acceptible as the 129stream of octets is a physical representation of a true logical 130bit-by-bit stream.</p> 131 132<p>A Vorbis decoder performs a mirror image process of extracting the 133original sequence of raw packets from an Ogg stream (<em>stream 134decomposition</em>), reconstructing the signal representation from the 135raw data in the packet (<em>decoding</em>) and them reconstituting an 136audio signal from the decoded representation (<em>synthesis</em>).</p> 137 138<p>The <a href="programming.html">Programming with libvorbis</a> 139documents discuss use of the reference Vorbis codec library 140(libvorbis) produced by the Xiph.org Foundation.</p> 141 142<p>The data representations and algorithms necessary at each step to 143encode and decode Ogg Vorbis bitstreams are described by the below 144documents in sufficient detail to construct a complete Vorbis codec. 145Note that at the time of writing, Vorbis is still in a 'Request For 146Comments' stage of development; despite being in advanced stages of 147development, input from the multimedia community is welcome.</p> 148 149<h3>Vorbis analysis and synthesis</h3> 150 151<p>Analysis begins by seperating an input audio stream into individual, 152overlapping short-time segments of audio data. These segments are 153then transformed into an alternate representation, seeking to 154represent the original signal in a more efficient form that codes into 155a smaller number of bytes. The analysis and transformation stage is 156the most complex element of producing a Vorbis bitstream.</p> 157 158<p>The corresponding synthesis step in the decoder is simpler; there is 159no analysis to perform, merely a mechanical, deterministic 160reconstruction of the original audio data from the transform-domain 161representation.</p> 162 163<ul> 164<li><a href="packet.html">Vorbis packet structure</a>: 165Describes the basic analysis components necessary to produce Vorbis 166packets and the structure of the packet itself.</li> 167<li><a href="envelope.html">Temporal envelope shaping and blocksize</a>: 168Use of temporal envelope shaping and variable blocksize to minimize 169time-domain energy leakage during wide dynamic range and spectral energy 170swings. Also discusses time-related principles of psychoacoustics.</li> 171<li><a href="mdct.html">Time domain segmentation and MDCT transform</a>: 172Division of time domain data into individual overlapped, windowed 173short-time vectors and transformation using the MDCT</li> 174<li><a href="resolution.html">The resolution floor</a>: Use of frequency 175doamin psychoacoustics, and the MDCT-domain noise, masking and resolution 176floors</li> 177<li><a href="residuals.html">MDCT-domain fine structure</a>: Production, 178quantization and massaging of MDCT-spectrum fine structure</li> 179</ul> 180 181<h3>Vorbis coding and decoding</h3> 182 183<p>Coding and decoding converts the transform-domain representation of 184the original audio produced by analysis to and from a bitwise packed 185raw data packet. Coding and decoding consist of two logically 186orthogonal concepts, <em>back-end coding</em> and <em>bitpacking</em>.</p> 187 188<p><em>Back-end coding</em> uses a probability model to represent the raw numbers 189of the audio representation in as few physical bits as possible; 190familiar examples of back-end coding include Huffman coding and Vector 191Quantization.</p> 192 193<p><em>Bitpacking</em> arranges the variable sized words of the back-end 194coding into a vector of octets without wasting space. The octets 195produced by coding a single short-time audio segment is one raw Vorbis 196packet.</p> 197 198<ul> 199<li><a href="probmodel.html">The Vorbis probability model</a></li> 200<li><a href="bitpack.html">The Vorbis bitpacker</a>: Arrangement of 201variable bit-length words into an octet-aligned packet.</li> 202</ul> 203 204<h3>Vorbis streaming and stream decomposition</h3> 205 206<p>Vorbis packets contain the raw, bitwise-compressed representation of a 207snippet of audio. These packets contain no structure and cannot be 208strung together directly into a stream; for streamed transmission and 209storage, Vorbis packets are encoded into an Ogg bitstream.</p> 210 211<ul> 212<li><a href="oggstream.html">Ogg bitstream overview</a>: High-level 213description of Ogg logical bitstreams, how logical bitstreams 214(of mixed media types) can be combined into physical bitstreams, and 215restrictions on logical-to-physical mapping. Note that this document is 216not specific only to Ogg Vorbis.</li> 217<li><a href="framing.html">Ogg logical bitstream and framing 218spec</a>: Low level, complete specification of Ogg logical 219bitstream pages. Note that this document is not specific only to Ogg 220Vorbis.</li> 221<li><a href="vorbis-stream.html">Vorbis bitstream mapping</a>: 222Specifically describes mapping Vorbis data into an 223Ogg physical bitstream.</li> 224</ul> 225 226<div id="copyright"> 227 The Xiph Fish Logo is a 228 trademark (™) of Xiph.Org.<br/> 229 230 These pages © 1994 - 2005 Xiph.Org. All rights reserved. 231</div> 232 233</body> 234</html> 235