Advanced Audio Coding (AAC) is a standardized, lossy compression and encoding scheme for digital audio. Designed to be the successor of the MP3 format, AAC generally achieves better sound quality than MP3 at many bit rates. AAC has been standardized by ISO and IEC, as part of the MPEG-2 & MPEG-4 specifications. The MPEG-2 standard contains several audio coding methods, including the MP3 coding scheme. AAC is able to include 48 full-bandwidth (up to 96 kHz) audio channels in one stream plus 15 low frequency enhancement (LFE, limited to 120 Hz) channels and up to 15 data streams. AAC is able to achieve good audio quality at data rates of 320 kbit/s for five channels. The quality for stereo is satisfactory to modest requirements at 96 kbit/s in joint stereo mode, however hi-fi transparency demands data rates of at least 192 kbit/s (VBR), as with MP3. AAC's best known use is as the default audio format of Apple's iPhone, iPod, iTunes, and the format used for all iTunes Store audio (with extensions for proprietary digital rights management). AAC is also the standard audio format for Sony’s PlayStation 3 and is supported by Sony's Playstation Portable, latest generation of Sony Walkman, Walkman Phones from Sony Ericsson, Nintendo's Wii (with the Photo Channel 1.1 update installed for Wii consoles purchased before late 2007), the Nintendo DSi, and the MPEG-4 video standard. HE-AAC is part of digital radio standards like DAB+ and Digital Radio Mondiale.
[edit] HistoryAAC was developed with the cooperation and contributions of companies including Fraunhofer IIS, AT&T Bell Laboratories, Dolby, Sony Corporation and Nokia, and was officially declared an international standard by the Moving Pictures Experts Group in April 1997. MPEG-2 AAC-LC profile consists of a base format very much like AT&T's PAC coding format [1] [2] [3], with the addition of TNS[4], the Dolby Kaiser Window described below, a nonuniform quantizer, and a reworking of the bitstream format to handle up to 16 stereo, 16 mono, 16 LFE, and 16 commentary channels in one bitstream. The Main profile adds a set of recursive predictors that are calculated on each tap of the filterbank. The SSR uses a 4-band PQMF filterbank, with four shorter filterbanks following, in order to allow for scalable sampling rates. . [edit] StandardizationIt is specified both as Part 7 of the MPEG-2 standard, and Part 3 of the MPEG-4 standard. As such, it can be referred to as MPEG-2 Part 7 and MPEG-4 Part 3 depending on its implementation, however it is most often referred to as MPEG-4 AAC, or AAC for short. AAC was first specified in the standard MPEG-2 Part 7 (known formally as ISO/IEC 13818-7:1997) in 1997 as a new "part" (distinct from ISO/IEC 13818-3) in the MPEG-2 family of international standards. It was updated in MPEG-4 Part 3 (known formally as ISO/IEC 14496-3:1999) in 1999. The reference software is specified in MPEG-4 Part 4 and the conformance bit-streams are specified in MPEG-4 Part 5. A notable addition in this version of the standard is Perceptual Noise Substitution (PNS). HE-AAC (AAC with SBR) was first standardized in ISO/IEC 14496-3:2001/Amd.1. HE-AAC v2 (AAC with Parametric Stereo) was first specified in ISO/IEC 14496-3:2001/Amd.4. [5] The current version of the AAC standard is ISO/IEC 14496-3:2005 (with 14496-3:2005/Amd.2. for HE-AAC v2[6]) AAC+ v2 is also standardized by ETSI (European Telecommunications Standards Institute) as TS 102005.[5] The MPEG-4 standard also contains other ways of compressing sound. These are low bit-rate and generally used for speech. [edit] AAC’s improvements over MP3AAC was designed to fix many of the serious performance flaws in the MP3 format (which was specified in MPEG-1 and MPEG-2) by the ISO/IEC in 11172-3 and 13818-3. Advanced Audio Coding is designed to be the successor of the MP3 format and demonstrates greater sound quality and transparency than MP3 files coded at the same bit rate. The commonly used AAC format implements the MPEG-4 coding method and is most often downloaded with lower-fi 128 kbit/s quality. Optimal hi-fi quality vs. file ratio is usually specified at 256 kbit/s, taking up a bit more than one-fifth the file size than a 1440 kbit/s uncompressed file size, that a WAV or AIFF (original CD size file) consists of. Improvements include:
Overall, the AAC format allows developers more flexibility to design codecs than MP3 does, and corrects many of the unfortunate design choices made in the original MPEG-1 audio specification. This increased flexibility often leads to more concurrent encoding strategies and, as a result, to more efficient compression. However in terms of whether AAC is better than MP3, the advantages of AAC are not entirely decisive, and the MP3 specification, although antiquated, has proven surprisingly robust in spite of considerable flaws. AAC and HE-AAC are universally accepted as better than MP3 at low bit rates (typically less than 128 kilobits per second). This is especially true at very low bit rates where the superior stereo coding, pure MDCT, and more optimal transform window sizes leave MP3 unable to compete. However, as bit rate increases, the efficiency of an audio format becomes less important relative to the efficiency of the encoder's implementation, and the intrinsic advantage AAC holds over MP3 no longer dominates audio quality. [edit] How AAC worksAAC is a wideband audio coding algorithm that exploits two primary coding strategies to dramatically reduce the amount of data needed to represent high-quality digital audio.
The actual encoding process consists of the following steps:
The MPEG-4 audio standard does not define a single or small set of highly efficient compression schemes but rather a complex toolbox to perform a wide range of operations from low bitrate speech coding to high-quality audio coding and music synthesis.
AAC encoders can switch dynamically between a single MDCT block of length 1024 points or 8 blocks of 128 points.
[edit] Modular encodingAAC takes a modular approach to encoding. Depending on the complexity of the bitstream to be encoded, the desired performance and the acceptable output, implementers may create profiles to define which of a specific set of tools they want to use for a particular application. The standard offers four default profiles:
Depending on the AAC profile and the MP3 encoder, 96 kbit/s AAC can give nearly the same or better perceptual quality as 128 kbit/s MP3.[7] [edit] AAC error protection toolkitApplying error protection enables error correction up to a certain extent. Error correcting codes are usually applied equally to the whole payload. However since different parts of an AAC payload show different sensitivity to transmission errors, this would not be a very efficient approach. The AAC payload can be subdivided into parts with different error sensitivities.
[edit] Error Resilient (ER) AACError Resilience (ER) techniques can be used to make the coding scheme itself more robust against errors. For AAC, three custom-tailored methods were developed and defined in MPEG-4 Audio
[edit] AAC Low DelayThe MPEG-4 Low Delay Audio Coder (AAC-LD) is designed to combine the advantages of perceptual audio coding with the low delay necessary for two-way communication. It is closely derived from the MPEG-2 Advanced Audio Coding (AAC) format. [edit] Licensing and PatentsNo licenses or payments are required to be able to stream or distribute content in AAC format. [8] This reason alone makes AAC a much more attractive format to distribute content than MP3, particularly for streaming content (such as Internet radio). However, a patent license is required for all manufacturers or developers of AAC codecs [9]. It is for this reason FOSS implementations such as FAAC and FAAD are distributed in source form only, in order to avoid patent infringement. (See below under Products that support AAC, Software.) [edit] Products that support AAC[edit] HDTV Standards[edit] Japanese ISDB-TIn December 2003, Japan started broadcasting terrestrial DTV ISDB-T standard that implements MPEG-2 video and MPEG-2 AAC audio. In April 2006 Japan started broadcasting the ISDB-T mobile sub-program, called 1seg, that was the first implementation of video H.264/AVC with audio HE-AAC in Terrestrial HDTV broadcasting service on the planet. [edit] International ISDB-TbIn December 2007, Brazil started broadcasting terrestrial DTV standard called International ISDB-Tb that implements video coding H.264/AVC with audio AAC-LC on main program(single or multi) and video H.264/AVC with audio HE-AACv2 in the 1seg mobile sub-program. [edit] Hardware[edit] iTunes and iPodIn April 2003, Apple Computer brought mainstream attention to AAC by announcing that its iTunes and iPod products would support songs in MPEG-4 AAC format (via a firmware update for older iPods). Customers could download music in a proprietary Digital Rights Management (DRM)-restricted form of AAC (see FairPlay) via the iTunes Store or create files without DRM from their own CDs using iTunes. In later years, Apple began offering music videos and movies, which also use AAC for audio encoding. On May 29, 2007, Apple began selling songs and music videos free of DRM from participating record labels. These files mostly adhere to the AAC standard and are playable on many non-Apple products but they do include custom iTunes information such as album artwork and a purchase receipt, so as to identify the customer in case the file is leaked out onto peer-to-peer networks, it is possible however to remove these custom tags to restore interoperability with players that conform strictly to the AAC specification. iTunes supports a "Variable bit rate" (VBR) encoding option which encodes AAC tracks in an "Average bit rate" (ABR) scheme. As of October 2007, Apple has not added support for HE-AAC which is fully part of the MP4 standard or true VBR encoding to iTunes. [edit] Other Portable Players
[edit] Mobile phonesFor a number of years, many mobile phones from manufacturers such as Nokia, Motorola, Samsung, Sony Ericsson, BenQ-Siemens and Philips have supported AAC playback. The first such phone was the Nokia 5510 released in 2002 which also plays MP3s. However this phone was a commercial failure and such phones with integrated music players did not gain mainstream popularity until 2005 when the trend of having AAC as well as MP3 support continued. Most new smartphones and music-themed phones support playback of these formats.
[edit] Other devices
[edit] SoftwareThe Rockbox Open source firmware (available for multiple portable players) also offers support for AAC to varying degrees, depending on the model of player and the AAC profile. Optional iPod Support (playback of unprotected AAC files) for the Xbox 360 is available as a free download from Xbox Live.[11] [edit] Other software media playersAlmost all current computer media players include built-in decoders for AAC, or can utilize a library to decode it. On Microsoft Windows, DirectShow can be utilized this way with the corresponding filters to enable AAC playback in any DirectShow based player. Software player applications of particular note include:
Some of these players (e.g., foobar2000, Winamp, and VLC) also support the decoding of raw or MP4-contained AAC streamed over HTTP using the SHOUTcast protocol. Plug-ins for Winamp and foobar2000 enable the creation of such streams. [edit] Nero Digital AudioIn May 2006, Nero AG released an AAC encoding tool free of charge, Nero Digital Audio [2], which is capable of encoding LC-AAC, HE-AAC and HE-AAC v2 streams. The tool is a Command Line Interface tool only, and a separate utility is included to decode to PCM WAV. Various tools including the foobar2000 audio player and MeGUI can provide a GUI for the encoder. [edit] FAAC and FAAD2FAAC and FAAD2 stand for Freeware Advanced Audio Coder and Decoder 2 respectively, collectively make up an open source implementation of AAC. [edit] Extensions and improvementsSome extensions have been added to the original AAC standard:
[edit] Container formatsIn addition to the MP4 container format for storage, AAC audio data may be packaged in a more basic format called Audio Data Interchange Format (ADIF),[12] consisting of a single header followed by the raw AAC audio data blocks.[13] Alternatively, it may be packaged in a streaming format called Audio Data Transport Stream (ADTS), consisting of a series of frames, each frame having a header followed by the AAC audio data.[12] Both formats are defined in MPEG-2 part 7, but are only considered informative by MPEG-4, so an MPEG-4 decoder does not need to support either format.[12] Two more formats are defined in MPEG-4 part 3: Low-overhead MPEG-4 Audio Transport Multiplex (LATM), which provides a way to combine separate audio payloads, and Low Overhead Audio Stream (LOAS), a self-synchronizing streaming format.[12] [edit] See also[edit] Notes
[edit] External links
Directorio de Enlaces Directorio dmoz Directorio espejo dmoz Pedro Bernardo | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||