Surround sound is a technique for enriching the sound reproduction quality of an audio source with additional audio channels from speakers that surround the listener (surround channels), providing sound from a 360° radius in the horizontal plane (2D) as opposed to "screen channels" (centre, [front] left, and [front] right) originating only from the listener's forward arc.
Surround sound is characterized by a listener location or sweet spot where the audio effects work best, and presents a fixed or forward perspective of the sound field to the listener at this location. The technique enhances the perception of sound spatialization by exploiting sound localization; a listener's ability to identify the location or origin of a detected sound in direction and distance. Typically this is achieved by using multiple discrete audio channels routed to an array of loudspeakers.
There are various surround sound based formats and techniques, varying in reproduction and recording methods along with the number and positioning of additional channels.
- 1 Fields of application
- 2 Types of media and technologies
- 3 History
- 4 Creating surround sound
- 5 Mapping channels to speakers
- 6 Bass management
- 7 Low Frequency Effects (LFE) channel
- 8 Surround sound specifications
- 9 See also
- 10 Notes
- 11 References
- 12 External links
Fields of application
Though cinema and soundtracks represent the major uses of surround techniques, its scope of application is broader than that as surround sound permits to create an audio-environment for all sorts of purposes. Multichannel audio techniques may be used to reproduce contents as varied as music, speech, natural or synthetic sounds for cinema, television, broadcasting, or computers. In terms of music content for example, a live performance may use multichannel techniques in the context of an open-air concert, of a musical theatre or for broadcasting; for a film specific techniques are adapted to movie theater, or to home (e.g. home cinema systems). The narrative space is also a content that can be enhanced through multichannel techniques. This applies mainly to cinema narratives, for example the speech of the characters of a film, but may also be applied to plays for theatre, to a conference, or to integrate voice-based comments in an archeological site or monument. For example, an exhibition may be enhanced with topical ambient sound of water, birds, train or machine noise. Topical natural sounds may also be used in educational applications. Other fields of application include video game consoles, personal computers and other platforms. In such applications, the content would typically be synthetic noise produced by the computer device in interaction with its user. Significant work has also been done using surround sound for enhanced situation awareness in military and public safety applications.
Types of media and technologies
Commercial surround sound media include videocassettes, DVDs, and HDTV broadcasts encoded as compressed Dolby Digital and DTS, and lossless audio such as DTS HD Master Audio and Dolby TrueHD on Blu-ray Disc and HD DVD, which are identical to the studio master. Other commercial formats include the competing DVD-Audio (DVD-A) and Super Audio CD (SACD) formats, and MP3 Surround. Cinema 5.1 surround formats include Dolby Digital and DTS. Sony Dynamic Digital Sound (SDDS) is an 8 channel cinema configuration which features 5 independent audio channels across the front with two independent surround channels, and a Low-frequency effects channel. Traditional 7.1 surround speaker configuration introduces two additional rear speakers to the conventional 5.1 arrangement, for a total of four surround channels and three front channels, to create a more 360° sound field.
Most surround sound recordings are created by film production companies or video game producers; however some consumer camcorders have such capability either built-in or available separately. Surround sound technologies can also be used in music to enable new methods of artistic expression. After the failure of quadraphonic audio in the 1970s, multichannel music has slowly been reintroduced since 1999 with the help of SACD and DVD-Audio formats. Some AV receivers, stereophonic systems, and computer soundcards contain integral digital signal processors and/or digital audio processors to simulate surround sound from a stereophonic source.
In 1967, the rock group Pink Floyd performed the first-ever surround sound concert at "Games for May", a lavish affair at London’s Queen Elizabeth Hall where the band debuted its custom-made quadraphonic speaker system. The control device they had made, the Azimuth Co-ordinator, is now displayed at London's Victoria and Albert Museum, as part of their Theatre Collections gallery.
The first documented use of surround sound was in 1940, for the Disney studio's animated film Fantasia. Walt Disney was inspired by Nikolai Rimsky-Korsakov's operatic piece, Flight of the Bumblebee to have a bumblebee featured in his musical Fantasia and also sound as if it was flying in all parts of the theatre – the unsuccessful experimentation led to the music being excluded from the film and the eventual invention of "surround sound".
The initial multichannel audio application was called 'Fantasound', comprising three audio channels and speakers. The sound was diffused throughout the cinema, initially by an engineer using some 54 loudspeakers. The surround sound was achieved using the sum and the difference of the phase of the sound. In the 1950s, the German composer Karlheinz Stockhausen experimented with and produced ground-breaking electronic compositions such as Gesang der Jünglinge and Kontakte, the latter using fully discrete and rotating quadraphonic sounds generated with industrial electronic equipment in Herbert Eimert's studio at the Westdeutscher Rundfunk (WDR). Edgar Varese's Poeme Electronique, created for the Iannis Xenakis designed Philips Pavilion at the 1958 Brussels World's Fair, also utilised spatial audio with 425 loudspeakers used to move sound throughout the pavilion. There are also many other composers that created ground-breaking surround sound works in the same time period.
In 1978, a concept devised by Max Bell for Dolby Laboratories called "split surround" was tested with the movie "Superman". This led to the 70mm stereo surround release of "Apocalypse Now," which became the first formal release in cinemas with 3 channels in the front and 2 in the rear. The "Apocalypse Now" encoder/decoder was designed by Michael Karagosian, also for Dolby Laboratories. The surround mix was produced by an Oscar-winning crew led by Walter Murch for American Zoetrope. The format was also deployed in 1982 with the stereo surround release of Blade Runner.
5.1 surround sound originated in 1987 at the famous French Cabaret Moulin Rouge. A French engineer, Dominique Bertrand used a mixing board specially designed in cooperation with Solid State Logic, based on 5000 series and including 6 channels. Respectively: A left, B right, C centre, D left rear, E right rear, F bass. The same engineer had already achieved a 3.1 system in 1974, for the International Summit of Francophone States in Dakar Senegal.
Creating surround sound
Surround sound is created in several ways. The first and simplest method is using a surround sound recording technique—capturing two distinct stereo images, one for the front and one for the back or by using a dedicated setup, e.g. an augmented Decca tree, the OCT (Optimized Cardioid Triangle) or XYtri configuration—and/or mixing-in surround sound for playback on an audio system using speakers encircling the listener to play audio from different directions. A second approach is processing the audio with psychoacoustic sound localization methods to simulate a two-dimensional (2-D) sound field with headphones. A third approach, based on Huygens' principle, attempts reconstructing the recorded sound field wave fronts within the listening space; an "audio hologram" form. One form, wave field synthesis (WFS), produces a sound field with an even error field over the entire area. Commercial WFS systems, currently marketed by companies sonic emotion and Iosono, require many loudspeakers and significant computing power.
The Ambisonics form, also based on Huygens' principle, gives an exact sound reconstruction at the central point; less accurate away from center point. There are many free and commercial software programs available for Ambisonics, which dominates most of the consumer market, especially musicians using electronic and computer music. Moreover, Ambisonics products are the standard in surround sound hardware sold by Meridian Audio In its simplest form, Ambisonics consumes few resources, however this is not true for recent developments, such as Near Field Compensated Higher Order Ambisonics. Some years ago it was shown that, in the limit, WFS and Ambisonics converge.
Finally, surround sound can also be achieved by mastering level, from stereophonic sources as with Penteo, which uses Digital Signal Processing analysis of a stereo recording to parse out individual sounds to component panorama positions, then positions them, accordingly, into a five-channel field. However, there are more ways to create surround sound out of stereo, for instance with the routines based on QS and SQ for encoding Quad sound, where instruments were divided over 4 speakers in the studio. This way of creating surround with software routines is normally referred to as "upmixing,", which was particularly successful on the Sansui QSD-series decoders that had a mode where it mapped the L ↔ R stereo onto an ∩ arc.
Mapping channels to speakers
|This section does not cite any references or sources. (January 2010)|
In most cases, surround sound systems rely on the mapping of each source channel to its own loudspeaker. Matrix systems recover the number and content of the source channels and apply them to their respective loudspeakers. With discrete surround sound, the transmission medium allows for (at least) the same number of channels of source and destination; however, one-to-one, channel-to-speaker, mapping is not the only way of transmitting surround sound signals.
The transmitted signal might encode the information (defining the original sound field) to a greater or lesser extent; the surround sound information is rendered for replay by a decoder generating the number and configuration of loudspeaker feeds for the number of speakers available for replay – one renders a sound field as produced by a set of speakers, analogously to rendering in computer graphics. This "replay device independent" encoding is analogous to encoding and decoding an Adobe PostScript file, where the file describes the page, and is rendered per the output device's resolution capacity. The Ambisonics and WFS systems use audio rendering; the Meridian Lossless Packing contains elements of this capability
Surround replay systems may make use of bass management, the fundamental principle of which is that bass content in the incoming signal, irrespective of channel, should be directed only to loudspeakers capable of handling it, whether the latter are the main system loudspeakers or one or more special low-frequency speakers called subwoofers.
There is a notation difference before and after the bass management system. Before the bass management system there is a Low Frequency Effects (LFE) channel. After the bass management system there is a subwoofer signal. A common misunderstanding is the belief that the LFE channel is the "subwoofer channel". The bass management system may direct bass to one or more subwoofers (if present) from any channel, not just from the LFE channel. Also, if there is no subwoofer speaker present then the bass management system can direct the LFE channel to one or more of the main speakers.
Low Frequency Effects (LFE) channel
The LFE is a source of some confusion in surround sound. The LFE channel was originally developed to carry extremely low "sub-bass" cinematic sound effects (with commercial subwoofers sometimes going down to 30 Hz, e.g., the loud rumble of thunder or explosions) on their own channel. This allowed theaters to control the volume of these effects to suit the particular cinema's acoustic environment and sound reproduction system. Independent control of the sub-bass effects also reduced the problem of intermodulation distortion in analog movie sound reproduction.
In the original movie theater implementation, the LFE was a separate channel fed to one or more subwoofers. Home replay systems, however, may not have a separate subwoofer, so modern home surround decoders and systems often include a bass management system that allows bass on any channel (main or LFE) to be fed only to the loudspeakers that can handle low-frequency signals. The salient point here is that the LFE channel is not the "subwoofer channel"; there may be no subwoofer and, if there is, it may be handling a good deal more than effects.
Some record labels such as Telarc and Chesky have argued that LFE channels are not needed in a modern digital multichannel entertainment system. They argue that all available channels have a full frequency range and, as such, there is no need for an LFE in surround music production, because all the frequencies are available in all the main channels. These labels sometimes use the LFE channel to carry a height channel, underlining its redundancy for its original purpose. The label BIS generally uses a 5.0 channel mix.
Surround sound specifications
The descriptions of surround sound specifications below distinguish between the number of discrete channels encoded in the original signal and the number of channels reproduced for playback. The number of channels reproduced for playback can be changed by using matrix decoding. A distinction is also made between the number of channels reproduced for playback and the number of speakers used to reproduce (each channel may refer to a group of speakers). The graphics to the right of each specification description represent the number of channels, not the number of speakers.
This notation, e.g. "5.1", reflects the number of full range channels; including a ".1" to reflect the limited range of the LFE channel.
E.g. 2 basic stereo speakers with no LFE channel = 2.0
5 full-range channels + 1 LFE channel = 5.1
It can also be expressed as the number of full-range channels in front of the listener, separated by a slash from the number of full-range channels beside or behind the listener, separated by a decimal point from the number of limited-range LFE channels.
E.g. 3 front channels + 2 side channels + an LFE channel = 3/2.1
This notation can then be expanded to include the notation of Matrix Decoders. Dolby Digital EX, for example, has a sixth full-range channel incorporated into the two rear channels with a matrix. This would be expressed:
3 front channels + 2 rear channels + 3 channels reproduced in the rear in total + 1 LFE channel = 3/2:3.1
Note: The term stereo, although popularised in reference to two channel audio, can also be properly used to refer to surround sound, as it strictly means "solid" (actually meaning three-dimensional sound) sound. However this is no longer a common usage and "stereo sound" is almost exclusively used to describe two channel, left and right, sound.
In accordance with ANSI/CEA-863-A
|Zero-based order within multi-channel
|Channel name||Color-coding on commercial receiver and cabling|
|6||6||Surround back left||Brown|
|7||7||Surround back right||Khaki|
|Front left||Center||Front right|
|Surround left||Surround right|
|Surround back left||Surround back right|
Sonic Whole Overhead Sound
Ambisonics is a series of recording and replay techniques using multichannel mixing technology that can be used live or in the studio and which recreates the soundfield as it existed in the space, in contrast to traditional surround systems, which can only create illusion of the soundfield if the listener is located in a very narrow sweetspot between speakers. Any number of speakers in any physical arrangement can be used to recreate a sound field. With 6 or more speakers arranged around a listener, a 3-dimensional ("periphonic", or full-sphere) sound field can be presented. Ambisonics was invented by Michael Gerzon.
Panor-Ambiophonic (PanAmbio) 4.0/4.1
PanAmbio combines a stereo dipole and crosstalk cancellation in front and a second set behind the listener (total of four speakers) for 360° 2D surround reproduction. Four channel recordings, especially those containing binaural cues, create speaker-binaural surround sound. 5.1 channel recordings, including movie DVDs, are compatible by mixing C-channel content to the front speaker pair. 6.1 can be played by mixing SC to the back pair.
Standard speaker channels
This table shows the various speaker configurations that are commonly used for end-user equipment. The order and identifiers are those specified for the channel mask in the standard uncompressed WAV file format (which contains a raw multichannel PCM stream) and are used according to the same specification for most PC connectible digital sound hardware and PC operating systems capable of handling multiple channels. While it is certainly possible to build any speaker configuration, there isn't a lot of commercially available movie or music content for alternative speaker configurations. Such cases, however, can be worked around by remixing the source content channels to the speaker channels using a matrix table specifying how much of each content channel is played through each speaker channel.
|Channel name||Identifier||Index||Flag||1.0 Mono[Note 1]||2.0 Stereo[Note 2]||3.0 Stereo||3.0 Surround||4.0 Quad||4.0 Surround||5.0||5.0 Side[Note 3]||6.0||6.0 Side[Note 3]||7.0||7.0 Side[Note 4]||7.0 Surround[Note 3]||9.0 Surround||11.0 Surround|
|Front Left of Center||SPEAKER_FRONT_LEFT_OF_CENTER||6||0x00000040||No||No||No||No||No||No||No||No||No||No||Yes||Yes||No||No||Yes|
|Front Right of Center||SPEAKER_FRONT_RIGHT_OF_CENTER||7||0x00000080||No||No||No||No||No||No||No||No||No||No||Yes||Yes||No||No||Yes|
|Front Left Height||SPEAKER_LEFT_HEIGHT||12||0x00001000||No||No||No||No||No||No||No||No||No||No||No||No||No||Yes||Yes|
|Front Right Height||SPEAKER_RIGHT_HEIGHT||14||0x00004000||No||No||No||No||No||No||No||No||No||No||No||No||No||Yes||Yes|
Any of the channel configurations above may include a low frequency effects (LFE) channel (the channel played through the subwoofer.) This would make the configuration ".1" instead of ".0". Most modern multichannel mixes will contain an LFE.
10.2 surround sound
10.2 is the surround sound format developed by THX creator Tomlinson Holman of TMH Labs and University of Southern California (schools of Cinema/Television and Engineering). Developed along with Chris Kyriakakis of the USC Viterbi School of Engineering, 10.2 refers to the format's promotional slogan: "Twice as good as 5.1". Advocates of 10.2 argue that it is the audio equivalent of IMAX[weasel words].
22.2 surround sound
22.2 is the surround sound component of Ultra High Definition Television, and has been developed by NHK Science & Technical Research Laboratories. As its name suggests, it uses 24 speakers. These are arranged in three layers: A middle layer of ten speakers, an upper layer of nine speakers, and a lower layer of three speakers and two sub-woofers. The system was demonstrated at Expo 2005, Aichi, Japan, the NAB Shows 2006 and 2009, Las Vegas, and the IBC trade shows 2006 and 2008, Amsterdam, Netherlands.
- 3D audio effect
- Dolby Surround
- Four-channel Compact Disc Digital Audio
- MPEG Surround
- Precedence effect
- Soundfield microphone
- Virtual surround
- For historical reasons, when using (1.0) mono sound, often in technical implementations the first (left) channel is used, instead of the center speaker channel, in many other cases when playing back multichannel content on a device with a mono speaker configuration all channels are downmixed into one channel. The way standard mono and stereo plugs used for common audio devices are designed ensures this as well.
- Stereo (2.0) is still the most common format for music, as most computers, television sets and portable audio players only feature two speakers, and the red book Audio CD standard used for retail distribution of music only allows for 2 channels. A 2.1 speaker set does generally not have a separate physical channel for the low frequency effects, as the speaker set downmixes the low frequency components of the two stereo channels into one channel for the subwoofer.
- THX 5.1 Surround Sound Speaker set-up. This is the correct speaker placement for 5.0/6.0/7.0 channel sound reproduction for Dolby and Digital Theater Systems.
- "Sony Print Master Guidelines"This plus an LFE is the correct speaker placement for 8-track Sony Dynamic Digital Sound.
- Channels Defined by Audiogurus
- Mick M Sawaguchi, and Akira Fukada (1999), Multichannel sound mixing practice for broadcasting. IBC Conference, 1999 Article
- Eliasson, Jens; Leijon, Ulrika; Persson, Emil (2001). "Multichannel cinema sound". p. 8. CiteSeerX: 10
.1 .1 .150 .854.
- Graham Healy, and Alan F. Smeaton (2009). Spatially augmented audio delivery: applications of spatial sound awareness in sensor-equipped indoor environments. In: ISA 2009: First International Workshop on Indoor Spatial Awareness, 18 May 2009, Taipei, Taiwan. ISBN 978-1-4244-4153-2. Abstract
- Christos Manolas, and Sandra Pauletto (2009). "Enlarging the Diegetic Space: Uses of the Multi-channel Soundtrack in Cinematic Narrative". The soundtrack, 2(1), August 2009, pp. 39–55, doi:10.1386/st.2.1.39_1, Print ISSN: 1751-4193 , Electronic ISSN: 1751-4207, Abstract
- Josephine Anstey, Dave Pape, Daniel J. Sandin (2000). Building a VR Narrative. Proc. SPIE, Vol. 3957, 370, doi:10.1117/12.384463. Abstract
- Mark Kerins (2006). "Narration in the Cinema of Digital Sound". University of Texas Press, The Velvet Light Trap, 58, Fall 2006, pp. 41–54. doi:10.1353/vlt.2006.0030. Abstract
- Marc S. Dantzker (2004). Acoustics in the Cetaceans Environment: A Multimedia Educational Package. Article
- Dan Gärdenfors (2003). Designing sound-based computer games". Digital Creativity, 14, 2, June 2003 , pp. 111–114. doi:10.1076/digc.188.8.131.52863. Abstract
- Timothy Roden, Ian Parberry (2005). Designing a narrative-based audio only 3D game engine. ACM International Conference Proceeding Series; Vol. 265, Proceedings of the 2005 ACM SIGCHI International Conference on Advances in computer entertainment technology, Valencia, Spain, pp. 274–277, ISBN 1-59593-110-4. Abstract
- Stephan Schütze (2003). "The creation of an audio environment as part of a computer game world: the design for Jurassic Park – Operation Genesis on the XBOX as a broad concept for surround installation creation". Cambridge University Press, Organised Sound, 8 : 171–180. doi:10.1017/S1355771803000074. Abstract
- Mike Jones (2000). "Composing Space: Cinema and Computer Gaming. The Macro-Mise En Scene and Spatial Composition". Article
- Durand Begault et al (2005). "Audio-Visual Communication Monitoring System for Enhanced Situational Awareness" 
- "Pink Floyd Astounds With 'Sound in the Round'". WIRED. May 12, 1967.
- "pink floyd". Retrieved 2009-08-14.
- Tomlinson, Holman (2007). Surround sound: up and running. Focal Press. p. 3,4. ISBN 978-0-240-80829-1. Retrieved 2010-04-03.
- Emil Torick (1998). "Highlights in the history of multichannel sound". Journal of the Audio Engineering Society, 46:1/2, pp. 27–31, February 1998 Abstract
- Ron Steicher (2003): The DECCA Tree—it's not just for stereo any more
- Spatial Sound Encoding Including Near Field Effect: Introducing Distance Coding Filters and a Viable, New Ambisonic Format
- Further Investigations of High Order Ambisonics and Wavefield Synthesis for Holophonic Sound Imaging
- Multichannel Music Mixing by Dolby Laboratories, Inc.
- Consumer Electronics Association standards: Setup and Connection
- "Updated: Player 6.3.1 with mp3 Surround support now available!".
- Creating 7.1 Audio
- "Multiple channel audio data and WAVE files". Microsoft.
- Josh Coalson. "FLAC - format".
- Avisynth.org, GetChannel
- Hydrogenaudio, 5.1 Channel Mappings
- "KSAUDIO_CHANNEL_CONFIG structure". Microsoft.
- Header file for OpenSL, containing various identifier definitions
|Wikibooks has more on the topic of: Surround sound|