Spatial Audio Production Workflows

Spatial Audio Production Workflows

Spatial audio production workflows define how a spatial idea moves from concept and studio work into the real conditions of an exhibition, installation, pavilion or immersive environment.

They include sketches, previews, simulations, real-time prototypes, file preparation, system integration, communication with project partners and the final tuning of sound in the actual space. Their task is not only to produce audio content, but to protect the spatial intention as it travels between people, tools, rooms, audiences and playback systems.

For KLING KLANG KLONG, this is a central part of sound scenography. A spatial audio workflow is successful when it keeps the relationship between composition, architecture, dramaturgy, interaction and listening condition alive from the first idea to the installed experience.

This Technical Note is the companion to Spatial Audio Systems & Renderers and Spatial Audio Formats & Methods. Those articles explain playback systems and technical formats. This one explains how spatial audio is developed, tested and translated through production.


IN BRIEF


A spatial audio production workflow connects spatial concept, content creation, technical setup, preview, system integration and on-site tuning.

Studio previews can help teams hear and discuss spatial behaviour before the final system exists. They are useful for orientation, communication and early decision-making, but they are not the final listening condition.

Real-time engines, simulations and binaural previews can support spatial planning, especially in VR, XR, interactive installations and complex media environments. Their value depends on knowing what they can show and what they cannot show.

The decisive test is the installed situation: architecture, acoustics, loudspeaker layout, visitor movement, neighbouring media, operational reliability and the way sound behaves over time.


From Studio To Site


Spatial audio rarely moves in a straight line from composition to playback. It passes through several states: concept, sketch, mock-up, preview, technical planning, content production, rendering, integration, calibration, on-site testing and maintenance.

Each state answers a different question. A studio sketch may ask whether a sound should move, surround, localise or withdraw. A prototype may ask how interaction affects timing or proximity. A system plan may ask how many playback zones are needed. An on-site test may reveal how reflections, spill, visitor routes or neighbouring exhibits change the result.

The workflow has to carry the spatial idea through all of these states without confusing them. A preview is not a system. A simulation is not the room. A render is not the experience. Each step is useful when its role is clear.

For exhibitions, museums and pavilions, this matters because the final audience does not listen from one ideal position. Visitors enter at different times, move through the space, turn away, gather in groups, pass between zones and encounter sound together with architecture, light, objects, video, text and other people.


Listening Behaviour And Audience Expectations


Spatial audio production is also shaped by how people have learned to listen.

Games, VR, headphones and interactive media have trained many audiences to understand sound as information in space. A distant cue can suggest direction. A change in reverb can imply scale. A rustle, tone, voice or moving layer can guide attention before anything is explained visually.

This listening literacy matters for exhibitions and installations. Visitors may not know the technical vocabulary of spatial audio, but many already understand that sound can behave: it can signal, respond, orient, warn, attract, conceal, reveal or make a space feel alive.

For production, this changes the question. The workflow has to test whether the intended sonic behaviours are legible in the actual encounter. Does a visitor understand where a sound belongs? Does a response feel connected to movement? Does the environment invite active listening, or does the sound become only atmosphere?


This is where the thought article How Games Trained Us To Listen becomes relevant. Spatial listening is no longer only a specialist concern; it is becoming part of how audiences read environments.


Previewing Spatial Behaviour


Before the final loudspeaker system is available, spatial behaviour often has to be sketched, tested and discussed through other means. Binaural previews, real-time engines, VR/XR prototypes and simulations can all help make spatial ideas audible during production.

Binaural previews are useful when a team needs to review direction, movement, proximity, layering or general spatial behaviour over headphones. They can support early design conversations, remote review, audio walks, interactive prototypes and communication with clients, exhibition designers, architects or media partners.

Real-time engines become useful when sound has to respond to movement, position, gesture, data, timing or changing visual scenes. In interactive installations, VR, XR and simulation-based workflows, they allow spatial behaviour to be tested as part of a living system rather than as a fixed timeline.

These tools can answer questions that a static audio file cannot answer. What happens when a visitor enters a zone? How does sound change with distance? How does a sonic event follow a moving object? How does the sound remain coherent when the route is not linear?

Their value depends on clarity about their limits. A binaural preview is still headphone-based. A simulation is still selective. A real-time prototype can show how a system behaves, but it cannot fully predict how sound will be perceived in the actual architecture.


Client And Partner Communication


Spatial audio can be difficult to communicate because it is relational. A still image cannot show how sound moves through a room. A stereo file cannot explain how visitors pass through several acoustic zones. A technical diagram cannot fully communicate atmosphere, distance or dramaturgy.

Production workflows therefore need translation tools. These can include binaural previews, listening sessions, annotated spatial scores, system diagrams, interaction prototypes, timing maps, show-control descriptions, zone plans and on-site demonstrations.

The goal is not to make every partner an audio specialist, but to make spatial decisions discussable. Architects, exhibition designers, curators, producers, media designers, technical planners and clients need to understand what role sound plays, what decisions are still open and what conditions the final experience depends on.

Clear communication protects the work. It helps prevent spatial audio from being reduced to a late technical layer, and it makes the connection between concept, content, system, audience behaviour and site easier to maintain.


Limits Of Previewing


Every preview leaves something out.

Headphones leave out the shared room. A stereo mock-up leaves out spatial depth. A renderer preview may leave out the final loudspeaker layout. A simulation may leave out reflections, materials, noise, bodies, neighbouring media or operational constraints. A studio playback may leave out scale.

This does not make previews weak. It makes them specific. A good workflow knows what each preview is for: discussion, orientation, timing, behaviour, content review, technical testing or stakeholder alignment. Problems begin when a preview is mistaken for the final condition. Spatial audio is especially sensitive to this because small changes in loudspeaker position, room acoustics, listening distance, visitor movement and neighbouring sound can change the experience significantly.


The useful question is not whether a preview is accurate in every respect, but which decision it can responsibly support.


Translation To The Final System


At some point, the work has to move from preview into the actual playback setup. This translation can involve rendering object-based material, decoding Ambisonics, preparing fixed multichannel files, configuring playback zones, integrating show control, mapping sensors, routing network audio or adapting content to installed loudspeakers.

This is where spatial intention meets technical constraint. Some movements may need to be simplified. Some sounds may need clearer localisation. Some zones may need less density. Some transitions may need more time. Some elements that worked in headphones may need another treatment in a room.

Translation is not a purely technical step. It is part of composition. The spatial score may need to adapt so that the intended behaviour remains audible under real conditions. This is why composition, system design and integration have to stay connected. The format, renderer and playback setup are chosen because they help the sound behave meaningfully in the space, not because they look impressive in isolation.



On-Site Testing And Tuning


On-site tuning is where spatial audio becomes real.

Only the actual space reveals how loudspeakers, architecture, materials, reflections, masking, visitor paths and media coexistence affect the work. A sound that feels clear in the studio may travel too far in a museum. A movement that feels precise in headphones may blur in a reverberant room. A layer that supports atmosphere may become too present when another exhibit is running.

Testing therefore has to include listening from different positions, walking the visitor route, checking thresholds between zones, balancing speech and music, adjusting levels over time, testing interaction and observing how the sound behaves together with other media.

For long-running exhibitions and installations, tuning also includes reliability. The system has to be understandable, maintainable and stable. Documentation, playback logic, naming, routing, calibration settings and handover material become part of the spatial audio workflow.

The final result is not the preview made louder. It is the spatial idea rebalanced for the room.


How We Work


We begin by defining the role of sound in the experience. Does it guide attention, create atmosphere, connect rooms, respond to movement, support narrative, form a spatial score or give an installation its sonic body?

From there, we choose the workflow that can carry that role through the project. This may include binaural previews, real-time prototypes, simulation, multichannel sketches, object-based workflows, spatial scores, system diagrams, interaction tests or on-site mock-ups.

Previews are used to make spatial decisions audible and discussable, while their limits remain clear. The final playback condition stays the reference point.

During integration, we test how the work behaves in relation to architecture, acoustics, loudspeaker layout, media systems, visitor movement and long-term operation.

The aim is a workflow in which sound does not become detached from space. It remains part of the scenographic structure from studio to site.



RELATED PRACTICES


Spatial audio production workflows connect technical planning with sound scenography, spatial composition, interaction and the practical realities of exhibitions and installations.


Spatial Audio Systems & Renderers


Spatial Audio Formats & Methods


Spatial Audio


Sound Scenography


Spatial Score


Exhibition Sound Design


Immersive Sound


Interactive Sound


Generative Sound



‍FAQ


What is a spatial audio production workflow?

A spatial audio production workflow is the process that connects concept, composition, preview, technical planning, system integration and on-site tuning for a spatial listening experience.


Why are previews important in spatial audio?

Previews help teams hear and discuss spatial behaviour before the final system is available. They support early decisions, communication and testing, but they do not replace the final playback condition.


Can binaural previews replace a loudspeaker installation?

No. Binaural previews are useful over headphones, especially for early review or VR/XR workflows, but they cannot fully predict how sound behaves in a shared physical space with loudspeakers, acoustics and visitor movement.


How are real-time engines used in spatial audio production?

Real-time engines can test sound behaviours that respond to movement, position, interaction, data or changing visual scenes. They are especially useful for interactive installations, VR, XR and simulation-based workflows.


Why is on-site tuning necessary?

On-site tuning is necessary because architecture, materials, loudspeaker placement, reflections, neighbouring media and visitor movement change how spatial audio is perceived.


When should spatial audio workflow planning begin?

It should begin early, together with architecture, exhibition design, media planning and interaction design. This makes it possible to treat sound as part of the spatial concept rather than as a late production layer.


What do games have to do with spatial audio workflows?

Games have trained many audiences to read sound as spatial information: direction, distance, danger, proximity, atmosphere and response. For exhibitions and installations, this makes it useful to test not only how a system plays back sound, but how visitors understand and act on sonic behaviour.




Planning a spatial audio installation, exhibition or immersive environment?

We help develop spatial audio workflows that carry the sound from concept and studio preview into the real conditions of the site.

Start a project