Formats & Standards
Dialogue Normalization (Dialnorm)
Dialnorm is a metadata parameter embedded in Dolby Digital (AC-3), Dolby Digital Plus (E-AC-3), and AC-4 audio bitstreams that tells a decoder how loud the dialogue content is, enabling automatic playback attenuation for loudness consistency. The decoder attenuates the audio in the digital domain before D/A conversion to normalize dialogue loudness to a reference target, preventing volume jumps between programs and advertisements.
What Dialnorm Does
Dialnorm is a metadata parameter embedded in Dolby Digital (AC-3), Dolby Digital Plus (E-AC-3), and AC-4 audio bitstreams. It tells a decoder how loud the dialogue content is, enabling automatic playback attenuation for loudness consistency across different programs and platforms. When a decoder receives a Dialnorm value, it attenuates the audio in the digital domain before D/A conversion, reducing playback gain to normalize dialogue loudness to a target reference level.
The Scale and Attenuation Mechanism
Dialnorm is expressed in two compatible notational conventions in the literature. Some references use a negative dB scale ranging from −1 to −31, where the decoder calculates attenuation in dB using the formula: attenuation (dB) = 31 + dialnorm-value. For example, a Dialnorm setting of −21 results in 10 dB of attenuation (31 + (−21) = 10). Other references describe Dialnorm on a positive integer scale from 1 to 31, where 1 corresponds to maximum attenuation (roughly −30 dB gain) and 31 corresponds to no attenuation (0 dB, unity gain). Both conventions represent the same underlying parameter; the choice of notation is a matter of documentation preference.
The reference target level for dialogue is −31 dBFS Leq(A) within the Dolby Digital decoder itself. The decoder automatically applies attenuation based on the embedded Dialnorm value to bring dialogue loudness toward this reference.
Standards and Measurement: ATSC A/85 and ITU-R BS.1770
In broadcast television, Dialnorm values are typically set by measuring dialogue loudness using the ITU-R BS.1770 standard, which defines algorithms for measuring audio loudness using K-weighting, mean-square calculation, gating, and true-peak measurement. This produces integrated loudness in LUFS/LKFS units.
ATSC A/85 (published November 2009) specifies that dialogue in U.S. digital television broadcasts should be normalized to −24 dB LKFS with a tolerance of ±2 dB. Under ATSC A/85's fixed-loudness method, if dialogue measures −24 LKFS using BS.1770 measurement, the encoder sets Dialnorm to 24, allowing decoders to normalize playback toward the broadcast target level. Note that this −24 LKFS ATSC broadcast target is distinct from the −31 dBFS internal decoder reference level mentioned above. They represent different stages in the encoding and playback pipeline.
EBU R128, the European equivalent loudness standard, measures and normalizes to −23 LUFS. Both R128 and ATSC A/85 rely on the same underlying ITU-R BS.1770 measurement engine. However, 'Dialnorm' as a named metadata field is specifically a Dolby Digital/E-AC-3/AC-4 bitstream parameter. EBU R128 itself does not define or require Dialnorm; it defines a loudness target and tolerance framework that can be applied independent of Dolby's proprietary metadata field.
Real-World Consequences of Incorrect Dialnorm
When Dialnorm values are incorrectly set or mismatched between the encoder and actual content loudness, decoders apply wrong attenuation, resulting in audibly inconsistent playback levels that are either too loud or too quiet. A common scenario occurs when broadcasters leave Dialnorm at factory defaults without measuring actual dialogue levels. For example, if dialogue actually measures −19 dBFS but Dialnorm remains set to −27 dBFS, the decoder applies 8 dB too much gain, creating an 8 dB volume spike. This is precisely the loudness jump that Dialnorm was designed to prevent.
In practice, Dialnorm values are often set arbitrarily rather than through proper measurement, with encoders using default settings without running content through loudness measurement tools. This leads to widely varying Dialnorm values even for identical content on different platforms. Streaming workflows are particularly vulnerable to Dialnorm loss because content passes through fragmented chains of suppliers, transcoders, aggregators, and ad-tech partners where each re-encode can strip Dialnorm metadata or alter it.
Streaming Platforms and Transparency
Dolby Digital Plus (E-AC-3) is used in some Netflix, Disney+, Apple TV+, and Amazon Prime Video streams (particularly for 5.1 and Atmos-via-JOC audio tracks) and carries Dialnorm metadata for loudness control. Not all content or all playback paths on these platforms use E-AC-3. Many also serve AAC or lossless formats without Dialnorm. Industry commentary has noted loudness jumps as among the reasons cited for viewer abandonment of FAST (Free Ad-Supported TV) channels and streaming platforms, though authoritative quantitative audit data on the magnitude of this problem remains limited.
By design, Dialnorm-based attenuation operates automatically in the decoder without requiring explicit viewer action. Most consumer AVRs, soundbars, and TVs do not expose a user-facing Dialnorm override in their standard menus, though some professional and prosumer decoders do allow Dialnorm override or dynamic-range-compression settings that indirectly affect this behavior.
Sources
- [1]
- [2]
- [3]
- [4]
- [5]
- [6]