EN 301 549 7.1 -- Caption Processing Technology

For reference only -- not part of a11ybot's automated checks.

What It Is

ETSI EN 301 549 v3.2.1 clause 7.1 "Caption processing technology" is an obligation on the ICT -- the player, decoder, transcoder, set-top box, conferencing client, or streaming pipeline -- not on the person who authored the captions. Where ICT displays video with synchronised audio and the source carries captions, the technology must decode those captions, keep them in sync with the audio, preserve them across recording and transcoding, and let the user style them[1].

The clause decomposes into four sub-requirements: 7.1.1 Captioning playback (the ICT must actually display the captions when they are present), 7.1.2 Captioning synchronization (audio-to-caption drift within 100 ms for both recorded and live material), 7.1.3 Preservation of captioning (transmit, convert, and record operations must not strip caption data), and 7.1.4 Captions characteristics (the user must be able to change background, font, and opacity, except for bitmap captions that cannot be re-rendered)[1].

Why It Matters

Captions are authored upstream, but they only reach users if every component in the delivery chain passes them through. The failure modes are mechanistic: an HTML5 player that does not honour <track kind="captions">; a transcoding pipeline that re-encodes the video elementary stream and drops the CEA-608/708 sidecar; an ingest service that accepts a file with embedded captions but publishes only the video track; a screen-share tool that records the window pixels and loses the caption layer entirely; a smart TV that renders HD video but ignores TTML/IMSC subtitle tracks. Clause 7.1 puts the obligation on the technology so that an author's caption file cannot silently disappear between encoder and viewer[1].

How It Relates to WCAG

WCAG 2.2 1.2.2 Captions (Prerecorded) and 1.2.4 Captions (Live) require the author to provide captions for synchronised media. Clause 7.1 requires the technology to transport them. The two are complementary and a product can satisfy one while failing the other: a site can ship a valid WebVTT file and still fail 7.1 if its player cannot render it, and a player can be fully 7.1-conformant and still fail WCAG 1.2.2 if the content owner never authored captions. EN 301 549 clause 7.1 covers the gap WCAG does not -- the obligation on the decoder, not the content.

Practical Implications

  • Support at least one standard caption format end-to-end. For HTML5 video, use <track kind="captions" src="captions.vtt"> and a player that honours WebVTT cues[2].
  • For broadcast-adjacent pipelines, preserve CEA-608/CEA-708 (US closed captions), DVB Subtitles (European digital broadcast), or TTML/IMSC (international streaming) through every transcode step. Re-mux rather than re-encode where possible, and verify captions survive the round-trip.
  • Audio-to-caption drift must stay within 100 ms during seek, playback-rate changes, and live ingest (clause 7.1.2)[1].
  • Recording, download, and clip-export features must preserve caption data (clause 7.1.3)[1].
  • User styling controls for background, font, and opacity must be exposed unless the caption source is a non-editable bitmap (clause 7.1.4)[1].
  • Caption rendering must not be occluded by overlays, letterboxing, or ad insertions.

Related Clauses

Sources

  • ETSI EN 301 549 v3.2.1, clause 7.1 Caption processing technology
  • W3C WebVTT 1 (the format HTML5 <track> is required to support)
  • SMPTE ST 2052-1 (IMSC1)