EN 301 549 7.3 -- User Controls for Captions and Audio Description
What It Is
ETSI EN 301 549 v3.2.1 clause 7.3 requires that user controls to activate subtitling (captions) and audio description be provided at the same level of interaction -- that is, the same number of steps to complete the task -- as the primary media controls[1]. If Play/Pause, Volume, and Fullscreen are one tap away, Caption Toggle and Audio Description Toggle must be one tap away too. The clause is a parity-of-effort rule, not a "controls exist somewhere" rule.
The Mechanism Failure
A player where Play/Pause is a single tap but captions live four menus deep -- Settings > Accessibility > Closed Captions > Enable -- fails 7.3 even though every control technically exists. The failure is not availability; it is asymmetry of interaction depth. Users who depend on captions or audio description to parse the content at all have to spend measurably more effort than users who do not, and the population that needs these features overlaps heavily with the population least able to navigate nested settings menus. Clause 7.3 closes that gap by making the step count the normative requirement[2].
The Fix
Surface the caption toggle and the audio description toggle on the primary player toolbar at the same interaction depth as Play/Pause, Volume, and Fullscreen. Concretely:
- CC and AD toggles live on the primary control bar, not behind a
Settingsgear or an overflow menu. - Each toggle is reachable in the same number of discrete interactions as Play/Pause -- count the steps for both and compare.
- Toggles expose state (on/off, selected language/track) through the accessible name or
aria-pressed, so assistive technology can report the current state without opening a sub-panel. - Controls are reachable by keyboard, pointer, touch, and remote-control D-pad -- not gated on hover, which excludes touch and keyboard users.
- The last-used caption and description preference persists across sessions so the cost of enabling them is paid once, not per video.
How It Relates to WCAG
WCAG addresses whether captions and audio description exist for prerecorded and live media -- 1.2.2 Captions (Prerecorded), 1.2.5 Audio Description (Prerecorded), and 1.2.3[3]. WCAG 2.2 does not mandate where those controls live in the player UI. EN 301 549 clause 7.3 fills that gap by making the discoverability of the controls normative: the content can meet every WCAG 1.2.x criterion and still fail 7.3 if the toggle is buried four menus deep.