WCAG (Level AAA) SC 1.2.9 Audio-only (Live) (w3.org)
Issue description
WCAG 1.2.9, “Audio-only (Live)” recognizes that live audio content, such as webcasts, online meetings, and presentations, needs to be accessible to people who are deaf or hard of hearing. It requires that a text-based alternative be provided for any live audio-only content. This is important because:
- Real-time access: People who cannot hear rely on visual alternatives to understand audio information. In live situations, this means providing a real-time text equivalent of what is being said.
- Inclusion in live events: Without a text alternative, people with hearing disabilities would be excluded from understanding and participating in live audio events.
This guideline typically requires real-time transcription, which can be achieved through:
- Live captioning services: A trained operator listens to the audio and types what they hear, which is then displayed as text.
- Automatic Speech Recognition (ASR): AI-powered software that converts speech to text in real-time.
While this is a Level AAA criterion, meaning it’s a higher level of accessibility, it’s becoming increasingly important as more events and communications move online. Providing a text alternative for live audio ensures that everyone can access and participate in these events, regardless of their hearing abilities.
Related requirements
The following WCAG source criteria are often related to this as well. They can provide additional insights into specific challenges you may be encountering.
- WCAG (Level A) SC 1.2.3 Audio Description or Media Alternative (Prerecorded)
- WCAG (Level AA) SC 1.2.4 Captions (Live)
- WCAG (Level AA) SC 1.2.5 Audio Description (Prerecorded)
- WCAG (Level AAA) SC 1.2.6 Sign Language (Prerecorded)
- WCAG (Level AAA) SC 1.2.7 Extended Audio Description (Prerecorded)
- WCAG (Level AAA) SC 1.2.8 Media Alternative (Prerecorded)
Who this issue impacts
Follow the links for additional information on user impairments:
Suggestions for remediation
Remediating WCAG 1.2.9, “Audio-only (Live)” involves providing real-time text alternatives for live audio content, making it accessible to people who are deaf or hard of hearing. Here’s how:
Choose a transcription method
- Human captioner: A trained captioner (often a stenographer) listens to the audio and types what they hear, which is then displayed as text on the screen. This is generally the most accurate method but can be expensive.
- Automatic Speech Recognition (ASR): AI-powered software that automatically converts speech to text. This is becoming increasingly accurate and affordable but may require human monitoring and editing for optimal quality.
Implementation options
- Integrated with streaming platform: Many live streaming platforms (e.g., YouTube, Zoom, Microsoft Teams) offer built-in or integrated captioning features, including options for manual captioning, ASR, or third-party integrations.
- Third-party captioning services: Several companies specialize in providing real-time captioning services, offering human captioners or ASR solutions.
Displaying the transcript
- On-screen display: The transcript can be displayed directly within the video player or on a designated area of the screen.
- Separate window: The transcript can be displayed in a separate window, which can be helpful for users who prefer to customize the size and position of the text.
Best practices
- Accuracy: Strive for high accuracy in the transcript, minimizing errors and delays.
- Synchronization: The transcript should be synchronized with the audio, appearing on the screen as close to real-time as possible.
- Speaker identification: If there are multiple speakers, identify each speaker in the transcript.
- Formatting: Use clear and easy-to-read fonts, colors, and positioning.
Testing
- Monitor quality: If using ASR, monitor the transcript in real-time to catch and correct any errors.
- User feedback: Gather feedback from users with hearing disabilities to ensure the transcript is effective and meets their needs.
Example
In a live podcast interview, the transcript might look like this:
[Host]: Welcome to our show! Today we’re talking about…
[Guest]: It’s great to be here. I’m excited to discuss…
By providing a real-time text alternative for live audio content, you ensure that everyone, regardless of their hearing abilities, can access and participate in the experience. This promotes inclusivity and makes your content accessible to a wider audience.

