Glossary
accessibility

Caption

Definition

Synchronised text displayed with video content, showing dialogue and important sounds for deaf and hard-of-hearing viewers.

What are Captions?

Captions are text overlays synchronised with video content, displaying dialogue and relevant sounds (like [doorbell rings] or [music playing]). They enable deaf and hard-of-hearing viewers to understand video content fully.

Captions differ from subtitles – subtitles translate dialogue for viewers who can hear but don't understand the language, while captions also describe non-speech audio.

Why Captions Matter

Accessibility

Approximately 11 million people in the UK have hearing loss. Captions make video content accessible to them.

WCAG Requirement

Captions for video are a Level A requirement under WCAG. All pre-recorded video content needs captions.

Legal Compliance

Accessibility laws increasingly require captioned video content, especially for public sector and large organisations.

Broader Benefits

Many people use captions in sound-off situations – commuting, in offices, at night. Captions expand your audience.

Types of Captions

Closed Captions (CC)

Can be turned on or off by viewers. The standard for web video. Usually provided as a separate file.

Open Captions

Permanently burned into the video. Cannot be disabled. Sometimes used for social media.

Automatic Captions

Machine-generated by platforms like YouTube. Often inaccurate, especially for accents, jargon, or multiple speakers.

Caption Best Practices

Accuracy

Captions should match the spoken word exactly. Summarising or paraphrasing loses information.

Timing

Captions should appear and disappear in sync with speech. Viewers need time to read them.

Speaker Identification

When multiple people speak, identify who's talking: [Sarah] "Hello everyone."

Sound Descriptions

Include relevant non-speech sounds: [phone ringing], [applause], [ominous music].

Readability

2-3 lines maximum. Break at natural pause points. Ensure sufficient contrast.

Creating Captions

Manual Transcription

Most accurate method. Watch the video and type what you hear. Tools like Descript or Rev help.

Automatic Generation + Editing

Use YouTube's auto-captions as a starting point, then correct errors. Faster than starting from scratch.

Professional Services

Caption providers like Rev, 3Play Media, or Verbit offer professional captioning.

File Formats

  • SRT – Most common, widely supported
  • VTT – Web standard, used in HTML5 video
  • SCC – Broadcast standard

Platform-Specific

YouTube

Upload SRT/VTT files or use their captioning interface. Auto-captions need editing.

Social Media

Most platforms support caption files. Consider open captions for autoplay videos.

Your Website

HTML5 video supports VTT captions via the <track> element.

Want to Learn More?

Check out our in-depth guides on web design, SEO, and digital marketing.