A voice-over brief is a short, complete order/specification that enables the voice actor (and the studio) to deliver the right voice, tone, and technical format without follow-up questions.
The key points in brief
- A good voice-over brief is less about 'feeling' and more about clear decisions: goals, audience, pronunciation, tempo, and delivery format.
- The most common mistakes are names/pronunciation, length requirements, where the audio will be used, and who approves.
- If you want to get it done quickly: fill out the template, attach the script + reference, and write what is 'must' vs 'nice to have'.
What a voice-over brief needs to contain (and why)
Agency briefs often break down at the same points: you're in a hurry, you don't want to overload anyone with details, and you leave gaps that still must be filled later. The result becomes ping-pong: 'Which pronunciation?', 'Should it sound more salesy?', 'What file type do you want?', 'Is it for social or TV?'
To avoid that, your voice-over brief needs to cover three things:
- Message and goals: What should the audience understand or do afterwards? (Not campaign copy, just decisions.)
- Voice interpretation: Tone, energy, tempo, and any references. Preferably described with words that can be recorded: 'calm, factual, a little smile' beats 'premium and modern'.
- Practical and delivery: Manuscript status, pronunciation, timings, version needs, usage rights, file formats, and who approves.
The point is simple: if something is unclear in the brief, it will still become a question. The difference is whether the question comes before recording (cheap) or after (expensive and slow).
Common mistakes I see in agency orders
This is the kind of thing that recurs in real projects, even when the producer 'knows how to do it'. Not because people are sloppy, but because voice-over often ends up late in the chain.
- Script without status: Is it locked or 'almost ready'? If the script is still moving, you need to say so. Otherwise everyone plans as if it's final.
- No clear module length: '30 seconds' isn't enough if the film also has to be cut to 15/6 or if the voice must sit on exact timecodes.
- Pronunciation and names left to chance: Product names, place names, person names, English terms. It’s quick to write phonetically or link to a reference. It takes longer to redo.
- Distribution is not mentioned: Organic social, paid social, web, radio, events, internal training. It affects both expression and rights. Say where it will be used, even if you don't have all the details.
- Too many references: Three links with 'something in that direction' often have the opposite effect. Choose one reference for tempo/energy and one for tone (if needed).
- Unclear who decides: If five people are going to weigh in but no one owns the decision, the final round will always be late. State who approves.
The payoff when you cover the points above is concrete: fewer rounds, less risk of retakes, and a greater chance that the first take is 'right enough' to cut directly.
Brief template
Fill this in before sending the order. It takes a few minutes. It is not pretty. It is complete. The goal is to capture what would otherwise be missed when you are in a hurry.
- Project: [name / working title]
- Format and channel: [e.g. explainer for web, social ad 15s, e-learning module, IVR]
- Target audience: [who is listening?]
- Purpose: [what should the listener understand or do afterwards?]
- Script status: [final / near-final / draft — attach file]
- Tone/energy: [3–5 words, e.g. "calm, factual, a little smile"]
- Tempo: [calm / normal / fast]
- References: [1–2 links to tone/tempo examples]
- Pronunciation: [list product names, place names, abbreviations, English terms — phonetic where needed]
- Timings: [exact length per version, any plus/minus, pauses for graphics?]
- Versions needed: [one version / A-B text / different CTAs / different lengths]
- Delivery format: [WAV/MP3, mono/stereo, sample rate, split per sentence/section?]
- Usage and rights: [channels, geography, period — or "unclear, likely X"]
- Deadline: [date]
- Approver: [name — the single person who owns the final decision]
Copy this into an email, a project tool or use it as a checklist before sending the order.
If you want to hear voice options before you lock the brief, check out the demos.
Process / checklist
- 1) Set the framework What is the project (format and channel)? When do you need delivery? Who approves?
- 2) Send manuscript + status Attach the manuscript, and write 'final' or 'can be changed'. If there are timecodes: attach them.
- 3) Define voice interpretation Choose 3–5 words for tone ('factual, warm, confident'). Specify tempo (calm/normal/fast) and energy (low/medium/high). Include max 1–2 references.
- 4) List what must not go wrong Pronunciation (phonetic), names, numbers, units of measurement, trademark terms. Mark sensitive lines in the manuscript.
- 5) State how you want the delivery File format (WAV/MP3), mono/stereo, sample rate (e.g., 48kHz), any level requirements you have, and whether you want split per sentence/section.
- 6) Version needs One version, or several (e.g., 'A/B text', different CTAs, different lengths). State it now, otherwise it will be a new order later.
- 7) Confirm usage Where will the material be used and roughly for how long? If you are unsure: write 'likely X, may be Y'. It is better than silence.
Next steps
Fill in the template above with what you know and mark the rest as "unclear". Send the brief and manuscript in the same thread. Then you can start working right away without needing to chase details afterwards.
If you want me to review the brief before recording (to catch those classic misses) the fastest way is to send it via contact.
FAQ
How detailed does a voice-over brief need to be?
Enough for someone to be able to record without guessing: goals, tone/tempo, pronunciation, delivery format, and deadline. Everything else is secondary.
We haven't fully decided the tone. What do we write then?
Write two clear options and what determines the choice (e.g., 'more factual if compliance weighs most, more light if reach weighs most'). Attach a reference for each option.
How do we avoid missing important info when we're in a hurry?
Use the template as a checklist and fill in 'unclear' where you don't know. The important thing is that pronunciation, usage, version needs, and delivery specs are not left blank.
Do we need to specify file formats and technical details?
Yes, if you have requirements. If not: write 'standard for video (48kHz WAV, mono)'. Then you won't have to guess afterwards.
What is the minimum we can send to get started?
Manuscript + deadline + desired voice description in 3–5 words + where it will be published. But expect follow-up questions if pronunciation and delivery format are missing.
Read more: