Voice-Over Pricing Models and What's Included

Professional voice-over pricing varies by model: per finished minute, per project, or per word. Understanding which model applies to your need, and what the fee actually covers, prevents confusion when comparing quotes.

What It Is

Voice-over pricing covers two distinct elements that are often confused. The first is the production fee: the voice actor's time, preparation, recording in a professional studio, basic editing (EQ, compression, noise removal), and delivery in the format you need. The second element — usage rights — is priced separately and depends on where the recording will be used (see how usage rights affect pricing for details).

This article focuses on production models and how to compare them fairly. When you receive two quotes with different numbers, the gap often comes from comparing different pricing structures, not from different quality or professionalism.

How It Works

Per-Finished-Minute Model

You pay based on the length of the final audio file delivered to you. If your completed script runs 2 minutes and 30 seconds, you pay the per-minute rate multiplied by that duration.

When it's used: E-learning, corporate videos, explainers, internal training content, documentaries.

Why it works here: The client's budget need is clear ("we need 5 minutes of narration") and so is what they're paying for. A per-minute model scales predictably.

What affects the price: Accents or dialects that require a specific regional voice, script difficulty (names, numbers, technical terms), and the number of revision rounds included before delivery.

Pros: Straightforward; you know the final cost upfront if the script length is locked.

Cons: Doesn't account for whether recording was quick or required many takes. A 2-minute script that flows naturally costs the same as a 2-minute technical script with proper noun titles, even if the latter required 3x more recording sessions.

Per-Project Model

You pay a flat fee for the entire project, regardless of script length or revision rounds within defined limits. The fee accounts for preparation, all recording sessions, editing, and delivery.

When it's used: Commercials, advertising campaigns, broadcast spots, bespoke narration, one-off corporate videos.

Why it works here: The project scope varies too much to use time-based metrics. An 8-second radio spot can take as long to record and perfect as a 5-minute documentary voice-over.

What affects the price: Exclusivity period (whether you retain the voice exclusively during a campaign), usage channel (internal vs paid advertising), and delivery timeline (rush projects cost more).

Pros: You know the full cost upfront; no surprise charges for re-records or revisions within the agreed scope.

Cons: The supplier carries more risk if the script is complex or revisions exceed expectations. This is why per-project quotes often require a locked script before recording begins.

Per-Word Model

You pay based on the word count of the script. A 500-word script costs a set amount per word; a 1200-word script costs proportionally more.

When it's used: E-learning modules, IVR (phone systems), long-form narration, training content, documentaries.

Why it works here: Content is often standardized. A 500-word module typically requires 3–4 minutes of recorded audio, making word count a reliable proxy for production time.

What affects the price: Whether the script contains difficult terminology or proper nouns, language/accent combination, and turnaround speed.

Pros: Scales fairly for longer content; you can estimate cost before starting.

Cons: A 500-word script with heavy technical terms or complex pronunciation may require more takes than an easy 700-word conversational script, but you'd pay less.

When It Matters

The choice of pricing model affects your budget in three key scenarios:

Scenario 1: Short, High-Difficulty Content

A 30-second commercial with brand names and specific tonality might take 2 hours to record and perfect. On a per-minute model, you'd be paying for 0.5 minutes of output but the artist spent the full time. On a per-project model, you're paying for the actual production time, which is transparent and fair.

Scenario 2: Long Content with Variable Complexity

A 20-minute e-learning course with a mix of easy sections and technical modules. If priced per-minute, the entire project has a fixed cost. If priced per-word, a 5000-word script is straightforward to quote. If priced per-project, the supplier factors in that some sections will require more takes than others.

Scenario 3: Revisions and Retakes

Per-project quotes usually include a set number of revision rounds (typically 2–3). Once exceeded, additional re-records are billed separately. Per-minute or per-word quotes sometimes treat revisions differently — some suppliers include unlimited revisions up to a certain percentage of the script; others charge per session. This is where miscommunication happens.

What You Should Do

Follow this process when requesting and comparing quotes.

Step 1: Define Your Script

Before approaching a voice-over artist, have your script finalized or nearly final. Word count, estimated finished length (if you know it), and any notes on tone, pacing, or dialect should be clear. Locked scripts result in more accurate quotes and faster delivery.

Step 2: Specify the Pricing Model You're Requesting

Don't ask three suppliers for quotes without clarifying which model you need. Instead, say: "I need a quote on a per-minute basis for 3–4 minutes of narration" or "I need a flat project fee for a 15-second commercial spot."

If you're unsure which model makes sense for your project, explain the scope (script length, content type, revision tolerance) and ask the supplier to propose the most sensible approach.

Step 3: Note What's Included in the Production Fee

Ask the supplier to confirm:

  • Preparation time: research into the content, any pronunciation guides needed, script markup for pacing.
  • Recording session(s): how many takes are typical for this type of content, and whether additional sessions beyond the initial estimate are included or billed separately.
  • Basic editing: normalization, EQ, noise reduction, silence trimming, and basic mastering.
  • Delivery format: WAV, MP3, both; mono or stereo; sample rate and bit depth (e.g., 48 kHz / 24-bit for broadcast, 44.1 kHz / 16-bit for web).
  • Revision rounds: how many re-records or edits are included before additional work is billable.

Step 4: Separate Production from Rights

Ask for usage rights to be quoted separately, even if they're bundled in the final invoice. You should see something like:

  • Production fee: [amount]
  • Usage rights (internal, web, or specified channel and period): [amount]
  • Total: [amount]

This prevents comparing an internal-use quote with a paid-advertising quote as if they were equivalent.

Step 5: Normalize Multiple Quotes

Once you have quotes, create a simple comparison table:

Aspect Supplier A Supplier B Supplier C
Model Per-minute Per-project Per-word
Script Length 3 min 45 sec 3 min 45 sec 5000 words
Production Fee $X $Y $Z
Revisions Included 2 rounds unlimited (within scope) 1 round
Delivery Format WAV 48/24 MP3 + WAV WAV 48/24
Usage Rights Internal Internal + web organic Internal
Total Cost $X+Y $Y $Z+Y

Now you can compare apples to apples. If costs still differ significantly, ask suppliers to explain the gap: Is it based on turnaround time, voice selection, or studio equipment? Understanding the reason helps you decide what offers actual value.

Step 6: Confirm Everything in Writing

Before recording begins, confirm:

  • Final script (word count and length estimate).
  • Pricing model and total production fee.
  • Revision limits and what counts as a revision vs. a new session.
  • Delivery timeline and format.
  • Usage rights details (channel, geography, period).

Written confirmation prevents disputes and ensures both parties understand the scope.

Conclusion

Voice-over pricing models exist because production work varies. A per-minute model makes sense for predictable, modular content like e-learning. A per-project model makes sense for one-off work where scope can't be reduced to a single metric. Per-word works for standardized, longer content.

The key to fair pricing is understanding which model applies to your project, confirming what the production fee includes, and separating production cost from usage rights. Once you can see these elements clearly, comparing quotes becomes straightforward and decisions based on budget rather than confusion become possible.

For examples of how I structure pricing in practice, see my rates page. If you're comparing quotes and want a second opinion on whether the terms are clear, contact me.

FAQ

Why can two per-minute quotes differ significantly if the script is the same length?

Per-minute quotes can vary based on voice selection, accent/dialect (some require more specialized talent), script difficulty, inclusion of revision rounds, delivery timeline, and studio overhead. A British narrator might cost more than a Nordic English speaker. A technical script may command a higher per-minute rate than conversational narration. Ask suppliers to explain the difference.

If I provide a script that's 2000 words, how many minutes of audio should I expect?

Typical professional narration runs 120–150 words per minute depending on pace, genre, and whether there are pauses for emphasis or transitions. A 2000-word script usually produces 13–17 minutes of finished audio. If you're unsure, ask the supplier for an estimate based on your specific script.

Do revision rounds mean unlimited changes to the recording?

No. Revision rounds typically mean re-recordings of sections due to pronunciation, emphasis, or minor edits within the locked script. If you change the script itself — rewriting sentences, adding new passages, or removing sections — that's usually a new session and billed separately. Confirm the boundary in your quote.

What if the final audio runs longer or shorter than the script estimate?

Speak to the supplier before recording. A script can run longer or shorter depending on pacing, pauses, and transitions. If you have specific timing requirements (the audio must fit a 3-minute video), lock that down beforehand so the supplier can adjust pacing during recording rather than re-recording afterward.

Can I get a discount for ordering multiple projects or longer content?

Many suppliers offer tiered pricing: higher volume or longer total duration may reduce the per-minute or per-word rate. It's worth asking if you have multiple projects in the pipeline or are considering extending a project. However, don't expect large discounts for single projects; voice-over is skilled labor and rates reflect that.

What format should I request for delivery?

For broadcast or professional production: WAV, 48 kHz, 24-bit, mono (unless stereo is specified). For web, corporate videos, or e-learning: WAV or MP3, 44.1 kHz or 48 kHz, 16-bit or 24-bit, mono. Ask your video editor or platform what they need; the voice-over artist can deliver accordingly.


Read more: