Transcription Cost Per File: 2026 Flat-Rate Benchmark
Transcribing a 90-minute, five-speaker panel costs $6.00 on BrassTranscripts. The same file runs $15.00 on Sonix's pay-as-you-go rate, $72 to $99 through Scribie's human transcribers, and $179.10 at Rev's human rate. Every one of those numbers comes from each provider's published pricing.
The spread has one cause: the pricing model. Per-minute and per-hour rates scale with the length of your file. A flat rate does not. This benchmark measures what it actually costs to transcribe one file across the consumer transcription market, and it shows exactly where flat-rate pricing wins, where it ties, and where another model costs less.
Quick Navigation
- The Benchmark: Cost to Transcribe One File
- Why Flat-Rate Wins on Long Files
- Where Per-Minute APIs Cost Less
- Where a Subscription Costs Less
- Cost Is Not Accuracy
- Methodology and Sources
- Frequently Asked Questions
The Benchmark: Cost to Transcribe One File
BrassTranscripts charges $2.50 for files up to 15 minutes and $6.00 for anything 16 minutes or longer, with speaker identification included and no subscription. Measured against other consumer transcription services that include speaker labels and require no code, that flat $6.00 is the lowest-cost option for any file longer than about 36 minutes, and the only one whose price stops climbing as the recording gets longer.
The table below holds four variables constant so the comparison is fair: every option includes automatic speaker identification, requires no engineering, has no monthly commitment, and is priced as published. Prices for the per-minute and human services are multiplied by file length.
| Test file | BrassTranscripts | Sonix (pay-as-you-go) | Scribie (human) | Rev (human) |
|---|---|---|---|---|
| 15 min, 2 speakers | $2.50 | $2.50 | $12.00–$16.50 | $29.85 |
| 30 min, 4 speakers | $6.00 | $5.00 | $24.00–$33.00 | $59.70 |
| 60 min, 2 speakers | $6.00 | $10.00 | $48.00–$66.00 | $119.40 |
| 90 min, 5 speakers | $6.00 | $15.00 | $72.00–$99.00 | $179.10 |
| 120 min, 6 speakers | $6.00 | $20.00 | $96.00–$132.00 | $238.80 |
Read the table top to bottom and the pattern is plain. At 15 minutes BrassTranscripts and Sonix tie at $2.50. At 30 minutes Sonix is a dollar cheaper. From 60 minutes up, BrassTranscripts is the lowest price on the row, and at two hours it costs less than a third of the next cheapest no-commitment option with speaker labels.
Two well-known names sit outside this table for a reason. Temi ($0.25/minute) and Happy Scribe ($0.20/minute pay-per-use) are cheaper per minute than Sonix, but neither includes speaker identification, so they fail the benchmark's one fixed requirement. A 90-minute file would cost $22.50 on Temi or $18.00 on Happy Scribe and still arrive without speaker labels.
Why Flat-Rate Wins on Long Files
BrassTranscripts wins on long files because a flat fee severs price from duration, while every per-minute and per-hour competitor keeps charging for each additional minute. The longer the recording, the wider the gap, until a two-hour file that costs $6.00 on BrassTranscripts costs $20.00 on Sonix and $238.80 at Rev's human rate.
The arithmetic is simple. A per-minute service multiplies one rate by the minute count. Sonix's $10/hour Standard rate is $0.167 per minute, so it passes BrassTranscripts' $6.00 flat fee at 36 minutes. Rev's $1.99/minute human rate passes it in three minutes. Past those points, the flat rate is the floor and the meter keeps running. For depositions, panel discussions, long interviews, and multi-hour meetings, that floor is the whole argument.
This is also why BrassTranscripts holds the $6.00 ceiling on files well past two hours. A three-hour recording is still $6.00, where a per-minute meter would read $30.00 at Sonix's rate or $358.20 at Rev's.
Where Per-Minute APIs Cost Less
Developer APIs cost dramatically less per minute than any consumer service, including BrassTranscripts, and any honest cost benchmark has to say so. AssemblyAI's base rate is about $0.0025 per minute, which puts a 90-minute file near $0.26 before add-ons. Deepgram, AWS Transcribe, Google Cloud Speech-to-Text, and the OpenAI Whisper API all land in the same low-per-minute territory.
The catch is what that price leaves out. These are APIs, not finished services. Using one means building and maintaining an integration. You handle the authentication and the polling. You format the raw JSON into a readable TXT or SRT file yourself. On most of them speaker diarization costs extra, and there is no preview before you pay.
The hidden costs of DIY transcription are engineering time and ongoing maintenance, and they land on a developer's plate rather than in a single checkout. For a team with engineers and steady volume, an API is the right call. For everyone who just wants to upload a file and download a transcript, it is a different product.
Where a Subscription Costs Less
A monthly subscription costs less per file than BrassTranscripts for users who transcribe a lot, every month, on a predictable schedule. Otter's Pro plan is $16.99/month for 1,200 minutes; a user who actually fills that allocation pays an effective rate far below $6.00 per file. The same logic applies to Sonix's $22/month Premium tier and Trint's $80/month Starter plan.
The condition is regular, high volume. Subscriptions carry monthly fees whether or not you use them, minute caps and file-import limits, and per-seat charges on team plans. The subscription model stops paying off the moment your usage turns occasional or seasonal, which is the case for most people transcribing an interview, a deposition, or a one-off meeting. BrassTranscripts is built for that case: $6.00 for the file in front of you, nothing the month you transcribe nothing. Our pay-per-file pricing and the deals-and-pricing comparison lay out the full breakdown.
Cost Is Not Accuracy
Cost and accuracy are separate axes, and this benchmark measures only cost. Human services like Rev and Scribie charge 10 to 40 times more than BrassTranscripts per long file precisely because a person types and checks the transcript, which buys accuracy that AI transcription does not match on hard audio. Paying $179 instead of $6 for a 90-minute panel can be the correct decision for a court filing or a medical record.
For most business and content work on clear audio, AI transcription delivers professional-grade results at a fraction of the price, which is the trade BrassTranscripts is built around. What the evidence actually shows about accuracy — and why a single accuracy percentage is misleading — is documented in the AI transcription accuracy investigation and the BrassTranscripts Research Index. Read those for the accuracy side; this page is about the bill.
Methodology and Sources
This benchmark measures one number: the total out-of-pocket cost to transcribe a single file, with automatic speaker identification included, no code required, and no monthly commitment. That definition is the scope, and it is why developer APIs and monthly subscriptions appear in their own sections rather than the headline table — both win under conditions the table holds fixed, and both are documented above rather than hidden.
Competitor prices are taken as published on each provider's official pricing page and verified in our pricing research between October 2025 and January 2026. Per-minute and per-hour figures are multiplied by file length; ranges reflect a provider's own stated rate range.
- BrassTranscripts: $2.50 up to 15 minutes, $6.00 for 16+ minutes (any length), speaker identification included. See pricing.
- Sonix: $10/hour ($0.167/minute) Standard pay-as-you-go, speaker identification included. See the Sonix pricing breakdown.
- Scribie: $0.80–$1.10/minute, human transcription.
- Rev (human): $1.99/minute, human transcription. Rev's automated API is covered in the Rev.ai pricing analysis.
- AssemblyAI: ~$0.0025/minute base, speaker identification a paid add-on. See the AssemblyAI cost breakdown.
- Otter.ai: $16.99/month Pro (1,200 minutes). See the Otter pricing analysis.
- Temi ($0.25/minute) and Happy Scribe ($0.20/minute pay-per-use): no speaker identification, excluded from the speaker-ID table.
Pricing changes. The methodology is published so the table can be re-run against current rates at any time. For the case where a no-cost tool is the right answer, see when a no-cost transcription tool wins.
Frequently Asked Questions
What is the cheapest way to transcribe a long multi-speaker file?
For a one-off long recording with speaker identification included and no code required, BrassTranscripts at $6.00 flat is the lowest-cost option above about 36 minutes. A 90-minute file is $6.00 on BrassTranscripts versus $15.00 on Sonix's pay-as-you-go rate, $72-$99 through Scribie's human transcribers, and $179.10 at Rev's human rate. Developer APIs cost less per minute but require building an integration and pay extra for speaker labels.
Does flat-rate pricing actually beat per-minute for transcription?
Flat-rate beats per-minute once a file is long enough that the per-minute total passes the flat fee. BrassTranscripts charges $6.00 for any file 16 minutes or longer, so a per-minute service has to stay under that to win. Against Sonix's $0.167/minute pay-as-you-go rate the crossover is about 36 minutes; past that, BrassTranscripts costs less and the gap grows with every minute of audio.
Are developer APIs like AssemblyAI cheaper than BrassTranscripts?
Per minute, yes. AssemblyAI's base rate of roughly $0.0025/minute makes a 90-minute file cost about $0.26 plus a speaker-identification add-on, far below $6.00. The trade is that developer APIs require writing and maintaining an integration, formatting the raw output into TXT/SRT/VTT yourself, and paying extra for speaker labels. For anyone who wants to upload a file and download a finished transcript, that is a different product, not a cheaper version of the same one.
When does a transcription subscription cost less than pay-per-file?
A subscription wins for regular high-volume users. Otter's Pro plan at $16.99/month includes 1,200 minutes, so a user transcribing many hours every month pays a low effective rate per file. BrassTranscripts wins for occasional and unpredictable use: there is no monthly fee, no minute cap, and no commitment, so a single 90-minute file costs $6.00 instead of a recurring subscription.
Is speaker identification included in the transcription price?
On BrassTranscripts, automatic speaker identification is included in the $2.50 or $6.00 price on every file. Several services in this benchmark charge separately for it or omit it: Rev's API, AssemblyAI, Deepgram, and AWS Transcribe all treat speaker diarization as an add-on, and pay-per-minute services Temi and Happy Scribe do not include speaker labels at all.
Run your own file through the benchmark: BrassTranscripts is $2.50 for files up to 15 minutes and $6.00 for anything longer, speaker identification included, with a 30-word preview before you pay. No subscription, no per-minute meter.