What is the industry standard and fair way to charge for transcription work? Why do transcribers charge by the audio minute rather than by the typed word? This article explains why and offers a fair and standard pricing structure, too. It’s part of a series, and you can find the other articles in the series and a link to my popular book on the subject at the bottom of this article.
I was working with an agency on presenting an offer for a transcription job to a company. As usual, we provided a per-audio-minute rate. This works well and is the industry standard, as it’s predictable in advance and doesn’t change according to how long it takes the transcriber to do the job. Of course, it’s up to the transcriber to check the tape and make sure they’re charging a per-minute rate that’s fair to both them and the client. Mine is based on two speakers, a clear tape and a non-urgent turnaround time, with fair and transparent per-minute add-ons for more speakers, tape issues or urgent turnaround.
In this case, the client wanted a quotation by the number of words typed and/or the time it was going to take me to transcribe the tape. So they wanted to know my words-per-minute typing speed for a standard transcription.
Is there such a thing as a standard transcription speed?
In a word: No. There is no such thing as a standard transcription typing speed.
For a start, the speeds you can calculate from your own documents are not worked out in the same way that typing tests work out your official typing speed. That’s done on the basis of a standard word of five letters plus one space. (I worked this out, because I’m like that: a document that showed as 11,582 words came to 10,459 “standard words”, which gave me a typing speed of 50 words per minute by the document’s own word count, or 45.5 by the standard measure.)
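If you want to try the same comparison on your own documents, here is a minimal Python sketch of the arithmetic. It assumes the definition given above (one standard word is five letters plus one space, so six characters); the function names and the sample sentence are made up for illustration, and the divisor is a parameter in case a particular typing test uses a different convention.

```python
def standard_words(text: str, chars_per_standard_word: int = 6) -> float:
    """Convert raw text into "standard words".

    Assumption: one standard word is five letters plus one space,
    i.e. six characters, following the definition in this article.
    """
    if chars_per_standard_word <= 0:
        raise ValueError("chars_per_standard_word must be positive")
    return len(text) / chars_per_standard_word


def words_per_minute(word_count: float, minutes_taken: float) -> float:
    """Typing speed for a job that took `minutes_taken` minutes."""
    if minutes_taken <= 0:
        raise ValueError("minutes_taken must be positive")
    return word_count / minutes_taken


# Hypothetical sample of spoken-style text, full of short words.
sample = "so I said to him that it was not on the cards at all "
actual = len(sample.split())        # words as a word processor counts them
standard = standard_words(sample)   # words as a typing test counts them

print(f"actual words: {actual}, standard words: {standard:.1f}")
```

Speech tends to be full of short words, so the standard count usually comes out lower than the word processor's count, which is why the same job can produce two different words-per-minute figures.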
For another thing, the typing speeds you are measured on as a copy-typer are different from those you can achieve doing audio typing / transcription. I can type at about 70 wpm, but my transcription speeds vary WILDLY, as you can see below. If a client is used to hearing about a good typist typing 70 wpm, are they going to be impressed if we offer them a price based on 35 wpm? Probably not.
Of course, when transcribing, it’s rare to be able to keep up with the speakers without pausing the tape. It’s also rare to be able to hear everything perfectly first time – everyone has to rewind and check. In addition, a good transcriber will fact-check as they go along – company names, people’s names, the names of albums … and this slows things down, too, of course.
On top of that, it’s impossible to calculate a standard transcription speed, as it will vary according to:
- Number of speakers
- Accents of speakers
- Speed that the speakers speak
- Turn-taking versus overlapping speech
- Background noise
- Quality of the tape
- Degree of accuracy / in-transcription editing the client wants (e.g. turning non-standard English into standard English, transcribing every um, er and repetition vs. tidying the tape up slightly to not include ums, ers and repetitions)
I actually went back and checked a few transcriptions that I’d done recently (I note how long jobs take me and could take the word count from the Word document). My words-per-minute rate varied between 35 wpm and 60 wpm over a range of transcriptions, and that variation was not predictable by the type of client or the type of content (I do mainly journalists’ interviews and corporate work transcribing presentations, videos and conferences).
What is a fair way to charge for transcription?
The fair way to charge for transcription is by the audio minute. This is fair on the transcriber, as long as they have a range of pricing to suit different situations, and fair on the client, because in most cases they will know the charge up front. The only exception would be if they booked in 30 minutes and then sent 90 minutes of tape, with more speakers than expected, that had suddenly become super urgent. If the client specifies exactly what they have, the transcriber will be able to quote clearly in advance.
I charge …
- A minimum rate per audio minute for up to 2 speakers, speaking clearly on a good quality tape and not urgent (with 24 hours for up to a 60-minute tape)
- A certain amount extra per audio minute for each additional speaker
- A certain amount extra per audio minute for particularly challenging tape quality (checked beforehand and only used if it’s a truly terrible tape or one with huge amounts of background noise)
- A certain amount extra per audio minute for urgent turnaround (under 24 hours for up to 60 minutes; negotiable over that tape length)
This charging structure has worked well for me over my transcription career so far.
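For anyone who likes to see a structure like this written down, here is a rough Python sketch of how a per-audio-minute quote could be put together. The class and function names are hypothetical, and the figures in the example rate card are placeholders chosen to make the sums easy, not my actual rates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RateCard:
    """Per-audio-minute rates. All figures are supplied by the transcriber."""
    base_per_minute: float            # up to 2 speakers, clear tape, non-urgent
    extra_speaker_per_minute: float   # added for each speaker beyond the included ones
    poor_audio_per_minute: float      # added only for truly difficult tapes
    urgent_per_minute: float          # added for urgent turnaround
    included_speakers: int = 2


def quote(card: RateCard, audio_minutes: float, speakers: int = 2,
          poor_audio: bool = False, urgent: bool = False) -> float:
    """Return the total charge for a tape, based on its length in audio minutes."""
    if audio_minutes <= 0:
        raise ValueError("audio_minutes must be positive")
    if speakers < 1:
        raise ValueError("there must be at least one speaker")

    rate = card.base_per_minute
    extra_speakers = max(0, speakers - card.included_speakers)
    rate += extra_speakers * card.extra_speaker_per_minute
    if poor_audio:
        rate += card.poor_audio_per_minute
    if urgent:
        rate += card.urgent_per_minute
    return round(rate * audio_minutes, 2)


# Placeholder figures for illustration only.
example_card = RateCard(
    base_per_minute=1.00,
    extra_speaker_per_minute=0.25,
    poor_audio_per_minute=0.50,
    urgent_per_minute=0.50,
)

standard_job = quote(example_card, audio_minutes=60)
busy_job = quote(example_card, audio_minutes=30, speakers=4, urgent=True)
print(standard_job, busy_job)
```

Everything the quote depends on (tape length, number of speakers, tape quality, urgency) is something the client can tell you before the work starts, which is what makes the price predictable up front.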
If you are asked to provide other kinds of pricing, do bear in mind my points above, and feel free to refer your client to this article to explain further!
Related posts in the series: