โ† Back to Blog Feature Guide

Verbatim vs. Polished Transcription: Choosing the Right Mode

April 23, 2026 ยท 5 min read
Verbatim vs polished transcription comparison

Not all transcripts are created equal. A customer support manager reviewing agent performance needs to hear every "um," hesitation, and self-correction. A business analyst just needs the clean facts. That's why VoxParse offers two transcription modes: verbatim and polished.

Both modes use the same AI pipeline and cost the same per hour. The only difference is how faithfully the transcript represents the original speech. Listen to the sample call below, then compare the output side-by-side.

Listen to the original call

This is a real customer service call (telephony quality, 8kHz mono). The customer is calling about a device exchange offer.

๐ŸŽง Sample call โ€” AT&T inbound, device exchange (~3 minutes)

Side-by-side comparison

Both transcripts below were generated from the same audio file using VoxParse's POST /v1/transcribe endpoint. The only difference is the mode parameter.

โœจ Polished Mode

Agent: Thanks for calling AT&T. My name is Erica. Can I have the wireless telephone number you are calling about today?

Customer: 406-539-1202.

Agent: Confirm that number is 406-539-1202. Is that correct?

Customer: Yes.

Agent: How may I help you today?

Customer: I got a letter saying I can get the BlackBerry Torch.

Agent: Okay, I'd be happy to help with that, but first I need to verify some information for security.

...

Agent: Once you receive the return label, attach it to the phone box and drop it in any USPS blue collection box.

Agent: I have completed the exchange setup. Do you have any questions, Mr. Milgen?

Customer: No.

Agent: We value and appreciate your business. Thank you for calling AT&T. Have a great day.

๐Ÿ“ Verbatim Mode

Agent: Thanks for calling AT&T, my name is Erica. Can I have the wireless telephone number that you're calling about today?

Customer: 406-539-1202.

Agent: So confirm that number was 406-539-1202, is that correct?

Customer: Yes.

Agent: So how may I help you today?

Customer: Oh, I got this letter saying I can get the BlackBerry Torch.

Agent: Mm-hmm. Okay, I don't worry, I'd be more than happy to see with us, but before we proceed, for security reasons of your account, can you please verify some information for me first?

...

Agent: Okay, and also once you receive the Black... I mean the return label, just simply have fixed it to the phone box and drop it in any USPS blue box, okay?

Customer: Drop it in what?

Agent: In any USPS blue box.

Agent: Okay, so I'm done feeling that all the information needed here for the exchange. So do you have any questions regarding the information that I provide to you, Mr. Milligan?

Agent: Okay, we do value and appreciate your business. Thank you for calling AT&T and have a great day.

Highlighted spans show elements preserved in verbatim that polished removes: filler words, false starts, self-corrections, and natural speech patterns.

Key differences

AspectPolishedVerbatim
Filler wordsRemoved (um, uh, so, oh)Preserved exactly as spoken
False startsCleaned upKept ("the Black... I mean the return label")
Self-correctionsSmoothed to final intentBoth attempts preserved
ASR errorsFixed (e.g., "USBs" โ†’ "USPS")Fixed (same โ€” ASR errors are always corrected)
ReadabilityProfessional, report-readyNatural speech, may read rough
Word count~15-30% shorterFull length
CostSame ($0.49/hr)Same ($0.49/hr)

When to use each mode

Use Polished for:

Use Verbatim for:

How to use it

Add the mode parameter to your POST /v1/transcribe request. Default is polished.

# Polished mode (default)
curl -X POST https://api.voxparse.com/v1/transcribe \
  -H "X-API-Key: YOUR_API_KEY" \
  -F "[email protected]"

# Verbatim mode
curl -X POST https://api.voxparse.com/v1/transcribe \
  -H "X-API-Key: YOUR_API_KEY" \
  -F "[email protected]" \
  -F "mode=verbatim"

Both modes return the same structured JSON response with AI analysis, compliance flags, sentiment, and financial extraction. The only difference is the transcript_cleaned field in the response.

Pro tip: Run the same file through both modes and store both transcripts. Use polished for your dashboard and verbatim for your compliance archive.

Try both modes free

Start with $10 in prepaid credits. Same price for both modes โ€” $0.49/hr, everything included.

Get your API key โ†’

Bottom line

Most transcription APIs give you one output and call it done. VoxParse gives you the choice: a polished transcript for business consumption, or a verbatim transcript for when every word matters. Same audio, same price, same API call โ€” just add mode=verbatim.