SaladCloud's Transcription API starts at $0.08/hr (Lite). That sounds great on paper, until you benchmark it against a production workload. Here's what we found.
| Metric | VoxParse | SaladCloud Lite ($0.08/hr) | SaladCloud Full ($0.20/hr) |
|---|---|---|---|
| Processing time (24 min call) | 12 seconds | 90 seconds | 88 seconds |
| Name accuracy | ✓ "Jesús" (accent correct) | ✗ "Sue", "401" (garbled) | ✗ Inconsistent |
| Speaker diarization | ✓ Agent / Customer | ⚠ Messy labels | ✓ Clean |
| Structured output | ✓ Full JSON (20+ fields) | ✗ Raw text only | ✗ Raw text only |
| Sentiment analysis | ✓ Included | ✗ Not available | ✗ Not available |
| Compliance / PCI | ✓ Included | ✗ Not available | ✗ Not available |
| Call summary | ✓ Included | ✗ Not available | ✗ Not available |
| Hallucinations | ✓ None (stripped by AI) | None | None |
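
To make the "Structured JSON" versus "Raw text only" row concrete, here is a minimal sketch of what consuming a structured transcript can look like. The payload and its field names (`summary`, `sentiment`, `segments`, `speaker`, `text`) are illustrative assumptions, not the documented VoxParse schema.

```python
import json

# Hypothetical payload: every field name here is an assumption made for
# illustration, not taken from VoxParse documentation.
sample = json.loads("""
{
  "summary": "Customer called about a late fee.",
  "sentiment": "neutral",
  "segments": [
    {"speaker": "Agent", "text": "Thanks for calling."},
    {"speaker": "Customer", "text": "Hi, this is Jesús."}
  ]
}
""")


def lines_by_speaker(payload, speaker):
    """Return the text of every segment spoken by the given speaker label."""
    return [seg["text"] for seg in payload["segments"] if seg["speaker"] == speaker]


print(lines_by_speaker(sample, "Customer"))
print(sample["sentiment"])
```

With raw text output, the same lookup would require splitting the transcript and inferring speaker turns yourself, which is the gap the table row points at.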

| Feature | VoxParse | SaladCloud |
|---|---|---|
| All-inclusive pricing | ✓ $0.49/hr flat | ✗ Transcription only |
| Synchronous API | ✓ Single HTTP response | ✗ Async webhook required |
| Processing speed (24 min call) | ✓ ~12 seconds | ~90 seconds |
| Output format | ✓ Structured JSON (20+ fields) | Raw text |
| Speaker labels | ✓ Agent / Customer | Speaker 0 / 1 (Lite: messy) |
| Sentiment analysis | ✓ Included | ✗ Not available |
| PII / PCI redaction | ✓ Included | ✗ Not available |
| Call summarization | ✓ Included | ✗ Not available |
| Financial data extraction | ✓ Payments, balances, charges | ✗ Not available |
| Compliance analysis | ✓ Recording disclosure, auth, PII types | ✗ Not available |
| Action items / agreements | ✓ Included | ✗ Not available |
| Custom AI instructions | ✓ Included (2,000 chars) | ✗ Not available |
| Name accuracy (Lite) | ✓ Accent-correct ("Jesús") | ✗ Garbled ("Sue", "401") |
| Languages | 97+ | 97+ |
| Infrastructure | Enterprise cloud | Consumer GPUs (shared) |
| Audio data retention | ✓ Deleted after processing | Configurable |
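
The hourly prices and processing times above are easier to compare once they are converted to a per-call cost and a speed relative to real time. The sketch below does that arithmetic for the 24-minute benchmark call, using only the figures quoted in the tables. It assumes billing is linear in audio duration, which the tables do not state.

```python
CALL_MINUTES = 24  # length of the benchmark call in the comparison table

# (price per audio hour in USD, processing time in seconds), from the tables
SERVICES = {
    "VoxParse": (0.49, 12),
    "SaladCloud Lite": (0.08, 90),
    "SaladCloud Full": (0.20, 88),
}


def per_call_stats(price_per_hour, processing_seconds, call_minutes=CALL_MINUTES):
    """Return (cost in USD for one call, speed as a multiple of real time)."""
    # Assumes cost scales linearly with audio duration.
    cost = price_per_hour * call_minutes / 60
    speedup = call_minutes * 60 / processing_seconds
    return round(cost, 3), round(speedup, 1)


for name, (price, seconds) in SERVICES.items():
    cost, speedup = per_call_stats(price, seconds)
    print(f"{name}: ${cost:.3f} per call, {speedup}x real time")
```

These numbers cover transcription price and speed only. They do not put a value on the sentiment, redaction, or summarization rows, which SaladCloud does not offer at either tier.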

Structured JSON. Call intelligence. ~12-second processing for a 24-minute call. One API call.

Get Your Free API Key

No credit card required · Enterprise-grade security · Audio deleted post-processing