Beta

CloviTranscribe

Transcription that stays on your side.

The $8.6B global ASR market, growing at a 17% CAGR, is dominated by cloud-only products with per-minute pricing. Of that market, 15–20% (400K+ SMBs in food, agriculture, legal, and HR) has data sovereignty requirements that rule out cloud transcription. CloviTranscribe is positioned as a category of one: self-hosted, white-label, API-embeddable transcription at SMB pricing ($29/month), serving a structurally under-served niche with 41% gross margins and predictable unit economics.

Beta | clovitranscribe.clovitek.com | CloviTek Platform
What it is

An overview of CloviTranscribe

CloviTranscribe turns any audio or video file into a searchable, speaker-labeled transcript on your own infrastructure: no cloud upload, no per-minute fees, complete data privacy. The core engine is faster-whisper, a CTranslate2-based reimplementation of OpenAI's Whisper (3–7% word error rate), paired with pyannote speaker diarization and Claude Haiku AI summaries. Launched as a CloviTek platform service, it is native to DeckSpeaksAI (auto-captions), SpectroScience (lesson transcripts), AudiobookSmith (chapter QA), and CloviLeads CRM (action items).
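As a rough illustration of the engine step described above, the sketch below runs faster-whisper on a local file and renders the result as SRT captions. It is a minimal sketch under assumptions, not CloviTranscribe's actual pipeline: the names `Segment`, `srt_timestamp`, `to_srt`, and `transcribe_file` are hypothetical, CPU inference with `int8` is an assumed setting, and the `large-v3-turbo` model name requires a faster-whisper release that ships it. Diarization and summaries are left out.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Segment:
    """One transcribed span of audio (hypothetical container type)."""
    start: float  # seconds from the start of the file
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used by SRT."""
    ms = int(round(seconds * 1000))
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.text.strip()}\n"
        )
    return "\n".join(cues)


def transcribe_file(path: str, model_size: str = "large-v3-turbo") -> List[Segment]:
    """Transcribe a local file with faster-whisper (assumed settings)."""
    # Imported lazily so the formatting helpers work without the package.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(path, beam_size=5)
    # transcribe() returns a lazy generator; iterating it runs the decoding.
    return [Segment(s.start, s.end, s.text) for s in segments]
```

Calling `to_srt(transcribe_file("call.mp3"))` would then produce caption text for a file on the local disk, with no audio leaving the machine.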

What makes it different

Built to win, not to blend in

Self-hosted/on-premise deployment at $29/month (competitors require $5K+ contracts or prohibit it entirely)

Zero per-minute billing — flat-rate, predictable costs on already-provisioned VPS hardware

White-label embed capability for SaaS builders (Business tier, $79/month) — no competitor offers this at SMB pricing

Speaker diarization (up to 8 speakers) included on Pro tier; competitors charge $18–$30/seat or $0.65/min surcharge

Full-stack CloviTek ecosystem integration — auto-captions DeckSpeaksAI/NIR course, feeds CloviLeads CRM, replaces custom pipelines

Capabilities

Key features

Instant transcription with 99-language support
Upload MP3, MP4, or WAV files, or paste a YouTube/Zoom link; faster-whisper large-v3-turbo processes audio at 2–3x realtime with a 3–7% word error rate.
Speaker diarization (8 speakers, Pro tier)
Automatically labels who said what. Rename speakers in the viewer and the labels persist. Tested on food and agriculture operator recordings with background noise and non-native English speakers.
AI meeting summary (Pro tier)
After each job, Claude Haiku generates a TL;DR, action items, key decisions, and open questions. Summaries appear in the transcript viewer with one-click copy per section.
Full-text search across library (Pro tier)
SQLite FTS5 index on all stored transcripts. Find any phrase, decision, or speaker across 20+ jobs. Turns the tool from a one-off utility into a knowledge base.
Multi-format export (formats vary by tier)
TXT (plain), SRT/VTT (captions), JSON (word-level timestamps). Free exports TXT only, Starter adds SRT/VTT, and Pro adds JSON. Business tier supports custom vocabulary injection and OpenAI Whisper API fallback.
REST API + webhooks (Business tier)
Submit jobs programmatically, receive a webhook on completion, and embed transcription in your SaaS product. Supports custom-domain white-labeling (transcript.yourcompany.com).
CloviTek platform integration
Zero-friction activation for all CloviTek company accounts via cl_session SSO. Auto-captions DeckSpeaksAI videos, indexes the NIR course, and feeds action items into the CRM.
No per-minute cost ceiling
Marginal cost of transcription is effectively zero because the CPU is already provisioned. Pro at $29/month covers unlimited transcription, versus $2,220/month for AssemblyAI at 100 hours.
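For the full-text search feature listed above, the sketch below shows one way an SQLite FTS5 index over transcript segments could look. It is illustrative only: the table name `transcripts`, its columns, and the helper functions are assumptions and not the product's actual schema, and it relies on the local SQLite build having FTS5 enabled.

```python
import sqlite3


def build_index(conn: sqlite3.Connection) -> None:
    """Create a hypothetical FTS5 table holding one row per transcript segment."""
    conn.execute(
        "CREATE VIRTUAL TABLE IF NOT EXISTS transcripts "
        "USING fts5(job_id UNINDEXED, speaker, text)"
    )


def add_segment(conn: sqlite3.Connection, job_id: str, speaker: str, text: str) -> None:
    """Store a segment; FTS5 tokenizes the speaker and text columns on insert."""
    conn.execute(
        "INSERT INTO transcripts (job_id, speaker, text) VALUES (?, ?, ?)",
        (job_id, speaker, text),
    )


def search(conn: sqlite3.Connection, query: str, limit: int = 10) -> list:
    """Return (job_id, speaker, snippet) rows ordered by FTS5 relevance."""
    rows = conn.execute(
        "SELECT job_id, speaker, snippet(transcripts, 2, '[', ']', '...', 8) "
        "FROM transcripts WHERE transcripts MATCH ? "
        "ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()
```

A single virtual table keeps the search cross-job by default: a query such as `search(conn, '"cold chain"')` matches the phrase in any stored transcript, and `job_id` links each hit back to its job.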
See it in action

A glimpse of CloviTranscribe

CloviTranscribe ships as a fully branded, production-grade product on the CloviTek platform — integrated auth, billing, and a polished interface your team and customers can use from day one.

Ecosystem

Built on the CloviTek stack

CloviTranscribe composes shared platform infrastructure and sibling products: the integrated ecosystem that makes each new product faster to build and stickier to keep.

CloviTek platform (SSO, cl_session auth, company infrastructure)
CloviTek engine agents (backend build system)
faster-whisper (CTranslate2-based reimplementation of OpenAI Whisper)
pyannote.audio (speaker diarization)
Claude Haiku (AI summaries)
EmailIt (job notifications)
Stripe (payment processing)
Integrations & developer features

Connects to your stack

REST API (POST/GET jobs, export, search)
Webhooks (HMAC-signed job completion delivery)
CloviTek SSO (cl_session cookie auth)
DeckSpeaksAI (auto-VTT caption pipeline)
SpectroScience (NIR course caption replacement)
AudiobookSmith (ACX SRT generation, accuracy QA via transcript diff)
CloviLeads CRM (action item extraction → pipeline import)
Zapier/n8n (generic webhook template in automation marketplace)
Google Drive/Docs (OAuth2 export)
YouTube/Vimeo (native caption upload)
Slack (job completion notification)
Calendar/Zoom/Teams (optional Year 2: meeting recording link detection)
OpenAI Whisper API (optional Business tier premium accuracy mode)
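The HMAC-signed webhook delivery listed above implies that the receiver checks a signature before trusting the payload. The sketch below shows the usual pattern under stated assumptions: HMAC-SHA256 over the raw request body with a hex-encoded digest. The page does not specify the algorithm, encoding, or header name, so treat those choices and the function names as hypothetical.

```python
import hashlib
import hmac


def sign_payload(secret: bytes, body: bytes) -> str:
    """Compute a hex HMAC-SHA256 digest of the raw body (assumed scheme)."""
    return hmac.new(secret, body, hashlib.sha256).hexdigest()


def verify_signature(secret: bytes, body: bytes, received_signature: str) -> bool:
    """Check a received signature using a constant-time comparison."""
    expected = sign_payload(secret, body)
    # compare_digest avoids leaking how many leading characters matched.
    return hmac.compare_digest(expected.encode(), received_signature.encode())
```

The check should run against the exact bytes received, before any JSON parsing, since re-serializing the payload can change whitespace or key order and invalidate the digest.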
Who it's for

Designed for these teams

Maya — Operations Manager at mid-sized food distributor; 15 vendor calls/week, compliance-sensitive, IT blocks cloud transcription; values self-hosted, structured export for audits.

Jordan — Indie podcast producer; 3 B2B podcasts, clients demand private transcripts not on third-party SaaS; needs white-label option and no per-minute overage.

Arjun — CTO of HR compliance SaaS; needs to embed transcription inside product, customers require on-premise data, AssemblyAI $2,200/mo is not viable; needs API and DPA.

Target audience

Investors

Use cases

Put to work

01

Operations managers automating vendor/client call documentation (food distribution, legal, HR, field services) — 15+ calls/week, compliance-sensitive, IT blocks cloud.

02

Podcast producers and freelance content creators needing client-private transcripts with diarization and show-note export — avoid Descript minute caps and cloud privacy concerns.

03

SaaS builders embedding transcription inside products (HR platforms, compliance tools, meeting intelligence) — need API, webhooks, no per-minute cost surprises.

04

Audiobook/training video producers generating closed captions and searchable transcripts from narration — ACX SRT export, accuracy QA via transcript diff.

05

Researchers and academics transcribing interviews, lectures, focus groups — need affordable bulk transcription with structured export and full-text search.

06

Food and agriculture companies processing call recordings for USDA/FSMA compliance — data residency-locked, require self-hosted, searchable audit trail.

Roadmap

Where it's going

Q1 (Days 1–15): P0 core engine (faster-whisper subprocess, 4 export formats, Stripe billing gates, brand fixes). P1 diarization reliability, job queue transparency, email notifications. Target: Free tier MVP, 15 paying customers, $500 MRR.

Q2 (Months 4–6): Full-text search (FTS5), AI summary UI polish, Stripe upgrade flows, URL transcription (yt-dlp). Webhook framework for B-tier. Target: 60 paying customers, $2,100 MRR, 400 free users.

Q3 (Months 7–9): Business tier REST API + HMAC webhooks, white-label subdomain config, custom vocabulary hints, OpenAI Whisper fallback. AppSumo lifetime deal ($69 Pro). Target: 200 paying customers, $7,500 MRR.

Q4 (Months 10–12): DeckSpeaksAI auto-caption pipeline, SpectroScience NIR integration, AudiobookSmith QA, multi-language UI. CloviTek per-company quota in admin. Target: 450 paying customers, $18,000 MRR, 2,500 free users.

Year 2: Multi-user team accounts, enterprise on-premise with HIPAA BAA, calendar/Zoom link detection, real-time streaming transcription. CloviTranscribe as platform-layer service bundled into CloviTek Growth/Scale plans.

Pricing

Plans & tiers

Free
$0/month

30 min/month, file upload only, TXT export, no timestamps or diarization. No credit card. Evaluators and one-off users.

Starter
$9/month ($84/yr)

5 hours/month, URL transcription, SRT/VTT/TXT export with timestamps. Individuals with light-moderate volume.

Pro
$29/month ($276/yr)

Unlimited minutes, 8-speaker diarization, AI summary, full-text search, JSON export, 90-day history. Core offering for ops managers and podcasters.

Business
$79/month ($756/yr)

Pro + REST API, webhooks, custom vocabulary, white-label domain, OpenAI fallback, 365-day retention, priority queue, SLA support.

Enterprise
Custom

On-premise deploy, HIPAA BAA, SOC 2, custom SLA, multi-user/team accounts. Annual starting $5K+. Scoped by volume and infrastructure.

Pricing reflects planned standalone tiers. Platform tenants are billed through their CloviTek subscription.
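The arithmetic behind the flat-rate comparison and the annual prices can be worked through with the figures quoted on this page. This is a back-of-the-envelope sketch: the per-minute rate is only what the page's AssemblyAI figure ($2,220 for 100 hours) implies, not a verified vendor price, and the function names are illustrative.

```python
def implied_rate_per_minute(monthly_cost: float, hours: float) -> float:
    """Per-minute rate implied by a monthly bill at a given volume."""
    return monthly_cost / (hours * 60)


def break_even_minutes(flat_fee: float, rate_per_minute: float) -> float:
    """Monthly minutes at which per-minute billing equals the flat fee."""
    return flat_fee / rate_per_minute


def annual_discount(monthly_price: float, yearly_price: float) -> float:
    """Fractional saving of the annual plan versus twelve monthly payments."""
    return 1 - yearly_price / (monthly_price * 12)


# Figures taken from this page.
rate = implied_rate_per_minute(2220, 100)     # about $0.37 per minute
pro_break_even = break_even_minutes(29, rate)  # a little over 78 minutes

# Annual plans: Starter $84, Pro $276, Business $756.
discounts = {
    "Starter": annual_discount(9, 84),
    "Pro": annual_discount(29, 276),
    "Business": annual_discount(79, 756),
}
```

On these numbers, the Pro flat fee matches the implied per-minute cost after roughly 78 minutes of audio per month, and each annual plan works out to a discount of about 20 to 22 percent.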

Get early access to CloviTranscribe

CloviTranscribe is part of the CloviTek platform — request access and we'll provision a fully branded instance for your team and customers.