Transcription Accuracy

Names and technical terms are misspelled

Cause: The transcription model does not know your company-specific terms, product names, or people’s names.

Fix: Add terms to your custom vocabulary:

Go to Settings > Transcription > Custom vocabulary.
Add names, acronyms, product names, and technical jargon.
Future transcriptions will prioritize these terms.

Examples: “Kubernetes” (not “Cooper Netty’s”), “Figma” (not “fig ma”), “OKRs” (not “oh cares”).
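When deciding which terms to add, it can help to scan an exported transcript for misspellings you have already seen. The sketch below is illustrative only: the `KNOWN_MISHEARINGS` mapping and the `apply_vocabulary` function are hypothetical names, and this simple find-and-replace is not how Mavio applies custom vocabulary internally.

```python
import re

# Hypothetical mapping from observed mis-transcriptions to the intended term.
KNOWN_MISHEARINGS = {
    "cooper netty's": "Kubernetes",
    "fig ma": "Figma",
    "oh cares": "OKRs",
}


def apply_vocabulary(text: str, mapping: dict[str, str]) -> str:
    """Replace each known mis-transcription with its intended term."""
    # Normalize curly apostrophes so "Netty’s" and "Netty's" match the same key.
    text = text.replace("\u2019", "'")
    for wrong, right in mapping.items():
        # Whole-phrase, case-insensitive match.
        pattern = re.compile(r"\b" + re.escape(wrong) + r"\b", re.IGNORECASE)
        text = pattern.sub(right, text)
    return text


if __name__ == "__main__":
    sample = "We deploy on Cooper Netty’s and review oh cares weekly."
    print(apply_vocabulary(sample, KNOWN_MISHEARINGS))
```

Any phrase that shows up repeatedly in such a mapping is a good candidate for the custom vocabulary list.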

Wrong language detected

Cause: Automatic language detection misidentified the spoken language, often when meetings include code-switching or technical English mixed with another language.

Fix: Set a default language:

Go to Settings > Transcription > Default language.
Select the primary language of your meetings.
Mavio will still detect language switches but will default to your selection when uncertain.
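The fallback behavior described above can be pictured as a simple confidence check. This is a minimal sketch under stated assumptions: the `choose_language` function, the confidence score, and the 0.6 threshold are all hypothetical and do not reflect Mavio's actual detection logic or settings.

```python
def choose_language(
    detected: str,
    confidence: float,
    default: str,
    threshold: float = 0.6,  # hypothetical cutoff, chosen only for illustration
) -> str:
    """Return the detected language, or the default when detection is uncertain."""
    if confidence >= threshold:
        return detected
    return default


if __name__ == "__main__":
    # Confident detection: the language switch is honored.
    print(choose_language("de", 0.92, default="en"))
    # Uncertain detection: the configured default wins.
    print(choose_language("de", 0.35, default="en"))
```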

Speakers talking over each other (crosstalk)

Cause: Overlapping speech is inherently difficult for any transcription system. When two people talk simultaneously, words get lost.

Fix: There is no technical fix for crosstalk after the fact. To prevent it or limit its impact:

Encourage “raise hand” behavior in meetings
Use the meeting bot (which captures cleaner audio from the platform) rather than mobile recording
Edit the transcript manually for critical sections
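To find the sections worth editing manually, you can look for places where speaker segments overlap in time. The sketch below assumes a hypothetical export format in which each segment carries a speaker label and start and end times in seconds; the `Segment` type and `find_crosstalk` function are illustrative names, not part of any Mavio API.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    speaker: str
    start: float  # seconds from the start of the recording
    end: float    # seconds from the start of the recording


def find_crosstalk(segments: list[Segment]) -> list[tuple[float, float]]:
    """Return (start, end) spans where two different speakers overlap."""
    ordered = sorted(segments, key=lambda s: s.start)
    overlaps = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b.start >= a.end:
                break  # sorted by start, so no later segment can overlap a
            if a.speaker != b.speaker:
                overlaps.append((b.start, min(a.end, b.end)))
    return overlaps


if __name__ == "__main__":
    meeting = [
        Segment("Ana", 0.0, 5.0),
        Segment("Ben", 4.0, 8.0),
        Segment("Ana", 8.0, 10.0),
    ]
    for start, end in find_crosstalk(meeting):
        print(f"Review transcript between {start:.1f}s and {end:.1f}s")
```

Segments that merely touch (one ends exactly when the next begins) are not reported, since no speech overlaps there.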

Strong accents or non-native speakers

Cause: While Mavio supports diverse accents well, very strong accents combined with fast speech can reduce accuracy.

Fix:

Ensure audio quality is high (headset, quiet environment)
Add commonly mispronounced terms to custom vocabulary
Use cloud mode rather than privacy mode for better accent handling

Background noise affecting accuracy

Cause: Cafe noise, air conditioning, traffic, and other ambient sounds compete with speech.

Fix:

Use a headset or close-talking microphone
Record in a quiet room when possible
Use the meeting bot or system audio capture, both of which bypass the microphone entirely
Enable High quality recording mode for better noise handling

Phone or dial-in audio is garbled

Cause: Phone lines transmit audio sampled at 8 kHz (narrowband), which is half the sample rate of typical computer-based audio (16 kHz or higher). This reduces the detail available for transcription.

Fix: Ask participants to join via computer audio instead of dialing in. If phone dial-in is unavoidable, the accuracy reduction is expected and cannot be fully compensated.
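The effect of the sample rate can be made concrete with the Nyquist limit: a digital recording can only represent frequencies up to half its sample rate. The short sketch below works through that arithmetic for the two rates mentioned above; the `max_frequency_hz` function name is illustrative.

```python
def max_frequency_hz(sample_rate_hz: int) -> float:
    """Nyquist limit: the highest frequency a given sample rate can represent."""
    return sample_rate_hz / 2


if __name__ == "__main__":
    for label, rate in [("phone (narrowband)", 8_000), ("computer audio", 16_000)]:
        limit = max_frequency_hz(rate)
        print(f"{label}: {rate} Hz sampling captures up to {limit:.0f} Hz")
```

Consonants such as "s" and "f" carry much of their energy above 4 kHz, so narrowband audio tends to blur exactly the sounds that distinguish similar words.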

Factors that affect accuracy

Factor	Impact	Your control
Audio quality	High	Use headsets, quiet rooms
Speaker clarity	High	Encourage clear speech
Background noise	High	Control environment
Number of speakers	Medium	Fewer simultaneous speakers helps
Speaking speed	Medium	Normal pace is best
Technical jargon	Medium	Custom vocabulary
Accent strength	Low-Medium	Custom vocabulary helps
Language	Low	Most languages well-supported
Recording method	Medium	Bot and system audio are cleanest

Transcription Accuracy

Common accuracy issues and fixes

Factors that affect accuracy

Improving accuracy over time

Custom vocabulary

Speaker corrections

Transcript edits

Reprocessing a transcript

When to expect lower accuracy