How speaker identification works
Mavio uses voice fingerprinting to distinguish between speakers. The system analyzes vocal characteristics like pitch, cadence, and tone — not the words spoken — to create a unique acoustic signature for each person.Voice enrollment
A speaker records a short voice snippet (15-30 seconds) or Mavio extracts a sample from an existing meeting recording.
Fingerprint generation
Mavio AI processes the audio to create a compact voice fingerprint that captures the speaker’s unique vocal characteristics.
Real-time matching
During future meetings, Mavio compares incoming audio against enrolled fingerprints to label each speaker in the transcript.
Creating a speaker profile
From a meeting recording
- Open any meeting with the speaker present.
- Navigate to the Transcript tab.
- Click on a correctly labeled speaker segment.
- Select Create Speaker Profile from the context menu.
- Enter the speaker’s name, email, and optionally their role or organization.
- Mavio extracts a voice sample from the selected segment.
From a dedicated recording
- Go to Settings > Speaker Profiles > Add Profile.
- Enter the speaker’s details (name, email, role).
- Click Record Voice Sample and have the speaker talk naturally for 15-30 seconds.
- Mavio processes the recording and creates the profile.
Managing profiles
Profile information
Each speaker profile contains:| Field | Description |
|---|---|
| Name | Display name used in transcripts |
| Linked email address for matching with calendar participants | |
| Role | Optional job title or role for context |
| Organization | Optional company or team name |
| Voice snippets | One or more enrolled voice samples |
| Meetings | List of meetings where this speaker was identified |
Editing a profile
Open Settings > Speaker Profiles, select a profile, and update any field. Changes to the display name are applied retroactively to all past transcripts.Merging duplicate profiles
If Mavio creates separate profiles for the same person (common when someone joins from different devices):- Open Settings > Speaker Profiles.
- Select one profile and click Merge.
- Select the duplicate profile to merge into the first.
- Voice snippets from both profiles are combined. Past transcripts are updated.
Voice snippets
Each profile can have multiple voice snippets to improve accuracy across different conditions:- Quiet environment — standard voice sample
- Noisy environment — sample with background noise
- Phone audio — sample from a phone or low-quality microphone
- Different energy levels — samples from calm discussion and animated debate
Cross-meeting speaker tracking
Once a profile is created, Mavio tracks that speaker across all meetings:- Meeting history — see every meeting a speaker has participated in
- Talk time analytics — total and average speaking time per meeting
- Topic associations — what subjects this speaker most frequently discusses
- Interaction patterns — who they speak with most often
Speaker analytics
| Metric | Description |
|---|---|
| Total talk time | Cumulative speaking time across all meetings |
| Average talk ratio | Percentage of meeting time this speaker typically occupies |
| Meeting count | Number of meetings the speaker has appeared in |
| Most common co-speakers | Other speakers frequently in the same meetings |
| Topic frequency | Most discussed topics by this speaker |
Advanced settings
Adjusting identification sensitivity
Adjusting identification sensitivity
In Settings > AI > Speaker Identification, adjust the confidence threshold. A higher threshold produces fewer false matches but may leave some segments as “Unknown Speaker”. A lower threshold catches more segments but may occasionally misidentify speakers.
Handling unknown speakers
Handling unknown speakers
When Mavio encounters a voice that does not match any profile, it labels the speaker as “Speaker 1”, “Speaker 2”, etc. After the meeting, you can assign these labels to existing profiles or create new ones.
Privacy considerations
Privacy considerations
Voice fingerprints are stored securely and encrypted at rest. They are never shared outside your workspace. Speakers can request deletion of their voice profile from Settings > Speaker Profiles > [Profile] > Delete.
Improving accuracy for similar voices
Improving accuracy for similar voices
If Mavio confuses two speakers who sound similar, enroll additional voice snippets for both. Snippets from the same meeting where both speakers are present are especially helpful for training the model to distinguish them.
Speaker profile enrollment consumes 3 AI credits per voice snippet processed. See AI Credits for details.