Privacy GuideJune 27, 202610 min read

Privacy Risks of AI Transcription Services: Otter.ai, Fireflies, Zoom, and More

SC

By Sarah Chen

Head of Privacy Research

Privacy Risks of AI Transcription Services: Otter.ai, Fireflies, Zoom, and More

Don't want to do this by hand? We remove your info from 100+ broker sites automatically.

AI-powered transcription has become a fixture of modern work. Services like Otter.ai, Fireflies.ai, Rev, and built-in tools from Zoom, Microsoft Teams, and Google Meet promise to capture every word of your meetings, interviews, and calls. But behind the convenience lies a serious question: what happens to your voice, your words, and your data after the recording ends? The answer, for most services, is more alarming than most users realize.

The Explosion of AI Transcription

The shift to remote and hybrid work accelerated the adoption of AI transcription tools dramatically. By 2026, an estimated majority of enterprise meetings involve some form of automated transcription or note-taking. Standalone services like Otter.ai and Fireflies.ai have tens of millions of users, while platform-native features in Zoom, Teams, and Google Meet are enabled by default in many organizations. Personal use has surged as well — journalists, students, therapists, and lawyers all rely on these tools to capture conversations.

This rapid adoption has outpaced most people's understanding of what data these services collect and how they handle it.

What Data AI Transcription Services Collect

AI transcription tools do not just produce a text file and discard everything else. Most services collect and retain a significant amount of data, including:

  • Full audio recordings — the complete audio of your meeting or conversation, not just the transcript
  • Transcribed text — a searchable, storable record of everything said
  • Speaker identification — voice profiles that identify who said what, often using voice biometric analysis
  • Meeting metadata — participant names, email addresses, calendar details, meeting duration, frequency, and attendee lists
  • Behavioral data — who speaks most, sentiment analysis, and engagement metrics in some enterprise tools

Taken together, this is far more than a simple transcript. It is a detailed record of your conversations, your relationships, and your voice itself.

Key Privacy Risks

Audio Stored on Third-Party Servers

Most AI transcription services upload your audio to cloud servers for processing. In many cases, these recordings are stored indefinitely unless you manually delete them. This means your private conversations — business strategy discussions, medical consultations, legal advice — sit on servers you do not control, potentially for years.

Your Recordings May Train AI Models

Some transcription services use customer recordings to improve their speech recognition models. This means your private conversations could be reviewed by human annotators or fed into machine learning pipelines. While some services have moved away from this practice or offer opt-outs, the default settings often favor data collection over privacy.

Check Your Service's AI Training Policy

Many transcription services bury their AI training practices deep in their terms of service. If a service does not clearly state that your recordings are never used for model training, assume they may be. Always look for an explicit opt-out toggle in your account settings and enable it immediately.

Voice Biometric Data Collection

Speaker identification features require the service to create a voice profile — essentially a biometric fingerprint of your voice. In states like Illinois, which has the Biometric Information Privacy Act (BIPA), collecting voice biometrics without explicit consent is illegal and has led to significant lawsuits. Yet many transcription services collect this data with minimal disclosure.

Sensitive Information in Transcripts

Transcripts can contain some of the most sensitive information imaginable: proprietary business strategies, medical diagnoses discussed during telehealth appointments, privileged attorney-client conversations, financial details, and personal disclosures. Once transcribed and stored in the cloud, this information is only as secure as the service's infrastructure — and a data breach could expose it all.

Third-Party AI Processing

Some transcription services do not process audio entirely in-house. They may route recordings through third-party AI providers for speech-to-text conversion, summary generation, or other features. Each additional party that handles your data increases the attack surface and reduces your control over how the data is used.

Data Retention That Outlasts Your Expectations

Many users assume their recordings are processed and discarded. In reality, default data retention policies often keep recordings and transcripts for months or even indefinitely. Even after you delete a recording from your account, copies may persist in backups, logs, or AI training datasets.

Skip the manual opt-outs

One opt-out won't stop them — brokers relist your data. PrivacyOn removes your info from 100+ sites and keeps it removed.

Start your free scan

Legal Considerations

Two-Party Consent Laws

In the United States, recording laws vary by state. In two-party consent states — including California, Illinois, Florida, Pennsylvania, Washington, and several others — all participants in a conversation must consent to being recorded. An AI bot that silently joins a meeting and begins transcribing may violate these laws if participants are not clearly informed and given the opportunity to object.

HIPAA and Medical Conversations

If a transcription service is used during telehealth appointments or any conversation involving protected health information (PHI), HIPAA regulations apply. Most consumer-grade transcription tools are not HIPAA-compliant, meaning using them for medical conversations could expose both patients and providers to legal liability.

Attorney-Client Privilege

Lawyers who use transcription services for client conversations risk compromising attorney-client privilege. If recordings are stored on third-party servers and potentially accessed by service employees or used for AI training, the privileged nature of those communications may be legally undermined.

Specific Service Concerns

Otter.ai

Otter.ai stores all recordings and transcripts in the cloud. Its free and pro tiers have historically included provisions allowing data use for service improvement. While Otter offers a business tier with stronger data protections, most individual users are on plans with fewer privacy guarantees. Recordings persist until manually deleted, and the service retains metadata about your meetings and contacts.

Fireflies.ai

Fireflies.ai is designed to automatically join meetings by integrating with your calendar. This means it can appear in meetings and begin recording, sometimes without all participants fully understanding what is happening. While Fireflies sends a notification when it joins, the consent mechanism is passive — participants who do not actively object are recorded by default.

Automatic Meeting Bots Are a Growing Concern

Services like Fireflies.ai and similar tools that auto-join calendar events have drawn criticism from privacy advocates. If you see an unfamiliar bot join your meeting, you have every right to ask what it is and to request that it be removed. Many organizations are now implementing policies that require explicit approval before AI recording tools are used.

Zoom, Microsoft Teams, and Google Meet

Platform-native transcription features in Zoom, Teams, and Google Meet raise their own questions. Where does the transcription processing happen — on-device or in the cloud? How long are transcripts retained? Are they used to train the platform's AI features? The answers vary by platform and plan tier, but enterprise users generally have more control than individual or small-business users. Zoom's AI Companion features, for example, process data in Zoom's cloud, and the company's AI privacy policies have evolved multiple times in response to user backlash.

How to Protect Yourself

You do not have to stop using transcription services entirely, but you should take deliberate steps to minimize risk:

  1. Always inform all participants when a meeting is being recorded or transcribed. This is both a legal requirement in many jurisdictions and a basic courtesy.
  2. Check if the service uses your data for AI training and opt out immediately. Look for settings labeled "model improvement," "service improvement," or similar terms.
  3. Use local or on-device transcription tools whenever possible. OpenAI Whisper can be run entirely on your own computer, and Apple's on-device transcription processes audio without sending it to the cloud. These options keep your recordings under your control.
  4. Delete recordings promptly after extracting the information you need. Do not let audio files accumulate on cloud servers indefinitely.
  5. Review data retention settings in your transcription service and set the shortest retention period available. Disable features that store recordings beyond what you need.
  6. Use enterprise tiers that offer Business Associate Agreements (BAAs), data processing agreements, and stronger privacy commitments. These plans typically include better data handling, geographic data residency options, and contractual protections.
  7. Avoid transcribing highly sensitive conversations — attorney-client discussions, medical consultations, and confidential business negotiations — with consumer-grade cloud services.

How PrivacyOn Helps Protect Your Privacy

AI transcription services do not operate in a vacuum. The meeting metadata they collect — your name, email, company, job title, and professional contacts — can end up in data broker databases. Data brokers aggregate information from dozens of sources to build detailed profiles that can include your communication patterns, professional relationships, and contact details.

PrivacyOn helps reduce your exposure by:

  • Removing your personal information from 100+ data broker sites that aggregate and sell your contact details, professional data, and personal information
  • Continuously monitoring for your data reappearing on people search sites and broker databases, and re-removing it as needed
  • Reducing the amount of publicly available data that can be linked to your meeting activity, making it harder for brokers and bad actors to build comprehensive profiles from your digital footprint

In an era where every spoken word can be captured, transcribed, and stored, taking control of your broader digital privacy is more important than ever. PrivacyOn gives you that control by keeping your personal information out of the databases that feed the data economy.

SC
Sarah Chen

Head of Privacy Research

CIPP/US CertifiedIAPP MemberB.S. Computer Science

CIPP/US-certified privacy researcher with over a decade of experience helping consumers remove their personal information from data brokers.

Ready to Protect Your Privacy?

Let PrivacyOn automatically remove your personal information from data broker sites and keep it removed.