Privacy GuideMay 21, 20269 min read

How to Protect Your Privacy From AI Resume Scrapers

SC

By Sarah Chen

Head of Privacy Research

How to Protect Your Privacy From AI Resume Scrapers

Your resume contains some of the most detailed personal information you have ever compiled in one place: your full name, contact details, home city, work history, education, skills, and sometimes even your salary expectations. AI-powered scraping tools are now harvesting this data from LinkedIn, job boards, and professional networking sites at an industrial scale, feeding it into databases that are sold to recruiters, marketers, and data brokers. Here is what you need to know and how to protect yourself.

How AI Resume Scraping Works

AI resume scrapers use automated bots and machine learning algorithms to extract professional data from publicly accessible sources. These tools crawl platforms like LinkedIn, Indeed, Glassdoor, GitHub, and personal websites, pulling structured information including names, job titles, employers, skills, education history, certifications, email addresses, and phone numbers.

The scale of this activity is staggering. In 2025, security researchers uncovered an unsecured 16-terabyte database containing 4.3 billion records scraped from LinkedIn profiles, including names, emails, job details, and photos. Companies like ProAPIs were alleged to have run industrial-scale scraping operations using networks of fake LinkedIn accounts, selling access to this data for as much as $15,000 per month.

Modern scraping tools do more than just copy text. AI-powered parsers can extract structured data from unstructured resume documents, categorize skills and experience levels, identify career trajectories, and even infer information like approximate age from graduation dates or salary ranges from job titles and locations.

Who Is Scraping Resume Data and Why

Several types of organizations profit from scraped professional data:

  • Recruiting technology companies: Build massive candidate databases that they sell access to as a service. Tools like Proxycurl (operated by Nubela) offered APIs that returned detailed LinkedIn profile data on demand, until LinkedIn sued them in January 2025.
  • Data brokers and enrichment services: Combine scraped professional data with information from other sources to build comprehensive consumer profiles for sale to marketers and advertisers.
  • AI training pipelines: Scraped resume and profile data is used to train machine learning models for hiring tools, chatbots, and other AI applications, often without the knowledge or consent of the people whose data is being used.
  • Scammers and threat actors: Use scraped professional information for targeted phishing attacks, fake job scams, and identity theft schemes.

LinkedIn's Ongoing Battle Against Scrapers

LinkedIn has ramped up enforcement significantly in 2025 and 2026, deploying browser fingerprinting, rate-based heuristics, and IP reputation scoring to detect bots. The company filed federal lawsuits against Nubela (Proxycurl) and ProAPIs, and removed the LinkedIn company pages of Apollo.io and Seamless.AI for terms-of-service violations. Despite these efforts, the volume of scraping continues to grow as AI tools become more sophisticated at evading detection.

The Real Risks to Your Privacy

Having your resume data scraped and circulated creates concrete dangers:

Identity Theft and Phishing

A scraped resume gives attackers your name, employer, job title, email, phone number, and sometimes your home city. This is more than enough to craft highly convincing spear-phishing emails that appear to come from your employer, a recruiter, or a professional contact. These targeted attacks are far more effective than generic phishing attempts.

Discrimination and Bias

Scraped resume data can reveal or allow inference of your age (from graduation dates), gender, ethnicity (from names or affiliations), disability status, and other protected characteristics. When this data feeds into AI hiring tools, it can enable discriminatory screening, whether intentional or through algorithmic bias.

Career Reputation Damage

Outdated or inaccurate information circulating in scraped databases can follow you for years. An old job title, a position at a company that later became controversial, or skills you no longer practice can surface in searches and influence how employers and colleagues perceive you.

Unauthorized AI Training

LinkedIn acknowledged in late 2025 that it was sharing user data with Microsoft and its subsidiaries for AI model training. Users were given limited time to opt out, and many never saw the notification. Your professional history may already be embedded in AI systems you have no visibility into.

Your Resume Data Persists Even After You Delete It

Once your resume or profile data has been scraped and sold, deleting your original profile does not remove the copies. Scraped data lives in third-party databases, data broker records, and AI training datasets indefinitely. Courts have ordered some companies to destroy scraped data (as in the hiQ Labs v. LinkedIn settlement), but enforcement is limited and most scraped data is never recovered.

How to Protect Your Privacy

1. Tighten Your LinkedIn Privacy Settings

  • Limit profile visibility: Under Settings, set your public profile to show minimal information. Consider hiding your email, phone number, and specific dates.
  • Disable data sharing for AI training: Go to Data Privacy settings and opt out of LinkedIn using your data for AI model training.
  • Control who sees your connections: Hide your connections list to prevent scrapers from mapping your professional network.
  • Turn off profile visibility to search engines: This prevents Google and other search engines from indexing your profile, which is one of the primary ways scrapers discover profiles.

2. Be Strategic on Job Boards

  • Set resumes to private or confidential: On sites like Indeed and Monster, choose the option that hides your resume from public search and limits visibility to employers you apply to directly.
  • Remove resumes after your job search: Do not leave active resumes on job boards indefinitely. Delete or deactivate them when you are no longer actively searching.
  • Use platform-specific email addresses: Create a dedicated email for job searching so scraped contact information cannot be linked to your primary accounts.
  • Omit sensitive details: Leave out your full home address, date of birth, and personal phone number from uploaded resumes. A city and state are sufficient for location.

3. Remove Your Data From Broker Sites

Scraped resume data frequently ends up in data broker databases, where it is combined with other personal information and resold. Manually finding and opting out of each broker is a slow, repetitive process. PrivacyOn removes your personal information from over 100 data broker sites on an ongoing basis, including brokers that aggregate professional and employment data. With 24/7 monitoring, PrivacyOn detects when your data reappears and submits new removal requests automatically, starting at just $8.33 per month.

4. Limit What You Share Publicly

  • Audit your online presence: Search for your name combined with your employer or job title to see what is publicly accessible. Request removal of any outdated or unauthorized postings.
  • Be selective on professional sites: You do not need a detailed profile on every platform. Focus your presence on the one or two sites most relevant to your industry and minimize detail elsewhere.
  • Use a professional website you control: Instead of relying solely on third-party platforms, consider hosting your professional information on your own domain where you control indexing, access, and content.

5. Monitor for Exposure

Regularly check whether your professional data has appeared in unexpected places. Set up Google Alerts for your name combined with employer names or job titles. PrivacyOn's dark web monitoring can alert you if your professional or personal data surfaces in breached databases. Family plans cover up to 5 people, making it practical to protect everyone in your household.

6. Use Strong Security Practices

  • Enable two-factor authentication on all professional platforms, especially LinkedIn, Indeed, and any site where your resume is stored.
  • Use unique passwords for each career platform to prevent a breach on one site from compromising your accounts elsewhere.
  • Review app permissions: Third-party apps connected to your LinkedIn account may have access to your profile data. Revoke access for any apps you no longer use.

The Legal Landscape Is Shifting

The legal status of resume scraping remains unsettled. The landmark hiQ Labs v. LinkedIn case, which initially suggested scraping public data might be permissible, ultimately ended with hiQ paying $500,000 in damages and agreeing to destroy all scraped data and algorithms derived from it. LinkedIn's aggressive enforcement actions in 2025 and 2026 signal that platforms are increasingly willing to pursue legal remedies against scrapers.

Regulations are also tightening. CCPA gives California residents the right to demand data deletion from brokers within 45 days. GDPR provides similar rights in Europe. Several U.S. states have passed or proposed new privacy laws that specifically address automated data collection and AI training data.

The Bottom Line

Your professional history is valuable, and AI scrapers are working around the clock to harvest it. While you cannot fully prevent scraping of publicly accessible information, you can significantly reduce your exposure by tightening privacy settings, minimizing what you share, removing data from broker sites, and monitoring for unauthorized use. Taking these steps now, especially using a service like PrivacyOn to handle the ongoing work of data removal and monitoring, means less of your personal and professional information ends up in databases you never authorized and cannot control.

SC
Sarah Chen

Head of Privacy Research

CIPP/US CertifiedIAPP MemberB.S. Computer Science

CIPP/US-certified privacy researcher with over a decade of experience helping consumers remove their personal information from data brokers.

Ready to Protect Your Privacy?

Let PrivacyOn automatically remove your personal information from data broker sites and keep it removed.