Not everything belongs in AI. Here's what to keep out and why—plus a simple test to decide what's safe.

🎯 Find Out What AI Can Automate in Your Business

Get a free AI-powered analysis of your workflows. See which tasks to automate first, how much time you'll save, and get a personalized implementation plan.

Get Free Analysis → No signup required • Results in 30 seconds

The Never-Upload List

These categories should never go into third-party AI systems:

1. Authentication Credentials

  • Passwords
  • API keys
  • Private keys and certificates
  • OAuth tokens

Why: If leaked, these give direct access to your systems. AI systems can potentially expose data or be used to generate similar credentials.

2. Protected Personal Information

  • Social Security Numbers
  • Passport numbers
  • Driver's license numbers
  • Bank account and credit card numbers

Why: Regulatory requirements (GDPR, CCPA, etc.) and identity theft risk. Most AI platforms aren't designed to handle this data.

3. Protected Health Information (PHI)

  • Medical records
  • Health insurance numbers
  • Treatment information
  • Any data linkable to health status

Why: HIPAA requires HIPAA-compliant systems. Standard AI platforms don't meet these requirements without specific configuration.

4. Trade Secrets & Intellectual Property

  • Proprietary algorithms
  • Source code for competitive products
  • Formulas and recipes
  • Strategic plans not yet public

Why: Some AI vendors train on customer data. You could be giving away your competitive advantage.

5. Legal & Compliance Risks

  • Ongoing litigation documents
  • Attorney-client communications
  • Documents under NDA
  • Export-controlled information

Why: Uploading could waive privilege or breach contracts.

6. Financial Records (Unnecessary)

  • Detailed financial statements
  • Tax documents
  • Audit reports

Why: Often not needed for AI tasks and creates unnecessary exposure risk.

The Simple Test

Before uploading data to any AI system, ask:

"Would this damage us if it appeared publicly?"

If yes, don't upload it to a public or shared AI system.

Gray Areas: When It Depends

Data TypePublic AIEnterprise AIOn-Premise AI
Customer names + emails⚠️ Check policy✅ Usually OK✅ Safe
Internal documents❌ Risky✅ Usually OK✅ Safe
Sales data⚠️ Strip PII first✅ Usually OK✅ Safe
Employee records❌ No⚠️ Check compliance✅ Safe

What's Generally Safe

These are usually fine for most AI systems:

  • Public marketing content
  • Anonymized/analytics data
  • General business knowledge
  • Process documentation (without sensitive details)
  • Templates and SOPs

Questions to Ask Your AI Vendor

  1. Do you train on my data?
  2. Is my data encrypted at rest and in transit?
  3. Who can access my data?
  4. Do you have SOC 2 or ISO 27001 certification?
  5. Where is my data stored?
  6. How do you handle data deletion requests?

Not sure what's safe to automate?

Book a free consultation. We'll help you identify what AI can safely handle and what should stay manual.

Get Security Assessment →