Amazon Macie — A fully managed data security and data privacy service that uses machine learning and pattern matching to discover and protect sensitive data in AWS.


Overview

Macie automatically discovers and classifies sensitive data (PII, PHI, financial data) in your S3 buckets, helping you meet compliance requirements like GDPR, HIPAA, and PCI DSS.

Key Insight: Macie is like an automated data auditor — it scans your S3 buckets to find sensitive information you might not even know exists, helping prevent data leaks.


Core Macie Concepts

ConceptDescriptionKey Point
Sensitive DataPII, PHI, financial data, credentialsAutomatically discovered
Data ClassificationCategorizes data by sensitivityML-powered detection
FindingDiscovered sensitive data issueSeverity levels
Managed Data IdentifiersPre-built patterns for common data typesPII, credentials, financial
Custom Data IdentifiersYour own regex patterns for specific dataCompany-specific formats
S3 Bucket InventoryList of all buckets with sensitivity scoresPrioritize remediation

How Macie Works

flowchart TD
    subgraph Buckets["Your S3 Buckets"]
        B1["Bucket A"]
        B2["Bucket B"]
        B3["Bucket C"]
        BN["Bucket N"]
    end

    Scan["Macie scans S3 objects"]
    Detect["ML + pattern matching identify sensitive data"]
    Classify["Classify by data type<br/>(PII, PHI, financial, credentials)"]
    Findings["Generate findings with severity and context"]

    B1 --> Scan
    B2 --> Scan
    B3 --> Scan
    BN --> Scan
    Scan --> Detect --> Classify --> Findings

    Findings --> Hub["Security Hub"]
    Findings --> SNS["SNS alerts"]
    Findings --> EB["EventBridge automation"]

Data Types Macie Detects

Managed Data Identifiers

CategoryExamples
PIINames, addresses, phone numbers, email addresses
FinancialCredit card numbers, bank account numbers, tax IDs
CredentialsAWS keys, API keys, passwords
HealthMedical record numbers, diagnoses
LegalPassport numbers, driver’s license numbers

Recent Enhancement (January 2025)

Macie now uses Amazon Textract to detect sensitive data in images stored in S3 (PDFs, photos, scanned documents).


Key Features

FeatureDescription
Automated DiscoveryContinuously scans S3 for sensitive data
Machine LearningImproves detection accuracy over time
PII DetectionFinds personally identifiable information
Bucket InventoryView all buckets with sensitivity scores
FindingsDetailed alerts with severity and remediation
Integration with Security HubCentralized findings management

Finding Severity Levels

SeverityDescriptionExample
HighLarge amount of sensitive data exposed publicly10,000+ credit card numbers in public bucket
MediumSensitive data in accessible locationPII in bucket with known access
LowSmall amount or limited exposureFew records in private bucket
InformationalData discovery, not necessarily a riskInternal bucket contains PII

Use Cases

Use CaseDescription
GDPR ComplianceDiscover and protect EU citizen data
HIPAA ComplianceFind protected health information (PHI)
Data Loss PreventionPrevent accidental data exposure
Data InventoryKnow what sensitive data you have
Incident ResponseInvestigate potential data breaches

Pricing

ComponentPriceFree Trial
Data ClassificationPer GB evaluated30-day free trial
Bucket MonitoringPer bucket per monthIncluded in free tier

⚠️ Pricing Disclaimer: AWS pricing is subject to change. Always verify current pricing at the official Macie pricing page.


Macie vs Other AWS Security Services

ServiceFocusComplementarity
MacieSensitive data discoveryFinds WHAT sensitive data exists
GuardDutyThreat detectionDetects active attacks/anomalies
ConfigConfiguration complianceEnsures S3 is properly secured
ShieldDDoS protectionProtects against attacks

TL;DR

  • Amazon Macie = ML-powered sensitive data discovery for S3
  • Scans = S3 buckets for PII, PHI, financial data, credentials
  • Detection = Machine learning + pattern matching
  • Enhanced 2025 = Now scans images with Amazon Textract
  • Pricing = Per GB evaluated; 30-day free trial
  • Use Cases = GDPR/HIPAA compliance, data loss prevention, data inventory
  • Integrates with = Security Hub (centralized findings)

Resources

Amazon Macie Documentation Complete Macie user guide.

Macie Pricing Detailed pricing breakdown.

Macie Data Identifiers List of all managed data identifiers.