Data Classification

Define Classification Levels - Establish clear categories for data sensitivity (e.g. Public, Private, Restricted, Confidential). Familiarize yourself with classification models like Carnegie Mellon Data Classification Guidelines

Catalog Data Assets - Identify all the data assets in your environment and assign ownership. This includes databases, file shares, cloud storage, backups, etc.

Conduct Data Discovery - Use automated scanning tools to identify sensitive data based on predefined patterns (e.g. credit card numbers, SSNs). Don't forget about unstructured data like documents and emails. Popular tools include:

Amazon Macie - Discovers and protects sensitive data in AWS
Microsoft Purview - AI powered data classification
Varonis Data Classification - Classifies data on-premises and in the cloud

Apply Classifications - Tag data with its assigned classification metadata. Most tools allow you to configure classification rules and apply tags automatically. Be sure to involve data owners to validate results.

Implement Handling Procedures - Define and enforce policies for how each class of data should be used, stored, transmitted, and disposed of. Communicate these requirements to all employees. Audit for compliance.

Review & Maintain - Data classification is not a one-time exercise. As data is created and modified, classifications need to be kept up-to-date. Conduct periodic reviews and reclassify data as needed.

Where did this come from?

Who should care?

What is the risk?

What's the care factor?

When is it relevant?

What are the trade offs?

How to make it happen?

What are some gotchas?

What are the alternatives?

Explore further

Learn cloud security with our research blog