PII masking — replace sensitive data with synthetic values.
Detects PII columns (email, phone, name, SSN, etc.) via column name
heuristics and profiler pattern detection, then replaces values with
realistic synthetic data while preserving null patterns and distributions.
Classes
MaskConfig
dataclass
Configuration for data masking.
MaskResult
dataclass
Result of masking operation.
Methods:
summary()
Return a human-readable summary of the masking result.
DataMasker
Replace PII in real data with synthetic values preserving distributions.
Methods:
mask(tables, config=None)
Mask PII columns across all tables.
Parameters
tables:
Mapping of table name to DataFrame.
config:
Optional masking configuration. Defaults are sensible.
Returns
MaskResult with masked DataFrames and statistics.