Synthetically identical customer data

Automate the discovery and masking of sensitive data, generating synthetically identical customer data that:

✔ Looks and behaves just like real data (so your applications, AI models and tests function as expected).
✔ Maintains relationships and consistency across tables, files, databases and environments.
✔ Cannot be reverse-engineered back to real customer data - ensuring compliance with GDPR, CCPA, HIPAA and more.

How it works

As data leaves your production environment, DataMasque simply and irreversibly replaces the customer data with synthetically identical customer data that is realistic, functional and consistent.

Source Prod DB:
  • John Smith
  • Customer # 20395
  • 4423-5956-0021-2020
  • john@gmail.com
  • 123 Tizard Road Mosman, Sydney
  • $48.50
  • Roaming Plus

Staging:
  • Joe Blogs
  • Customer # 20395
  • 4123-2233-4455-6677
  • joe.j.b@gmail.com
  • 48 Ashbourne Grove Geelong, Victoria
  • $48.50
  • Roaming Plus

From the Secure Zone, the masked data moves on to isolated environments: Development, Testing, Analytics, Data Lakes, Training and Machine Learning.

Why DataMasque? 

Discover sensitive data

DataMasque provides powerful discovery functionality to identify and classify sensitive customer data. It combines metadata keyword search with in-data pattern matching, using our built-in keywords and patterns as well as custom patterns set by the user.
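
The two discovery approaches described above can be illustrated with a minimal Python sketch. This is not DataMasque's implementation: the keyword list, the regular expressions and the `discover` function are all hypothetical, chosen only to show how a column can be flagged either by its name (metadata) or by its contents (in-data patterns).

```python
import re

# Hypothetical keyword and pattern lists, for illustration only.
METADATA_KEYWORDS = ("name", "email", "card", "ssn", "address")
DATA_PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "card_number": re.compile(r"^\d{4}-\d{4}-\d{4}-\d{4}$"),
}


def discover(columns):
    """Return {column_name: sorted list of reasons the column was flagged}.

    `columns` maps a column name to a list of sample values.
    """
    findings = {}
    for column, samples in columns.items():
        reasons = set()
        # Metadata keyword search: inspect the column name only.
        for keyword in METADATA_KEYWORDS:
            if keyword in column.lower():
                reasons.add(f"keyword:{keyword}")
        # In-data pattern search: inspect the sampled values.
        for label, pattern in DATA_PATTERNS.items():
            if any(pattern.match(str(value)) for value in samples):
                reasons.add(f"pattern:{label}")
        if reasons:
            findings[column] = sorted(reasons)
    return findings


sample = {
    "customer_name": ["John Smith"],
    "contact": ["john@gmail.com"],
    "plan": ["Roaming Plus"],
}
print(discover(sample))
```

In this sketch, `customer_name` is caught by its column name, `contact` is caught only because its values look like email addresses, and `plan` is not flagged at all.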

Automated and secure

DataMasque has been designed with modern enterprise environments in mind. With horizontal scalability, a containerised deployment model, and an automation API, DataMasque can slot seamlessly into your existing data provisioning process or DevOps pipeline, whether you’re running on-prem or in a public cloud environment.

Consistency of data

DataMasque provides consistency of data across different environments, including tables, files, databases, clouds, and even organizations. DataMasque supports masking of primary keys and unique keys, and automatically maintains referential integrity of foreign keys. This ensures data consistency across all occurrences of information ‘types’, which are masked using the same algorithm across files, tables, databases, and database engines.

No compromises

DataMasque generates synthetically identical customer data that is realistic and representative of production datasets, meaning you no longer have to compromise between data quality and security. DataMasque ‘imitates’ datasets by retaining the value of the data without any personally identifiable information, giving you the same data functionality and statistical benefit.

Data Irreversibility

If data is dynamically masked or encrypted, your sensitive data can still be unlocked and returned to its original state. With DataMasque, we remove your PII and replace it with synthetically identical customer data that looks just like the original but poses no risk of re-identification.

Proactive Schema Alerting

The only constant is change, which is why DataMasque developed a proactive sensitive data alert. It notifies you when a schema change has been flagged as potentially containing sensitive data. Our sensitive data discovery doesn’t just look for what we think is sensitive: you can upload your own data dictionary for our tool to monitor.

DataMasque’s synthetically identical customer data

Customer Data
Name | ID Number | Card Number
John/Smith | 221-22-1111 | 5565-5700-2427-7079
Kimberly.Berry | K00-09-7443 | -
Walter Allan | AA876078TT | 3415-4157-7075-207

Traditional Masking Challenges
Name | ID Number | Card Number
Brian/Main | 004-99-2756 | 8573232224112097
Jane/Doe | 400-23-8620 | 2116593810151072
Tom/Hanks | 765-76-8736 | 9771222776019352
  • Ineffective test data
  • Data cleansing
  • Breaks application logic

DataMasque
Name | ID Number | Card Number
Brian/Main | 004-99-2756 | 5565-0294-1312-1209
Jane.Doe | T39-32-2876 | -
Tom Hanks | VD576873RE | 3415-1200-6511-123
✔ Quality data for meaningful testing
✔ Format-preserving masking
✔ Maintains application logic
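
Format-preserving masking, as contrasted with traditional masking above, can be sketched in a few lines of Python. This is an illustration only and not DataMasque's algorithm: the hypothetical `mask_preserving_format` function swaps each digit for a random digit and each letter for a random letter of the same case, leaving separators in place. Unlike the DataMasque example above, it makes no attempt to keep card prefixes or checksums valid.

```python
import secrets
import string


def mask_preserving_format(value):
    """Replace digits with random digits and letters with random letters of
    the same case, leaving separators such as '-' untouched."""
    masked = []
    for char in value:
        if char in string.digits:
            masked.append(secrets.choice(string.digits))
        elif char in string.ascii_uppercase:
            masked.append(secrets.choice(string.ascii_uppercase))
        elif char in string.ascii_lowercase:
            masked.append(secrets.choice(string.ascii_lowercase))
        else:
            # Separators and any other characters keep their position.
            masked.append(char)
    return "".join(masked)


print(mask_preserving_format("5565-5700-2427-7079"))
print(mask_preserving_format("AA876078TT"))
```

Because the length, separators and character classes survive, an application that validates the layout of a card number or ID still accepts the masked value.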

What platforms does DataMasque support?

DataMasque supports the following platforms and data sources.
Don’t see your preferred technology listed?
Get in touch and let us know what you’re after.
Platforms
Cohesity Marketplace
AWS Marketplace
Azure Marketplace
Software Platforms
Platform | Version
Ubuntu | 20.04, 22.04, or 24.04 LTS (x86 64-bit)
Red Hat Enterprise Linux | RHEL 8.7+, 9.x (x86 64-bit)
Amazon EKS | Kubernetes 1.29-1.32
Databases
Database | Version
Oracle Database | 11gR2, 12gR1, 12gR2, 18c, 19c
Microsoft SQL Server | 2012, 2014, 2016, 2017, 2019, 2022
MariaDB | 10.11, 11.2
MySQL | 5.7, 8
PostgreSQL | 9.6, 10, 11, 12, 13, 14, 15, 16
IBM Db2 LUW | 11.5
Amazon Aurora | -
Amazon RDS | -
Amazon Redshift | -
Amazon DynamoDB | -
Azure SQL Server | -
File Masking
Storage Platforms | AWS S3, Azure Blob Storage, NFS and SMB
Tabular | CSV (and other character delimited), Fixed Width & Parquet files
Object | JSON & XML files
Multi-Record | Apache Avro & NDJSON files

Our Plans

Choose your plan | Free Trial (30 days) | Cloud Consumption | Business | Enterprise
Mask Data Size | Up to 1000 rows per run | Up to 500 GB | Unlimited | Unlimited
Data sources (Relational Databases, NoSQL Databases, Data Warehouses, Files) | ✔ | One (1) data source included of any type | One (1) data source included of any type | All data sources included
RESTful API | ✔ | ✔ | ✔ | ✔
Sensitive Data Discovery | ✔ | ✔ | ✔ | ✔
Compliance Reporting | ✔ | ✔ | ✔ | ✔
Automatic masking ruleset generation | ✔ | ✔ | ✔ | ✔
Parallelism and Multi-processing | ✔ | ✘ | ✔ | ✔
Proactive Data Discovery | ✔ | ✘ | ✔ | ✔
Cross Data Store Consistency | ✔ | ✘ | ✔ | ✔
SSO/SAML | ✔ | ✘ | ✔ | ✔
Role Based Access Control | ✔ | ✘ | ✔ | ✔
Safe Data Exchange | ✔ | ✘ | ✔ | ✔
Masking Templates | ✔ | ✘ | ✔ | ✔
Product Training | ✘ | ✘ | 2hrs | 2hrs
Support Level | ✘ | Email only | Standard | Premium

Commonly Asked Questions

What is DataMasque?

DataMasque is a data masking tool that enables you to replace sensitive values in databases with artificial but realistic alternatives while maintaining data integrity. Data masking can enhance testing, development and training by providing realistic and functional data in these scenarios without compromising security, privacy or operational accuracy.

What database engines are supported?

DataMasque currently supports most databases and file formats. If your database isn’t listed in our Support Matrix, please reach out to the team, as we have connectors and workarounds for even the most obscure databases.

What deployment platforms does DataMasque support?

DataMasque supports the following deployment options:

  • As a Cohesity App through the Cohesity marketplace.
  • As a set of Docker containers deployed on Red Hat Enterprise Linux or Ubuntu with Docker Compose.
  • As a pre-configured product through the AWS Marketplace and Azure Marketplace.

What are the interfaces to access DataMasque?

DataMasque can be accessed using a browser-based web interface, which provides a centralized single-pane-of-glass view of your entire data masking environment. DataMasque also provides a robust set of REST API endpoints, allowing you to easily utilize its various features through automated processes.

Can DataMasque discover sensitive data?

DataMasque can discover sensitive data by performing a metadata keyword search to provide you with a starting point for data masking, allowing you to prioritize which data sources to protect. Sensitive data discovery tasks can also be included with every data masking run to identify new sensitive information, providing proactive and ongoing protection.

How does DataMasque support irreversible data masking?

DataMasque removes sensitive data from your database and replaces it with realistic masked data. By default, masked values are randomly generated, preventing recovery of the original data. When data consistency across tables is required, DataMasque generates masked values using cryptographically secure SHA-512 salted hashes of the original values.

Can I achieve consistency across tables, databases and database engines?

DataMasque provides masking algorithms that generate values based on one or more unmasked column values using a cryptographically secure SHA-512 salted hash. Use of these algorithms ensures deterministic masking across columns, tables, and database engines.
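
The idea of deterministic masking from a salted SHA-512 hash can be sketched as follows. This is a simplified illustration under stated assumptions, not DataMasque's actual algorithm: the `mask_deterministic` function, the replacement pool and the salt value are all hypothetical, and the salt is simply prepended to the value before hashing.

```python
import hashlib

# Hypothetical replacement pool, for illustration only.
FIRST_NAMES = ["Joe", "Brian", "Jane", "Tom", "Kimberly", "Walter"]


def mask_deterministic(value, salt, choices):
    """Pick a replacement from `choices` using a salted SHA-512 hash of `value`.

    The same (value, salt) pair always selects the same replacement.
    """
    digest = hashlib.sha512(salt + value.encode("utf-8")).digest()
    index = int.from_bytes(digest, "big") % len(choices)
    return choices[index]


salt = b"example-secret-salt"  # assumed to be kept secret in practice
first = mask_deterministic("John", salt, FIRST_NAMES)
second = mask_deterministic("John", salt, FIRST_NAMES)
print(first == second)  # the same input always masks to the same output
```

Because the output depends only on the input value and the salt, the same customer name masks to the same replacement in every table, file or database engine that is masked with the same salt.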

Can I mask the primary key? How does the masked table maintain database relationships?

DataMasque supports masking of primary keys and unique keys. DataMasque automatically maintains referential integrity of foreign keys and also allows users to propagate masked values to implied foreign keys.
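
To show why propagating masked keys preserves relationships, here is a minimal Python sketch. It is an assumption-laden illustration rather than a description of DataMasque internals: the table layout, the `build_key_mapping` and `mask_tables` helpers and the second sample customer are hypothetical. Each primary key receives a unique random replacement, and the same mapping is then applied to the foreign key column.

```python
import secrets


def build_key_mapping(keys):
    """Assign each original key a unique random replacement key."""
    mapping = {}
    used = set()
    for key in keys:
        new_key = secrets.randbelow(10**9)
        while new_key in used:  # retry until the replacement is unique
            new_key = secrets.randbelow(10**9)
        used.add(new_key)
        mapping[key] = new_key
    return mapping


def mask_tables(customers, orders):
    """Mask customers.id and propagate the same mapping to orders.customer_id."""
    mapping = build_key_mapping(row["id"] for row in customers)
    masked_customers = [{**row, "id": mapping[row["id"]]} for row in customers]
    masked_orders = [
        {**row, "customer_id": mapping[row["customer_id"]]} for row in orders
    ]
    return masked_customers, masked_orders


# Sample rows; the second customer and both orders are made up for the example.
customers = [{"id": 20395, "plan": "Roaming Plus"}, {"id": 20396, "plan": "Basic"}]
orders = [{"order_id": 1, "customer_id": 20395}, {"order_id": 2, "customer_id": 20396}]
masked_customers, masked_orders = mask_tables(customers, orders)
print(masked_customers)
print(masked_orders)
```

Every order still points at an existing customer after masking, so joins between the two tables return the same rows as before.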

Does DataMasque provide APIs for all operations?

DataMasque currently provides a REST API for triggering and monitoring masking runs using pre-configured connections and rulesets. We have plans to open more API endpoints in the future.

Recommended by

“DataMasque helps us lower the likelihood of a breach and secures our customers’ data, which means we avoid expensive regulatory fines.”
Joseph Landucci, Director of Technology Management, Best Western Hotels and Resorts

Ready to see DataMasque in action?

Request a personalized demo.