View all posts

The Future of Test Data Management: Why Synthetic Data is Your Competitive Edge

Speakers
Wim Kees Janssen
Wim Kees JanssenCEO & founder​
Related resources
What you can expect from this webinar
  • Test data management complexities and challenges
  • How to use synthetic data for test data management
  • Production-like data with Test data platform:
    • Integrating with your local testing pipelines and automation of the de-identification process
    • Creating a subset of the entire database for cost-optimization and more manageable environments
    • AI-powered PII scanner for automatic data discovery and protection
    • 150+ mockers and masking functions that support consistent generation across tables, databases, systems
    • Rule-based synthetic data with calculated columns 
  • Q&A

Frequently Asked Questions

Why should mock data, even if it is PII-related, be protected?

PII, or Personally Identifiable Information, refers to sensitive data linked to individuals. Privacy regulations make it challenging to use personal data for testing purposes, so it is essential to protect this data accordingly.

How do you make sure that all PII like birthdate is detected?

The PII scanner detects all PII attributes and identifiers. While a birthdate alone may not uniquely identify an individual, you can customize the scanner to include attributes like birthdate and other variables as needed. Then, our PII scanner can also detect non-identifiers such as the birthdate.

The scanner offers both “shallow” and “deep” scans: a shallow scan reviews metadata, such as column names and data types, while a deep scan leverages advanced entity recognition to analyze actual data in depth. This flexibility allows you to specify which PII types to detect.

Does Syntho have the capability to handle Blobs?

Syntho supports handling Blob data, both by duplication and exclusion of such columns. Details can be found in our User Documentation. We can deepdive further into this with you, if desired.

Can PII information be detected and adapted?

Yes, Syntho can detect and adapt PII data as configured during setup and as demonstrated during the webinar.

More information about our PII scanner can be found here.

More information about our mockers to adapt PII can be found here.

How do you check the validity of mock data?

Syntho offers over 150 mock data generators that accurately mimic real-world data characteristics. Rule-based synthetic data can also be customized to suit specific requirements.

Can Syntho generate synthetic versions of complex relational datasets (beyond simple tree structures)?

Syntho’s Test Data Management solutions are designed to mask and de-identify sensitive data at scale, including complex relational datasets. Syntho’s consistent mapping feature is important to realize preserving consistency and referential integrity for complex relational datasets and works across tables, across databases, across systems and even over time.

Can we download the PII scan report in Excel or Notepad, or is it only viewable in the tool?

It is both viewable in the tool, as well as there is an option to export it as text.

Are synthetic data generated “in compliance” with implicit business rules? In other words, is the generator capable of inferring business rules?

Yes, Syntho’s AI-powered generation automatically captures patterns and complex relationships between columns, reproducing them in the generated synthetic data.

Additionally, Syntho offers rule-based synthetic data methods, including calculated columns, to model business rules from scratch, e.g. for cases where you don’t have any data yet.

As a finance company, data security is our top priority. Does Syntho support on-premise deployment, and if so, are all features available on-premise?

Yes, we facilitate on-premise deployments and all features are available on-premise.

Can Syntho detect and mask PII in text and unstructured data? Does Syntho work with unstructured data in general?

Yes, Syntho has a PII text scanner that can identify and mask PII in unstructured text data. For example, it can detect and replace PII in text fields, such as doctor’s notes, by tagging and obfuscating sensitive information like names, dates, and SSNs, while creating mock replacements.

More information can be found on this page under the “Introducing the PII text scanner” section.

Save your synthetic data guide now

What is synthetic data?

How does it work?

Why do organizations use it?

How to start?

Privacy Policy

Join our newsletter

Keep up to date with synthetic data news