The Future of Test Data Management: Why Synthetic Data is Your Competitive Edge

Published
October 11, 2024

Event Information

Date: Wednesday, 30th October

Time: Europe: 4:00pm CET 

Duration: 45 minutes 

Agenda

  • Test data management complexities and challenges
  • How to use synthetic data for test data management
  • Production-like data with Test data platform:
    • Integrating with your local testing pipelines and automation of the de-identification process
    • Creating a subset of the entire database for cost-optimization and more manageable environments
    • AI-powered PII scanner for automatic data discovery and protection
    • 150+ mockers and masking functions that support consistent generation across tables, databases, systems
    • Rule-based synthetic data with calculated columns 
  • Q&A

Q&A

Questions raised during the session with detailed answers

PII, or Personally Identifiable Information, refers to sensitive data linked to individuals. Privacy regulations make it challenging to use personal data for testing purposes, so it is essential to protect this data accordingly.

The PII scanner detects all PII attributes and identifiers. While a birthdate alone may not uniquely identify an individual, you can customize the scanner to include attributes like birthdate and other variables as needed. Then, our PII scanner can also detect non-identifiers such as the birthdate.


The scanner offers both “shallow” and “deep” scans: a shallow scan reviews metadata, such as column names and data types, while a deep scan leverages advanced entity recognition to analyze actual data in depth. This flexibility allows you to specify which PII types to detect.

Syntho supports handling Blob data, both by duplication and exclusion of such columns. Details can be found in our User Documentation. We can deepdive further into this with you, if desired.

Yes, Syntho can detect and adapt PII data as configured during setup and as demonstrated during the webinar.


More information about our PII scanner can be found here.


More information about our mockers to adapt PII can be found here.

Syntho offers over 150 mock data generators that accurately mimic real-world data characteristics. Rule-based synthetic data can also be customized to suit specific requirements.

Syntho’s Test Data Management solutions are designed to mask and de-identify sensitive data at scale, including complex relational datasets. Syntho’s consistent mapping feature is important to realize preserving consistency and referential integrity for complex relational datasets and works across tables, across databases, across systems and even over time.

It is both viewable in the tool, as well as there is an option to export it as text.

Yes, Syntho’s AI-powered generation automatically captures patterns and complex relationships between columns, reproducing them in the generated synthetic data.

Additionally, Syntho offers rule-based synthetic data methods, including calculated columns, to model business rules from scratch, e.g. for cases where you don’t have any data yet.

Yes, Syntho has a PII text scanner that can identify and mask PII in unstructured text data. For example, it can detect and replace PII in text fields, such as doctor’s notes, by tagging and obfuscating sensitive information like names, dates, and SSNs, while creating mock replacements.


More information can be found on this page under the “Introducing the PII text scanner” section.

Speakers

As founder and CEO of Syntho, Wim Kees aims to turn privacy by design into a competitive advantage with AI-generated test data. He aims to solve key challenges introduced by classic test Data Management tools, which are slow, require manual work, do not offer production-like data, and consequently introduce “legacy by design.”

Wim Kees Janssen

CEO & AI-generated test data expert

Uliana is helping enterprise clients unlock privacy-sensitive data, make smarter data decisions and faster data access, so that organizations can realize data-driven innovation.

Uliana Matyashovska

Business development executive

Marijn has an academic background in computing science, industrial engineering, and finance. Marijn is now acting as founder and CPO at Syntho, driving innovation and strategic vision at the forefront of technology.

Marijn Vonk

CPO & AI-generated test data expert

Build better and faster with synthetic data

Unlock data access, accelerate development, and enhance data privacy. Book a session with our experts now.