In the previous part of the series, we’ve examined the second approach to filling the database in with data for testing and development purposes. It is the synthetic data generation approach.
Tag: synthetic data generation
Synthetic Data Generation. Part 3: Backup and Restore
Implementation of the general script for data sanitization and the secret data changes
We’ve examined simple examples for each type of altered data in the previous articles (Part 1: Data Copying, Part 2: Data Changing):
- Changing the date and time;
- Changing the numerical value;
- Changing the byte sequence;
- Changing the characters’ data.
However, the examples described above don’t meet the criteria 2 and 3 for the data altering scripts:
Read MoreSynthetic Data Generation. Part 2: Data Changing
Character Data Change
Here, we take an example for the English and Russian alphabets, but you can do it for any other alphabet. The only condition is that its characters must be present in the NCHAR types.
We need to create a function that accepts the line, replaces every character with a pseudorandom character, and then puts the result together and returns it.
Read MoreSynthetic Data Generation. Part 1: Data Copying
Introduction
Sooner or later, any information system gets a database, often – more than one. With time, that database gathers very much data, from several GBs to dozens of TBs. To understand how the functionals will perform with the data volumes increasing, we need to generate the data to fill that database.
All scripts presented and implemented will execute on the JobEmplDB database of a recruiting service. The database realization is available here.
Read More