Frequently asked questions about the DATPROF solution
Is DATPROF cloud based or on premise?
DATPROF is an on-premise software solution. Because it connects to databases that may contain privacy-sensitive data, the software is usually installed in the same domain as those databases. Data is not extracted from the database into our software; instead, the software connects to the databases and does the processing there. No data leaves the database domain.
Can you support data masking and synthetic test data?
Yes. DATPROF Privacy can both mask existing data and generate synthetic test data. It comes with several built-in masking rules and several synthetic test data generators and seed lists. You can, for example, generate new social security numbers and bank account numbers, or assign new first names from a seed list.
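To make the idea of seed lists and generators concrete, here is a minimal Python sketch. It is not how DATPROF Privacy works internally: the seed list, the function names and the nine-digit number format are all assumptions chosen for illustration, and the generated numbers are not checked against any real social security number rules.

```python
import random

# Hypothetical seed list; a real project would load a much larger one.
FIRST_NAME_SEEDS = ["Anna", "Bram", "Chloe", "Daan", "Eva", "Finn"]


def mask_first_names(names, seeds, seed=0):
    """Replace every name with a value drawn at random from the seed list."""
    rng = random.Random(seed)  # a fixed seed keeps runs reproducible
    return [rng.choice(seeds) for _ in names]


def generate_numbers(count, digits=9, seed=0):
    """Generate unique random digit strings as stand-ins for ID numbers."""
    rng = random.Random(seed)
    numbers = set()
    while len(numbers) < count:
        numbers.add("".join(rng.choice("0123456789") for _ in range(digits)))
    return sorted(numbers)


masked = mask_first_names(["John", "Mary", "Peter"], FIRST_NAME_SEEDS)
new_ids = generate_numbers(3)
```

The original names never appear in the output, because every value comes from the seed list rather than from the source data.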
Do you support messaging, like XML files?
We support several flat file types, for example Comma Separated Values (CSV), Fixed Length (FL), Recursive Fixed Length (RFL) and Extensible Markup Language (XML).
Which database technologies do you support?
We support several database technologies, with native support for Oracle, Microsoft SQL Server, DB2, PostgreSQL and EDB. For more information about the supported versions, check our docs.
What if I've got another database?
There are two options. The first is to export the data to a database we do support, execute the masking or subsetting process there, and then import the data back into the original database. We’ve done this on several occasions and it works well. The second option is to ask us what effort it would take to develop support for your database technology.
Can we schedule or automate our runs?
Yes, you can. With the scheduler in DATPROF Runtime you can schedule all your runs. You can also use the DATPROF Runtime API to make test data runs part of your CI/CD pipeline. With our API you can automate nearly everything. For more information about the API, check out our docs.
Do you support End-to-End Test Data Management?
Yes, we do. This has always been one of our major efforts, because masking or subsetting test data across all your different systems is a necessity.
How reusable is the DATPROF Solution?
We wanted to make sure that you develop once and deploy multiple times. So every app you develop in DATPROF Privacy or DATPROF Subset, you only develop once. Afterwards you deploy these apps in DATPROF Runtime and you can execute them.
What about data model changes?
Data models change over time, and when they do you need to adapt. With our synchronization wizard you can easily adapt to a new data model, and within a few clicks your DATPROF app is ready to go.
Loading and selecting databases, selecting the input
How do you select a database? That’s pretty easy: you need the connection address plus credentials. Then you select the database you want to mask or anonymize. After you have selected the database for masking, you also need to import its metadata.
How to use the metadata of a data model?
After you have entered the connection credentials, you import the metadata. The metadata describes the characteristics of the database: for example, table names, column names, sizing and data types. The foreign keys found in the metadata on import are especially valuable for DATPROF Privacy and DATPROF Subset.
Sometimes foreign keys are not stored as part of the metadata of the database. In that case we can also import metadata that is stored in a table or provided by a script before we start a masking process. If none of these options covers the need for metadata, you can also add the missing relationships in the tool itself. These relationships are only known within the tool and are in no way written to the database itself.
Within DATPROF Privacy, the metadata supports the data masking process: it enables you to put masking functions on tables and columns easily.
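The combination of foreign keys imported from the database and relationships that exist only in the tool can be pictured with a small Python sketch. The `ForeignKey` type, the function name and the example tables are hypothetical and do not reflect DATPROF's internal data model.

```python
from typing import NamedTuple


class ForeignKey(NamedTuple):
    """One relationship: child_table.child_column refers to parent_table.parent_column."""
    child_table: str
    child_column: str
    parent_table: str
    parent_column: str


def combine_relationships(from_database, defined_in_tool):
    """Merge both sources, dropping duplicates while keeping the original order."""
    return list(dict.fromkeys([*from_database, *defined_in_tool]))


# Foreign keys found in the database metadata (example values).
imported = [ForeignKey("orders", "customer_id", "customers", "id")]

# Relationships added by hand; they live only in this list, not in the database.
manual = [
    ForeignKey("invoices", "order_id", "orders", "id"),
    ForeignKey("orders", "customer_id", "customers", "id"),  # duplicate of the imported one
]

relationships = combine_relationships(imported, manual)
```

Nothing in this sketch touches the database: the manually defined relationships are plain values held by the tool, which mirrors the point that they are never transported into the database itself.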
How are masking functions selected on tables and/or columns?
After the metadata is imported, the tool presents the tables and columns to you so you can develop data masking functions. You create a function by selecting a table and, within this table, one or more columns that you want to put a particular data masking function on. You are then able to select the masking function by right-clicking or by clicking the function button. A variety of functions is available to cover your anonymization needs.
After adding these functions to columns, you can easily edit them. If a function should cover more than one column, you can edit the function and add more columns to it. For example, if you want to shuffle first names, it might be useful to shuffle gender along with them, so that the relation between the first name and the recorded gender is kept.
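The first name and gender example can be illustrated with a short Python sketch of a multi-column shuffle. This is only an illustration of the idea, not DATPROF's shuffle implementation; the function name and the row format (a list of dictionaries) are assumptions.

```python
import random


def shuffle_columns(rows, columns, seed=0):
    """Shuffle the given columns across rows, moving them together as one unit."""
    rng = random.Random(seed)  # a fixed seed keeps runs reproducible
    # Collect the selected column values of every row as one tuple per row.
    values = [tuple(row[c] for c in columns) for row in rows]
    rng.shuffle(values)
    # Write the shuffled tuples back; all other columns stay untouched.
    shuffled = []
    for row, vals in zip(rows, values):
        new_row = dict(row)
        new_row.update(zip(columns, vals))
        shuffled.append(new_row)
    return shuffled


people = [
    {"id": 1, "first_name": "John", "gender": "M"},
    {"id": 2, "first_name": "Mary", "gender": "F"},
    {"id": 3, "first_name": "Peter", "gender": "M"},
    {"id": 4, "first_name": "Susan", "gender": "F"},
]
masked = shuffle_columns(people, ["first_name", "gender"])
```

Because first name and gender travel as a pair, every masked row still holds a combination that occurred in the source data. Shuffling the two columns independently would not give that guarantee.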
How to synchronize an existing template with a changed database model?
Changes in a database model do occur. So when you create a template with DATPROF Privacy, it is important that the maintenance effort for these kinds of changes is kept to a bare minimum. For this purpose the synchronize metadata button was created. Within the project settings screen of DATPROF Privacy, you are able to synchronize the metadata model. The synchronization searches for differences between the metadata saved within the template and the metadata as present in the connected database. As a user of DATPROF Privacy you are able to accept or discard these changes.
After the synchronization has finished, the template can be updated and executed. The only thing to keep in mind is that newly introduced tables or columns might contain privacy-sensitive data that should be masked. If that is the case, you can create new functions or edit existing ones as needed.
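The comparison behind such a synchronization can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the actual synchronization logic: metadata is reduced to a mapping of table names to sets of column names, and the function name is hypothetical.

```python
def diff_metadata(saved, current):
    """Compare two {table_name: set_of_column_names} mappings."""
    changes = {
        "added_tables": sorted(current.keys() - saved.keys()),
        "removed_tables": sorted(saved.keys() - current.keys()),
        "added_columns": {},
        "removed_columns": {},
    }
    # Only tables present on both sides can have column-level differences.
    for table in sorted(saved.keys() & current.keys()):
        added = current[table] - saved[table]
        removed = saved[table] - current[table]
        if added:
            changes["added_columns"][table] = sorted(added)
        if removed:
            changes["removed_columns"][table] = sorted(removed)
    return changes


# Metadata as saved in the template versus metadata read from the database.
template = {"customer": {"id", "name"}, "orders": {"id"}}
database = {"customer": {"id", "name", "email"}, "invoice": {"id"}}
changes = diff_metadata(template, database)
```

The entries under `added_tables` and `added_columns` are the ones to review first, since new tables or columns may hold privacy-sensitive data that no masking function covers yet.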
How is the data masking process started? Can it be monitored?
With DATPROF Privacy there are two ways to start a data masking process. The first is directly from the DATPROF Privacy application. The second is through DATPROF Runtime, where any template created in DATPROF Privacy can be scheduled and executed, for example to run the data masking process during the night.
The process can be monitored through the logging within DATPROF Privacy. If a run fails, you are able to see where it failed and solve the problem at hand.