Test Data Management Overview
Test Data Management Overview
Efficient management of data used for testing is essential to maximizing return on investment and
supplementing the testing efforts for the highest levels of success and coverage. If the data used in
testing does not promote ease of use and adaptation, poorly represents the sampled source, or
consumes excessive resources for preparation and maintenance, a negative impact on the desired
outcome quickly manifests and continues to degrade the quality of results. To balance in favor of
positive results and improved returns, consider the process, potential challenges, and possible solutions
involved in TDM.
Planning
Analysis
Design
Build
Maintenance
Tools
The use of quality tools promotes quality results in any line of work, and it is no different when it comes
to TDM. Links with useful tools are provided below.
http://www-01.ibm.com/software/data/optim/
http://www.informatica.com/us/products/
http://www.grid-tools.com/test_data_managementsolutions/test_data_management_tools/
White Paper
Challenges
There are many challenges that can complicate the TDM process such as sensitive data masking and
resource consumption. An overlooked challenge can cause major setbacks. Several common topics for
consideration have been listed below.
Data masking and de-identification is essential to comply with privacy laws and standards. There are
several approaches that may be taken to use realistic data without betraying the confidentiality of
sensitive data:
You could go through and remove all sensitive information, such as credit cards or social
security numbers, but this may not always be the correct method to accurately cover test
requirements.
One method is to generate fake data from scratch that fits the appropriate format. This can be
time consuming for personnel; however, an automated script can be used to quickly generate
required data.
If you need to return the data to its original format, in some circumstances, a reversible
algorithm can be used to alter the data. However, if the algorithm is known or discovered this
could potentially allow for the private data to be compromised.
A numeric variance, such as +/- 10%, can be used to change information (finance,
demographics, etc.) just enough to make it untrue but still valid enough for appropriate use.
Data Encryption is a very extensive approach that may not be as effective as it appears if access
rights are carelessly given out.
Masking out with viewed values being changed, such as with XX or **, can allow systems to still
use the data without making the data available for easy access.
White Paper
Solutions
Once challenges are reviewed we need to consider solutions to help mitigate the impact of these
challenges. Considerations for TDM improvement have been listed below:
Summary
Efficient Test Data Management (TDM) improves quality of testing results. Improved results lead to an
improved product and higher return on investment. A process with good understanding and meeting of
requirements, coupled with quality solutions to relevant challenges, will help provide the efficiency
desired in TDM. Once TDM is optimized, increases in productivity, results, and profitability should quickly
manifest, allowing more resources and focus can be utilized on continuing quality products and services.