Posted on 2013-08-22, 15:21. Authored by Teodora Sandra Buda
Populating the testing environment with relevant data represents
a great challenge in software validation, generally
requiring expert knowledge about the system under development,
as its data critically impacts the outcome of the tests
designed to assess the system. Current practices of populating
the testing environment generally focus either on developing
efficient algorithms for generating synthetic data or on using the
production environment for testing purposes. The latter is
an invaluable strategy to provide real test cases in order to
discover issues that critically impact the user of the system.
However, the production environment generally consists of
large amounts of data that are difficult to handle and analyze.
Database sampling from the production environment
is a potential solution to overcome these challenges.
In this research, we propose two database sampling methods,
VFDS and CoDS, with the objective of populating the
testing environment. The first method is a very fast random
sampling approach, while the latter aims at preserving
the distribution of data in order to produce a representative
sample. In particular, we focus on the dependencies
between the data from different tables, and CoDS tries
to preserve the distributions of these dependencies.
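
To make the idea of sampling across dependent tables concrete, the following is a minimal sketch and not the VFDS or CoDS algorithm, neither of which is specified in this abstract. It assumes two in-memory tables represented as lists of dictionaries, a hypothetical foreign key column named parent_id, and hypothetical helper names (sample_with_dependencies, fanout_distribution). It randomly samples parent rows, keeps only the child rows that reference them so that the sample stays referentially consistent, and computes the distribution of children per parent so that the sample can be compared against the original data.

```python
import random
from collections import Counter


def sample_with_dependencies(parents, children, fk, rate, seed=0):
    """Randomly sample parent rows, then keep every child row that
    references a sampled parent (illustrative only, not VFDS or CoDS)."""
    rng = random.Random(seed)
    k = max(1, round(len(parents) * rate))
    sampled_parents = rng.sample(parents, k)
    kept_ids = {p["id"] for p in sampled_parents}
    # Keeping only children of sampled parents avoids dangling references.
    sampled_children = [c for c in children if c[fk] in kept_ids]
    return sampled_parents, sampled_children


def fanout_distribution(parents, children, fk):
    """Map 'number of children per parent' to the fraction of parents
    that have exactly that many children."""
    per_parent = Counter(c[fk] for c in children)
    counts = Counter(per_parent.get(p["id"], 0) for p in parents)
    total = len(parents)
    return {n: counts[n] / total for n in sorted(counts)}


if __name__ == "__main__":
    # Hypothetical data: 100 parents, 250 children.
    parents = [{"id": i} for i in range(100)]
    children = [{"id": j, "parent_id": j % 100} for j in range(250)]

    sp, sc = sample_with_dependencies(parents, children, "parent_id", rate=0.1)
    print("original:", fanout_distribution(parents, children, "parent_id"))
    print("sample:  ", fanout_distribution(sp, sc, "parent_id"))
```

Comparing the two printed distributions gives a rough sense of how representative a purely random sample is with respect to one dependency; a distribution-preserving method would aim to keep them close by construction.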
Publication
The International Symposium on Software Testing and Analysis (ISSTA) Conference, as part of the Doctoral Symposium; pp. 366-369