Generation of test databases using sampling methods

Buda, Teodora Sandra

conference contribution

posted on 2013-08-22, 15:21 authored by Teodora Sandra Buda

Populating the testing environment with relevant data represents a great challenge in software validation, generally requiring expert knowledge about the system under development, as its data critically impacts the outcome of the tests designed to assess the system. Current practices of populating the testing environments generally focus on developing efficient algorithms for generating synthetic data or use the production environment for testing purposes. The latter is an invaluable strategy to provide real test cases in order to discover issues that critically impact the user of the system. However, the production environment generally consists of large amounts of data that are difficult to handle and analyze. Database sampling from the production environment is a potential solution to overcome these challenges. In this research, we propose two database sampling methods, VFDS and CoDS, with the objective of populating the testing environment. The first method is a very fast random sampling approach, while the latter aims at preserving the distribution of data in order to produce a representative sample. In particular, we focus on the dependencies between the data from different tables and the method tries to preserve the distributions of these dependencies.

History

Publication

The International Symposium in Software Testing and Analysis (ISSTA) Conference as part of the Doctoral Symposium; pp. 366-369

Publisher

Association for Computing Machinery

Note

peer-reviewed

Other Funding information

SFI

Rights

"© ACM, 2013. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in The International Symposium in Software Testing and Analysis (ISSTA) Conference as part of the Doctoral Symposium, pp. 366-369, http://dx.doi.org/10.1145/2483760.2492397"

Language

English

External identifier

http://dx.doi.org/10.1145/2483760.2492397
