University of Limerick Institutional Repository

Towards realistic sampling: generating dependencies in a relational database

dc.contributor.author Buda, Teodora Sandra
dc.contributor.author Murphy, John
dc.contributor.author Kristiansen, Morten
dc.date.accessioned 2013-11-28T12:22:23Z
dc.date.available 2013-11-28T12:22:23Z
dc.date.issued 2013
dc.identifier.uri http://hdl.handle.net/10344/3479
dc.description peer-reviewed en_US
dc.description.abstract Managing large amounts of information is one of the most expensive, time-consuming and non-trivial activities, and it usually requires expert knowledge. In a wide range of application areas, such as data mining, histogram construction, approximate query evaluation, and software validation, handling exponentially growing databases has become a difficult challenge, and a subset of the data is generally preferred. As a solution to the current challenges in managing large amounts of data, database sampling from the operational data available has proved to be a powerful technique. However, none of the existing sampling approaches consider the dependencies between the data in a relational database. In this paper, we propose a novel approach towards constructing a realistic testing environment, by analyzing the distribution of data in the original database along these dependencies before sampling, so that the sample database is representative of the original database. en_US
dc.language.iso eng en_US
dc.publisher Association for Computing Machinery en_US
dc.relation.ispartofseries ACM ICUIMC’13;Article no. 12
dc.relation.uri http://dx.doi.org/10.1145/2448556.2448568
dc.rights "© ACM, 2013. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM ICUIMC’13, article no. 12, http://dx.doi.org/10.1145/2448556.2448568" en_US
dc.subject algorithms en_US
dc.subject measurement en_US
dc.subject experimentation en_US
dc.title Towards realistic sampling: generating dependencies in a relational database en_US
dc.type info:eu-repo/semantics/conferenceObject en_US
dc.type.supercollection all_ul_research en_US
dc.type.supercollection ul_published_reviewed en_US
dc.contributor.sponsor SFI en_US
dc.relation.projectid 10/CE/I1855 en_US
dc.rights.accessrights info:eu-repo/semantics/openAccess en_US

