Automatic configuration of the Cassandra database using irace

Abstract

Database systems play a central role in modern data-centered applications. Their performance is thus a key factor in the efficiency of data processing pipelines. Modern database systems expose several parameters that users and database administrators can configure to tailor the database to a specific application. While this task has traditionally been performed manually, in recent years several methods have been proposed to automatically find the best parameter configuration for a database. Many of these methods, however, either use statistical models that require large amounts of data and fail to represent all the factors that impact the performance of a database, or implement complex algorithmic solutions. In this work we study the potential of a simple, model-free, general-purpose configuration tool to automatically find the best parameter configuration of a database. We use the irace configurator to automatically find the best parameter configuration for the Cassandra NoSQL database using the YCSB benchmark under different scenarios. We establish a reliable experimental setup and obtain speedups of up to 30% over the default configuration in terms of throughput, and we provide an analysis of the configurations obtained.
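As an illustration of how a throughput speedup such as the one reported above can be computed, the following minimal sketch assumes that speedup means the relative gain in operations per second of a tuned configuration over the default one. The abstract does not state the exact formula, so this definition, the function name, and the throughput values are all assumptions made for the example and are not results from the paper.

```python
def throughput_speedup(default_ops_per_sec: float, tuned_ops_per_sec: float) -> float:
    """Relative throughput gain of a tuned configuration over the default.

    Assumed definition: (tuned - default) / default.
    """
    if default_ops_per_sec <= 0:
        raise ValueError("default throughput must be positive")
    return (tuned_ops_per_sec - default_ops_per_sec) / default_ops_per_sec


# Hypothetical benchmark readings in operations per second (not from the paper).
default_tp = 10_000.0
tuned_tp = 13_000.0

speedup = throughput_speedup(default_tp, tuned_tp)
print(f"Speedup over default: {speedup:.0%}")
```

With these made-up readings the tuned configuration processes 30% more operations per second than the default.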

Silva-Munoz, M., Franzin, A., & Bersini, H. (2021). Automatic configuration of the Cassandra database using irace. PeerJ Computer Science, 7, 1–35. https://doi.org/10.7717/peerj-cs.634
