Best Trade-Off Point Method for Efficient Resource Provisioning in Spark

Peter P. Nghiem, Santa Clara UniversityFollow

Document Type

Article

Publication Date

11-22-2018

Publisher

MDPI

Abstract

Considering the recent exponential growth in the amount of information processed in Big Data, the high energy consumed by data processing engines in datacenters has become a major issue, underlining the need for efficient resource allocation for more energy-efficient computing. We previously proposed the Best Trade-off Point (BToP) method, which provides a general approach and techniques based on an algorithm with mathematical formulas to find the best trade-off point on an elbow curve of performance vs. resources for efficient resource provisioning in Hadoop MapReduce. The BToP method is expected to work for any application or system which relies on a trade-off elbow curve, non-inverted or inverted, for making good decisions. In this paper, we apply the BToP method to the emerging cluster computing framework, Apache Spark, and show that its performance and energy consumption are better than Spark with its built-in dynamic resource allocation enabled. Our Spark-Bench tests confirm the effectiveness of using the BToP method with Spark to determine the optimal number of executors for any workload in production environments where job profiling for behavioral replication will lead to the most efficient resource provisioning.

Comments

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Recommended Citation

Nghiem, P. P. (2018). Best Trade-Off Point Method for Efficient Resource Provisioning in Spark. Algorithms, 11(12), 190. https://doi.org/10.3390/a11120190

Download

Included in

Computer Engineering Commons

COinS

Scholar Commons

Best Trade-Off Point Method for Efficient Resource Provisioning in Spark

Document Type

Publication Date

Publisher

Abstract

Comments

Recommended Citation

Included in

Browse

Search

Author Corner

Links

SelectedWorks Author Gallery

Scholar Commons

Computer Science and Engineering

Best Trade-Off Point Method for Efficient Resource Provisioning in Spark

Authors

Document Type

Publication Date

Publisher

Abstract

Comments

Recommended Citation

Included in

Share

Browse

Search

Author Corner

Links

SelectedWorks Author Gallery