Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures

Saad Hussein Abed Hamed; Mondher Frikha; Heni Bouhamed

doi:https://doi.org/10.54216/JISIoT.180216

Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures

Saad Hussein Abed Hamed ^{1
*} , Mondher Frikha ² , Heni Bouhamed ³

1 ENETCom SFAX, ReDCAD Laboratory, University of Sfax, B.P. 1173, 3038 Sfax, Tunisia; Computer Science and Information Technology, Al-Qadisiyah University, Iraq - (saad.hussain@qu.edu.iq)

2 TISP Laboratory, ENET’com, University of Sfax, Tunisia - (mondher.frikha@enetcom.usf.tn)

3 TISP Laboratory, ENET’com, University of Sfax, Tunisia - (heni.bouhamed@fsegs.usf.tn)

Doi: https://doi.org/10.54216/JISIoT.180216

Received: March 11, 2025 Revised: June 10, 2025 Accepted: August 01, 2025

Abstract

The rapid expansion of big data has accelerated the adoption of distributed computing frame- works such as Apache Hadoop and Apache Spark, enabling efficient large-scale data processing. While Spark’s in-memory computation model significantly enhances performance compared to Hadoop’s traditional MapReduce, the deployment architecture—whether Dockerized or non- Dockerized—plays a crucial role in affecting performance, scalability, and resource management. This study evaluates the impact of containerized and non-containerized multi-node cluster architectures on the performance of Hadoop and Spark, utilizing standardized workloads such as WordCount and TeraSort. Key performance metrics, including execution time, throughput, and resource utilization, are analyzed across various configurations with parameter tuning. Beyond pure performance benchmarking, the study also assesses the quality attributes of microservices in big data environments, focusing on scalability, maintainability, fault tolerance, and resource efficiency. The comparative analysis between monolithic and microservice-based architectures highlights the advantages of modularity and independent scaling inherent to microservices. Experimental findings indicate that Spark outperforms Hadoop on small to medium-scale workloads, while Hadoop exhibits superior robustness for processing extremely large datasets. Dockerized deployments offer better resource isolation and management flexibility, whereas non-Dockerized setups demonstrate reduced overhead under certain configurations. These insights contribute to optimizing deployment strategies and architectural decisions for microservices-based big data processing frameworks.

Keywords :

Apache Hadoop , Apache Spark , Big Data , Microservices , Quality Attribute Assessment , Docker Containerization , Kubernetes , Multi-Node Clusters , Performance Benchmarking

References

[1] H. Brown and E. Lee, “Node.js in the microservices era: An empirical study,” IEEE Softw., vol. 41, no. 2, pp. 76–85, 2024.

[2] Q. Chen, L. Zhou, “Concurrency models and scalability in microservices: A comparative review,” ACM Trans. Softw. Eng. Methodol., vol. 31, no. 3, pp. 1–34, 2022.

[3] L. Davis, C. Martinez, “Rust vs. Go: A comparative study for high-performance microservices,” ACM Trans. Internet Technol., vol. 21, no. 3, pp. 1–22, 2024.

[4] R. Zhang, Y. Patel, “Developer productivity in microservice development: A survey on programming language impact,” Softw. Eng. Rev., vol. 41, no. 2, pp. 101–119, 2021.

[5] N. K. Hadi, S. H. A. Hamad, S. J. Abbas, G. F. Ali, and M. M. M. Maadi, “Enhancing software reusability in higher education applications through microservices architecture,” J. Softw. Eng. Appl., 2023.

[6] T. Smith, J. Wilson, “Microservices in practice: A survey of performance and scalability,” J. Softw. Eng., vol. 32, no. 4, pp. 210–223, 2021.

[7] M. Costanzo, E. Rucci, M. Naiouf, and A. De Giusti, “Performance vs programming effort between rust and c on multicore architectures: Case study in n-body,” in Proc. XLVII Latin Amer. Comput. Conf. (CLEI), 2021, pp. 1–10.

[8] M. Gupta, R. Al-Bassam, “Java and its evolving role in microservices,” IEEE Softw., vol. 41, no. 2, pp. 50–61, 2023.

[9] J. Goncalves, J. Rodrigues, “A comparative study of microservices frameworks in terms of performance and scalability,” J. Syst. Softw., vol. 176, p. 110978, 2021.

[10] F. Almeida, R. Costa, “Evaluating modern programming languages for microservices architecture,” IEEE Trans. Softw. Eng., vol. 50, no. 4, pp. 410–425, 2023.

[11] H. Dinh-Tuan, M. Mora-Martinez, F. Beierle, and S. R. Garzon, “Development frameworks for microservice-based applications: Evaluation and comparison,” in Proc. Eur. Symp. Softw. Eng., 2020, pp. 12–20.

[12] S. Kim, J. Park, “Rust in microservices: Leveraging performance and safety,” in Proc. Int. Conf. Syst. Archit., 2024, pp. 67–79.

[13] T. Li, W. Zhang, “Node.js in microservices: Performance and scalability,” IEEE Trans. Cloud Comput., vol. 9, no. 1, pp. 45–58, 2024.

[14] R. Doe, A. Johnson, “Python vs. Java in microservices: A comparative analysis,” J. Softw. Eng., vol. 15, no. 4, pp. 215–232, 2023.

[15] Nakamura, S. Patel, “The role of programming languages in microservices architecture,” Softw. Pract. Exper, vol. 51, no. 8, pp. 1743–1756, 2021.

[16] W. Zhou, Q. Zhao, “Concurrency handling in microservices: Go vs. Java,” J. Parallel Distrib. Comput., vol. 79, no. 3, pp. 182–192, 2022.

[17] R. Wang, X. Patel, “Programming languages for microservices: A comprehensive analysis,” in Proc. Int. Conf. Cloud Comput., 2022, pp. 143–155.

[18] M. D. Hossain et al., “The role of microservice approach in edge computing: Opportunities, challenges, and research directions,” ICT Express, vol. 9, no. 6, pp. 1162–1182, 2023.

[19] D. Peters, J. Cruz, “Security in microservices: A comparative analysis of language features,” J. Cybersecurity, vol. 8, no. 4, pp. 1–19, 2022.

[20] T. Liu, H. Yang, “Security aspects of programming languages in microservices architectures: A comparative study,” Comput. Secur, vol. 106, p. 102269, 2021.

[21] Y. Lin, H. Chen, “Scalability of microservices: A language perspective,” Softw. Archit. J., vol. 29, no. 7, pp. 421–433, 2021.

[22] García, R. Silva, “Ecosystem evaluation of programming languages for microservices: Community and tooling aspects,” Softw. Pract. Exper, vol. 51, no. 12, pp. 2751–2771, 2021.

[23] S. H. A. Hamed, “Reusability of legacy software using microservices: An online exam system example,” J. Al-Qadisiyah Comput. Sci. Math., vol. 15, no. 3, p. 35, 2023.

[24] G. Türkmen, A. Sezen, and G. Şengül, “Comparative analysis of programming languages utilized in artificial intelligence applications: Features, performance, and suitability,” Int. J. Comput. Exp. Sci. Eng., vol. 10, no. 3, pp. 461–469, 2024.

[25] S. Davis and R. Kumar, “Analysis of innovative architectures for data-intensive processing in distributed systems,” IEEE Trans. Big Data, vol. 8, pp. 101–120, 2022.

[26] S. Johnson, P. Martinez, and F. Lee, “Benchmarking using standardized workloads (e.g., WordCount and TeraSort) to compare Hadoop and Spark,” Big Data Res., vol. 20, pp. 55–75, 2022.

[27] D. Nguyen, T. Tran, and Q. Pham, “Performance evaluations of in-memory versus disk-based processing in big data frameworks,” Int. J. Data Eng., vol. 15, pp. 87–102, 2022.

[28] M. Rodriguez and L. Garcia, “Deployment of Hadoop in high-throughput, domain-specific environments,” J. Syst. Archit., vol. 26, pp. 147–165, 2022.

[29] P. Patel, S. Kumar, and R. Jain, “Case study on the application of Hadoop in specialized domains such as synchrotron experiments,” Comput. Sci. Eng., vol. 24, pp. 35–45, 2022.

[30] S. Lee, J. Kim, and H. Park, “Demonstrating Apache Spark’s adaptability for interactive and educational applications,” IEEE Access, vol. 10, pp. 1450–1462, 2022.

[31] C. Smith, A. Johnson, and B. White, “Real-time analytics and e-learning applications using Spark,” ACM Trans. Intell. Syst. Technol., vol. 13, pp. 78–95, 2022.

[32] R. Miller and D. Gupta, “Evaluations of Docker-based deployments and orchestration strategies in big data environments,” Future Gener. Comput. Syst., vol. 119, pp. 345–360, 2022.

[33] L. Garcia, F. Rodriguez, and E. Santos, “The impact of containerization on the performance and scalability of big data systems,” J. Cloud Comput., vol. 11, pp. 89–107, 2022.

[34] X. Zhang, Y. Kumar, and S. Li, “Comparative study of microservice-based architectures in big data frameworks,” IEEE Trans. Serv. Comput., vol. 15, pp. 112–130, 2022.

[35] P. Plecinski, N. Bokla, T. Klymkovych, M. Melnyk, and W. Zabierowski, “Comparison of representative microservices technologies in terms of performance for use for projects based on sensor networks,” Sensors, vol. 22, no. 20, p. 7759, 2022.

[36] R. Kumar, S. Zhang, and Y. Patel, “An integrated quality attribute framework for evaluating big data systems incorporating microservices,” IEEE Trans. Big Data, vol. 8, pp. 210–225, 2022.

[37] Y. Hernandez and M. Wang, “Resource tuning and its effect on Apache Spark’s performance,” J. High Perform. Comput. Appl., vol. 36, pp. 123–138, 2022.

[38] K. Brown and E. Davis, “Analysis of resource management techniques to improve Spark execution times,” Parallel Comput., vol. 105, pp. 102–118, 2022.

[39] Z. Chen, X. Li, and Y. Wang, “An evaluation of Apache Hadoop and Spark performance on data-intensive workloads,” J. Data Sci., vol. 18, pp. 112–130, 2022.

[40] S. Singh, R. Verma, and P. Agarwal, “Evaluation of scalability and fault tolerance in microservice-based deployments,” J. Netw. Comput. Appl., vol. 196, pp. 103–115, 2022.

[41] S. Almeida, R. Martin, and G. Liu, “Performance comparisons of Dockerized multi-node architectures for big data processing,” J. Parallel Distrib. Syst., vol. 82, pp. 150–165, 2022.

[42] R. Liu, S. Martin, and G. Almeida, “A study comparing Dockerized and non-Dockerized environments in distributed frameworks,” ACM Trans. Auton. Adapt. Syst., vol. 17, pp. 1–22, 2022.

[43] S. Martin, G. Almeida, and R. Liu, “Investigations into latency and overhead in containerized big data deployments,” Future Internet, vol. 14, pp. 77–90, 2022.

[44] H. Bouhamed, M. Hamdi, and R. Gargouri, “Covid-19 patients’ hospital occupancy prediction during the recent omicron wave via some recurrent deep learning architectures,” Int. J. Comput. Commun. Control, vol. 17, no. 3, 2022.

[45] F. AlFayez and H. Bouhamed, “Machine learning and uLBP histograms for posture recognition of dependent people via Big Data Hadoop and Spark platform,” Int. J. Comput. Commun. Control, vol. 18, no. 1, 2023.

[46] J. Smith, B. T. Johnson, and C. R. Lee, “Microservices architecture: Challenges and opportunities in software development,” J. Softw. Eng. Appl., vol. 17, no. 5, pp. 45–62, 2023, doi: 10.4236/jsea.2023.175004.

[47] P. Kumar and R. Singh, “Performance analysis of microservices languages,” in Proc. Int. Conf. Cloud Comput., 2023, pp. 54–61.

[48] S. Hussein, M. Lahami, and M. Torjmen, “Assessing the quality of microservice and monolithic-based architectures: A systematic literature review,” Oper. Res. Eng. Sci. Theory Appl., vol. 7, no. 2, 2024.

Cite This Article As :

Hussein, Saad. , Frikha, Mondher. , Bouhamed, Heni. Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures. Journal of Intelligent Systems and Internet of Things, vol. , no. , 2026, pp. 220-238. DOI: https://doi.org/10.54216/JISIoT.180216

Hussein, S. Frikha, M. Bouhamed, H. (2026). Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures. Journal of Intelligent Systems and Internet of Things, (), 220-238. DOI: https://doi.org/10.54216/JISIoT.180216

Hussein, Saad. Frikha, Mondher. Bouhamed, Heni. Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures. Journal of Intelligent Systems and Internet of Things , no. (2026): 220-238. DOI: https://doi.org/10.54216/JISIoT.180216

Hussein, S. , Frikha, M. , Bouhamed, H. (2026) . Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures. Journal of Intelligent Systems and Internet of Things , () , 220-238 . DOI: https://doi.org/10.54216/JISIoT.180216

Hussein S. , Frikha M. , Bouhamed H. [2026]. Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures. Journal of Intelligent Systems and Internet of Things. (): 220-238. DOI: https://doi.org/10.54216/JISIoT.180216

Hussein, S. Frikha, M. Bouhamed, H. "Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures," Journal of Intelligent Systems and Internet of Things, vol. , no. , pp. 220-238, 2026. DOI: https://doi.org/10.54216/JISIoT.180216

Journal of Intelligent Systems and Internet of Things

Journal Menu

Journal Volumes

Volume 0

Volume 1

Volume 2

Volume 3

Volume 4

Volume 5

Volume 6

Volume 7

Volume 8

Volume 9

Volume 10

Volume 11

Volume 12

Volume 13

Volume 14

Volume 15

Volume 16

Volume 17

Volume 18

Assessing Quality Attributes of Microservices in Hadoop and Spark Clusters: A Performance Benchmarking Approach in Dockerized and Non-Dockerized Architectures

Abstract

Keywords :

References

Cite This Article As :

Article Statistics

Download