Green IOT and Sustainable Wireless Sensor Networks: A Deep

Reinforcement Learning Approach for Energy Optimization and

Qos Enhancement

Surapaneni Phani Praveen1* Massila Kamalrudin2 Sai Srinivas Vellela3

Deshinta Arrova Dewi4 Dedeepya Pulletikurthy5 Klodian Dhoska6

1 Associate Professor, Department of Computer Science and Engineering, Prasad V. Potluri Siddhartha Institute of Technology,

Kanuru, Vijayawada – 520007, Andhra Pradesh, India

2 Faculty of Information and Communication Technology, Universiti Teknikal Malaysia Melaka (UTeM), Melaka, Malaysia.

3 Associate Professor, Department of CSE – Data Science, Chalapathi Institute of Technology, Guntur – 522016, Andhra Pradesh,

India

4 Professor, Faculty of Data Science and Information Technology (FDSIT), INTI International University, Malaysia

5 Department of Computer Science & Engineering, SRM university AP, Amaravati, Andhra Pradesh, India

6 Polytechnic University of Tirana, Tirana, Albania

Emails: phani.0713@gmail.com, massila@utem.edu.my, sais1916@gmail.com, deshinta.ad@newinti.edu.my,

dedeepya_pulletikurthy@srmap.edu.in, kdhoska@upt.al

Received: January 18, 2026 Revised: February 12, 2026 Accepted: March 22, 2026 ⋆ Corresponding author

ABSTRACT

Due to the increasing adoption of IoT applications, there is a growing necessity for energy-efficient and sustainable

WSN. Yet, traditional routing protocols tend to face problems like energy wastage, congestion, unreliable communication,

and shorter network life spans under dynamic network conditions. This study presents the development

of a DRL-powered Green IoT framework to enhance efficient communication through WSN while optimizing

QoS performance. Specifically, the proposed framework employs the Deep Q-Network, Double Deep Q-Learning,

adaptive clustering, and multi-objective optimization in order to enhance both routing and QoS performance.The

model makes use of residual energy, congestion levels, throughput, delivery rate, and communication delays during

its decision-making processes. Experimentation with the model was performed by making use of Python and NS-3.

The simulation results showed that the presented model performed better than traditional routing methods like

LEACH, PEGASIS, and HEED when evaluated on factors like energy preservation, enhanced throughput, minimized

congestion, reduced delays, and increased network life spans. It can be concluded that DRL-powered communication

optimization is a viable solution for the future development of Green IoT communication systems.

Keywords: Green IoT Wireless Sensor Networks Deep Reinforcement Learning Energy Efficiency QoS Enhancement

Energy Sustainable Communication Adaptive Routing Network Lifetime

1. INTRODUCTION

The advent of Internet of Things (IoT) has revolutionized

modern communication networks in that it enables automatic

information sharing among different smart objects connected

in the network. With more deployments, the IoT has produced

a lot of communication information that necessitates efficient

management solutions for the network (Chen et al., 2014).