Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement

S. Phani; Massila; Sai; Deshinta Arrova; Dedeepya; Klodian

doi:https://doi.org/10.54216/IJAACI.080104

Full Length Article

Volume 8 • Issue 1 • PP: 20–32 • 2026

Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement

S. Phani Praveen ^1*

mail

,

Massila Kamalrudin ²

mail

,

Sai Vellela ³

mail

,

Deshinta Arrova Dewi ⁴

mail

,

Dedeepya Pulletikurthy ⁵

mail

,

Klodian Dhoska ⁶

mail

¹Associate Professor, Department of Computer Science and Engineering, Prasad V. Potluri Siddhartha Institute of Technology, Kanuru, Vijayawada – 520007, Andhra Pradesh, India

²Faculty of Information and Communication Technology, Universiti Teknikal Malaysia Melaka (UTeM), Melaka, Malaysia

³Associate Professor, Department of CSE – Data Science, Chalapathi Institute of Technology, Guntur – 522016, Andhra Pradesh, India

⁴Professor, Faculty of Data Science and Information Technology (FDSIT), INTI International University, Malaysia

⁵Department of Computer Science & Engineering, SRM university AP, Amaravati, Andhra Pradesh, India

⁶Polytechnic University of Tirana, Tirana, Albania

* Corresponding Author.

DOI https://doi.org/10.54216/IJAACI.080104

format_quote Cite this article

Received: January 18, 2026 Revised: February 12, 2026 Accepted: March 22, 2026

View PDF open_in_new

Abstract

Due to the increasing adoption of IoT applications, there is a growing necessity for energy-efficient and sustainable WSN. Yet, traditional routing protocols tend to face problems like energy wastage, congestion, unreliable communication, and shorter network life spans under dynamic network conditions. This study presents the development of a DRL-powered Green IoT framework to enhance efficient communication through WSN while optimizing QoS performance. Specifically, the proposed framework employs the Deep Q-Network, Double Deep Q-Learning, adaptive clustering, and multi-objective optimization in order to enhance both routing and QoS performance. The model makes use of residual energy, congestion levels, throughput, delivery rate, and communication delays during its decision-making processes. Experimentation with the model was performed by making use of Python and NS-3. The simulation results showed that the presented model performed better than traditional routing methods like LEACH, PEGASIS, and HEED when evaluated on factors like energy preservation, enhanced throughput, minimized congestion, reduced delays, and increased network life spans. It can be concluded that DRL-powered communication optimization is a viable solution for the future development of Green IoT communication systems.

Keywords

Green IoT Wireless Sensor Networks Deep Reinforcement Learning Energy Efficiency QoS Enhancement Energy Sustainable Communication Adaptive Routing Network Lifetime

References

[1] K. S. Adu-Manu, E. Amoako, and F. Engmann, “Advancements in machine learning-enhanced green wireless sensor networks: A comprehensive survey on energy efficiency, network performance, and future directions,” Journal of Sensors, vol. 2025, no. 1, Art. no. 5242517, 2025.

[2] A. Al-Fuqaha, M. Guizani, M. Mohammadi, M. Aledhari, and M. Ayyash, “Internet of things: A survey on enabling technologies, protocols, and applications,” IEEE Communications Surveys & Tutorials, vol. 17, no. 4, pp. 2347–2376, 2015.

[3] N. Alsalmi, K. Navaie, and H. Rahmani, “Energy and throughput efficient mobile wireless sensor networks: A deep reinforcement learning approach,” IET Networks, vol. 13, nos. 5–6, pp. 413–433, 2024.

[4] M. Chen, S. Mao, and Y. Liu, “Big data: A survey,” Mobile Networks and Applications, vol. 19, no. 2, pp. 171–209, 2014.

[5] Q. Ding, R. Zhu, H. Liu, and M. Ma, “An overview of machine learning-based energy-efficient routing algorithms in wireless sensor networks,” Electronics, vol. 10, no. 13, Art. no. 1539, 2021.

[6] H. Dutta, A. K. Bhuyan, and S. Biswas, “Contextual deep reinforcement learning for flow and energy management in wireless sensor and IoT networks,” IEEE Transactions on Green Communications and Networking, vol. 8, no. 3, pp. 1233–1244, 2024.

[7] W. K. Ghamry and S. Shukry, “Multi-objective intelligent clustering routing schema for internet of things enabled wireless sensor networks using deep reinforcement learning,” Cluster Computing, vol. 27, no. 4, pp. 4941–4961, 2024.

[8] D. Godfrey, B. Suh, B. H. Lim, K. C. Lee, and K. I. Kim, “An energy-efficient routing protocol with reinforcement learning in software-defined wireless sensor networks,” Sensors, vol. 23, no. 20, Art. no. 8435, 2023.

[9] J. Gubbi, R. Buyya, S. Marusic, and M. Palaniswami, “Internet of Things (IoT): A vision, architectural elements, and future directions,” Future Generation Computer Systems, vol. 29, no. 7, pp. 1645–1660, 2013.

[10] R. Hamdi, E. Baccour, A. Erbad, M. Qaraqe, and M. Hamdi, “LoRa-RL: Deep reinforcement learning for resource management in hybrid energy LoRa wireless networks,” IEEE Internet of Things Journal, vol. 9, no. 9, pp. 6458–6476, 2021.

[11] L. Hu, C. Han, X. Wang, H. Zhu, and J. Ouyang, “Security enhancement for deep reinforcement learning-based strategy in energy-efficient wireless sensor networks,” Sensors, vol. 24, no. 6, Art. no. 1993, 2024.

[12] S. Khairy, P. Balaprakash, L. X. Cai, and Y. Cheng, “Constrained deep reinforcement learning for energy sustainable multi-UAV based random access IoT networks with NOMA,” IEEE Journal on Selected Areas in Communications, vol. 39, no. 4, pp. 1101–1115, 2020.

[13] S. S. Khatami, M. Shoeibi, R. Salehi, and M. Kaveh, “Energy-efficient and secure double RIS-aided wireless sensor networks: A QoS-aware fuzzy deep reinforcement learning approach,” Journal of Sensor and Actuator Networks, vol. 14, no. 1, Art. no. 18, 2025.

[14] S. Li, L. D. Xu, and S. Zhao, “The internet of things: A survey,” Information Systems Frontiers, vol. 17, no. 2, pp. 243–259, 2015.

[15] X. Li, X. Wei, S. Chen, and L. Sun, “Multi-agent deep reinforcement learning based resource management in SWIPT enabled cellular networks with H2H/M2M coexistence,” Ad Hoc Networks, vol. 149, Art. no. 103256, 2023.

[16] J. C. Lopez-Ardao, R. F. Rodriguez-Rubio, A. Suarez- Gonzalez, M. Rodriguez-Perez, and M. E. Sousa-Vieira, “Current trends on green wireless sensor networks,” Sensors, vol. 21, no. 13, Art. no. 4281, 2021.

[17] Y. Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief, “A survey on mobile edge computing: The communication perspective,” IEEE Communications Surveys & Tutorials, vol. 19, no. 4, pp. 2322–2358, 2017.

[18] X. Meng, H. Inaltekin, and B. Krongold, “Deep reinforcement learning-based topology optimization for self-organized wireless sensor networks,” in Proc. IEEE Global Communications Conference (GLOBECOM), 2019, pp. 1–6.

[19] V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, et al., “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, 2015.

[20] P. Pandiyan, S. Saravanan, R. Kannadasan, S. Krishnaveni, M. H. Alsharif, and M. K. Kim, “A comprehensive review of advancements in green IoT for smart grids: Paving the path to sustainability,” Energy Reports, vol. 11, pp. 5504–5531, 2024.

[21] G. Sattibabu, N. Ganesan, and R. S. Kumaran, “IoTenabled wireless sensor networks optimization based on federated reinforcement learning for enhanced performance,” Peer-to-Peer Networking and Applications, vol. 18, no. 2, Art. no. 75, 2025.

[22] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv preprint arXiv:1707.06347, 2017.

[23] S. P. Singh, N. Kumar, N. S. Alghamdi, G. Dhiman, W. Viriyasitavat, and A. Sapsomboon, “Next-gen WSN enabled IoT for consumer electronics in smart city: Elevating quality of service through reinforcement learningenhanced multi-objective strategies,” IEEE Transactions on Consumer Electronics, vol. 70, no. 4, pp. 6507–6518, 2024.

[24] M. Sohail, S. Khan, R. Ahmad, D. Singh, and J. Lloret, “Game theoretic solution for power management in IoTbased wireless sensor networks,” Sensors, vol. 19, no. 18, Art. no. 3835, 2019.

[25] I. Surenther, K. P. Sridhar, and M. K. Roberts, “Maximizing energy efficiency in wireless sensor networks for data transmission: A deep learning-based grouping model approach,” Alexandria Engineering Journal, vol. 83, pp. 53–65, 2023.

[26] H. Van Hasselt, A. Guez, and D. Silver, “Deep reinforcement learning with double Q-learning,” in Proc. AAAI Conference on Artificial Intelligence, vol. 30, no. 1, 2016.

[27] Z. Yang, K. Merrick, L. Jin, and H. A. Abbass, “Hierarchical deep reinforcement learning for continuous action control,” IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 11, pp. 5174–5184, 2018.

[28] M. U. Younus, M. K. Khan, and A. R. Bhatti, “Improving the software-defined wireless sensor networks routing performance using reinforcement learning,” IEEE Internet of Things Journal, vol. 9, no. 5, pp. 3495–3508, 2021.

[29] J. Yuan, J. Peng, Q. Yan, G. He, H. Xiang, and Z. Liu, “Deep reinforcement learning-based energy consumption optimization for peer-to-peer (P2P) communication in wireless sensor networks,” Sensors, vol. 24, no. 5, Art. no. 1632, 2024.

[30] B. Zhao and X. Zhao, “Deep reinforcement learning resource allocation in wireless sensor networks with energy harvesting and relay,” IEEE Internet of Things Journal, vol. 9, no. 3, pp. 2330–2345, 2021.

[31] S. P. Praveen, H. Dendukuri, C. Subbarao, V. J. Manasa, M. Saritha, and S. Poddar, “A stream-processingenabled AI framework for fast and reliable cybersecurity threat identification,” in Proc. 3^rd International Conference on Sustainable Computing and Smart Systems (ICSCSS), 2025, pp. 2023–2029.

[32] S. P. Praveen, P. Panguluri, U. Sirisha, D. A. Dewi, T. B. Kurniawan, and L. Efrizoni, “Stacked LSTM with multi-head attention based model for intrusion detection,” Journal of Applied Data Sciences, vol. 7, no. 1, pp. 475–488, 2026.

[33] S. P. Praveen, K. Sharma, D. Parashar, V. S. N. Murthy, U. Sirisha, and D. A. Dewi, “Design of an iterative method for adaptive federated intrusion detection for energy-constrained edge-centric 6G IoT cyber-physical systems,” Scientific Reports, vol. 15, no. 1, Art. no. 41387, 2025.

[34] S. P. Praveen, M. Kamalrudin, M. Musa, U. Harita, Y. Ayyappa, and T. Nagamani, “A unified AI framework for confidentiality preserving cyberattack detection in healthcare cyber physical networks,” International Journal of Innovative Technology and Interdisciplinary Sciences, vol. 8, no. 3, pp. 818–841, 2025.

[35] R. Tang, Y.Wu, J. Tan, et al., “Research on rechargeable agricultural wireless sensor network based on ZigBee immune routing repair algorithm,” Scientific Reports, vol. 15, Art. no. 5756, 2025. [36] G. Priscilla, B. Kumar, S. Maidin, and Z. Attarbashi, “Trust aware congestion control mechanism for wireless sensor network,” Journal of Applied Data Sciences, vol. 6, no. 2, pp. 858–870, 2025.

[37] C. Feng, A. K. Jumaah Al-Nussairi, M. H. Chyad, et al., “AI powered blockchain framework for predictive temperature control in smart homes using wireless sensor networks and time shifted analysis,” Scientific Reports, vol. 15, Art. no. 18168, 2025.

Cite This Article

Choose your preferred format

format_quote

Praveen, S. Phani, Kamalrudin, Massila, Vellela, Sai , Dewi, Deshinta Arrova, Pulletikurthy, Dedeepya , Dhoska, Klodian. "Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement." International Journal of Advances in Applied Computational Intelligence, vol. Volume 8 , no. Issue 1, 2026, pp. 20–32. DOI: https://doi.org/10.54216/IJAACI.080104

Praveen, S., Kamalrudin, M., Vellela, S., Dewi, D., Pulletikurthy, D., Dhoska, K. (2026). Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement. International Journal of Advances in Applied Computational Intelligence, Volume 8 (Issue 1), 20–32. DOI: https://doi.org/10.54216/IJAACI.080104

Praveen, S. Phani, Kamalrudin, Massila, Vellela, Sai , Dewi, Deshinta Arrova, Pulletikurthy, Dedeepya , Dhoska, Klodian. "Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement." International Journal of Advances in Applied Computational Intelligence Volume 8 , no. Issue 1 (2026): 20–32. DOI: https://doi.org/10.54216/IJAACI.080104

Praveen, S., Kamalrudin, M., Vellela, S., Dewi, D., Pulletikurthy, D., Dhoska, K. (2026) 'Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement', International Journal of Advances in Applied Computational Intelligence, Volume 8 (Issue 1), pp. 20–32. DOI: https://doi.org/10.54216/IJAACI.080104

Praveen S, Kamalrudin M, Vellela S, Dewi D, Pulletikurthy D, Dhoska K. Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement. International Journal of Advances in Applied Computational Intelligence. 2026;Volume 8 (Issue 1):20–32. DOI: https://doi.org/10.54216/IJAACI.080104

S. Praveen, M. Kamalrudin, S. Vellela, D. Dewi, D. Pulletikurthy, K. Dhoska, "Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement," International Journal of Advances in Applied Computational Intelligence, vol. Volume 8 , no. Issue 1, pp. 20–32, 2026. DOI: https://doi.org/10.54216/IJAACI.080104

Digital Archive Ready