Volume 8 • Issue 1 • PP: 20–32 • 2026
Green IOT and Sustainable Wireless Sensor Networks: A Deep Reinforcement Learning Approach for Energy Optimization and Qos Enhancement
Abstract
Due to the increasing adoption of IoT applications, there is a growing necessity for energy-efficient and sustainable WSN. Yet, traditional routing protocols tend to face problems like energy wastage, congestion, unreliable communication, and shorter network life spans under dynamic network conditions. This study presents the development of a DRL-powered Green IoT framework to enhance efficient communication through WSN while optimizing QoS performance. Specifically, the proposed framework employs the Deep Q-Network, Double Deep Q-Learning, adaptive clustering, and multi-objective optimization in order to enhance both routing and QoS performance. The model makes use of residual energy, congestion levels, throughput, delivery rate, and communication delays during its decision-making processes. Experimentation with the model was performed by making use of Python and NS-3. The simulation results showed that the presented model performed better than traditional routing methods like LEACH, PEGASIS, and HEED when evaluated on factors like energy preservation, enhanced throughput, minimized congestion, reduced delays, and increased network life spans. It can be concluded that DRL-powered communication optimization is a viable solution for the future development of Green IoT communication systems.
Keywords
References
[1] K. S. Adu-Manu, E. Amoako, and F. Engmann, “Advancements in machine learning-enhanced green wireless sensor networks: A comprehensive survey on energy efficiency, network performance, and future directions,” Journal of Sensors, vol. 2025, no. 1, Art. no. 5242517, 2025.
[2] A. Al-Fuqaha, M. Guizani, M. Mohammadi, M. Aledhari, and M. Ayyash, “Internet of things: A survey on enabling technologies, protocols, and applications,” IEEE Communications Surveys & Tutorials, vol. 17, no. 4, pp. 2347–2376, 2015.
[3] N. Alsalmi, K. Navaie, and H. Rahmani, “Energy and throughput efficient mobile wireless sensor networks: A deep reinforcement learning approach,” IET Networks, vol. 13, nos. 5–6, pp. 413–433, 2024.
[4] M. Chen, S. Mao, and Y. Liu, “Big data: A survey,” Mobile Networks and Applications, vol. 19, no. 2, pp. 171–209, 2014.
[5] Q. Ding, R. Zhu, H. Liu, and M. Ma, “An overview of machine learning-based energy-efficient routing algorithms in wireless sensor networks,” Electronics, vol. 10, no. 13, Art. no. 1539, 2021.
[6] H. Dutta, A. K. Bhuyan, and S. Biswas, “Contextual deep reinforcement learning for flow and energy management in wireless sensor and IoT networks,” IEEE Transactions on Green Communications and Networking, vol. 8, no. 3, pp. 1233–1244, 2024.
[7] W. K. Ghamry and S. Shukry, “Multi-objective intelligent clustering routing schema for internet of things enabled wireless sensor networks using deep reinforcement learning,” Cluster Computing, vol. 27, no. 4, pp. 4941–4961, 2024.
[8] D. Godfrey, B. Suh, B. H. Lim, K. C. Lee, and K. I. Kim, “An energy-efficient routing protocol with reinforcement learning in software-defined wireless sensor networks,” Sensors, vol. 23, no. 20, Art. no. 8435, 2023.
[9] J. Gubbi, R. Buyya, S. Marusic, and M. Palaniswami, “Internet of Things (IoT): A vision, architectural elements, and future directions,” Future Generation Computer Systems, vol. 29, no. 7, pp. 1645–1660, 2013.
[10] R. Hamdi, E. Baccour, A. Erbad, M. Qaraqe, and M. Hamdi, “LoRa-RL: Deep reinforcement learning for resource management in hybrid energy LoRa wireless networks,” IEEE Internet of Things Journal, vol. 9, no. 9, pp. 6458–6476, 2021.
[11] L. Hu, C. Han, X. Wang, H. Zhu, and J. Ouyang, “Security enhancement for deep reinforcement learning-based strategy in energy-efficient wireless sensor networks,” Sensors, vol. 24, no. 6, Art. no. 1993, 2024.
[12] S. Khairy, P. Balaprakash, L. X. Cai, and Y. Cheng, “Constrained deep reinforcement learning for energy sustainable multi-UAV based random access IoT networks with NOMA,” IEEE Journal on Selected Areas in Communications, vol. 39, no. 4, pp. 1101–1115, 2020.
[13] S. S. Khatami, M. Shoeibi, R. Salehi, and M. Kaveh, “Energy-efficient and secure double RIS-aided wireless sensor networks: A QoS-aware fuzzy deep reinforcement learning approach,” Journal of Sensor and Actuator Networks, vol. 14, no. 1, Art. no. 18, 2025.
[14] S. Li, L. D. Xu, and S. Zhao, “The internet of things: A survey,” Information Systems Frontiers, vol. 17, no. 2, pp. 243–259, 2015.
[15] X. Li, X. Wei, S. Chen, and L. Sun, “Multi-agent deep reinforcement learning based resource management in SWIPT enabled cellular networks with H2H/M2M coexistence,” Ad Hoc Networks, vol. 149, Art. no. 103256, 2023.
[16] J. C. Lopez-Ardao, R. F. Rodriguez-Rubio, A. Suarez- Gonzalez, M. Rodriguez-Perez, and M. E. Sousa-Vieira, “Current trends on green wireless sensor networks,” Sensors, vol. 21, no. 13, Art. no. 4281, 2021.
[17] Y. Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief, “A survey on mobile edge computing: The communication perspective,” IEEE Communications Surveys & Tutorials, vol. 19, no. 4, pp. 2322–2358, 2017.
[18] X. Meng, H. Inaltekin, and B. Krongold, “Deep reinforcement learning-based topology optimization for self-organized wireless sensor networks,” in Proc. IEEE Global Communications Conference (GLOBECOM), 2019, pp. 1–6.
[19] V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, et al., “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, 2015.
[20] P. Pandiyan, S. Saravanan, R. Kannadasan, S. Krishnaveni, M. H. Alsharif, and M. K. Kim, “A comprehensive review of advancements in green IoT for smart grids: Paving the path to sustainability,” Energy Reports, vol. 11, pp. 5504–5531, 2024.
[21] G. Sattibabu, N. Ganesan, and R. S. Kumaran, “IoTenabled wireless sensor networks optimization based on federated reinforcement learning for enhanced performance,” Peer-to-Peer Networking and Applications, vol. 18, no. 2, Art. no. 75, 2025.
[22] J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv preprint arXiv:1707.06347, 2017.
[23] S. P. Singh, N. Kumar, N. S. Alghamdi, G. Dhiman, W. Viriyasitavat, and A. Sapsomboon, “Next-gen WSN enabled IoT for consumer electronics in smart city: Elevating quality of service through reinforcement learningenhanced multi-objective strategies,” IEEE Transactions on Consumer Electronics, vol. 70, no. 4, pp. 6507–6518, 2024.
[24] M. Sohail, S. Khan, R. Ahmad, D. Singh, and J. Lloret, “Game theoretic solution for power management in IoTbased wireless sensor networks,” Sensors, vol. 19, no. 18, Art. no. 3835, 2019.
[25] I. Surenther, K. P. Sridhar, and M. K. Roberts, “Maximizing energy efficiency in wireless sensor networks for data transmission: A deep learning-based grouping model approach,” Alexandria Engineering Journal, vol. 83, pp. 53–65, 2023.
[26] H. Van Hasselt, A. Guez, and D. Silver, “Deep reinforcement learning with double Q-learning,” in Proc. AAAI Conference on Artificial Intelligence, vol. 30, no. 1, 2016.
[27] Z. Yang, K. Merrick, L. Jin, and H. A. Abbass, “Hierarchical deep reinforcement learning for continuous action control,” IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 11, pp. 5174–5184, 2018.
[28] M. U. Younus, M. K. Khan, and A. R. Bhatti, “Improving the software-defined wireless sensor networks routing performance using reinforcement learning,” IEEE Internet of Things Journal, vol. 9, no. 5, pp. 3495–3508, 2021.
[29] J. Yuan, J. Peng, Q. Yan, G. He, H. Xiang, and Z. Liu, “Deep reinforcement learning-based energy consumption optimization for peer-to-peer (P2P) communication in wireless sensor networks,” Sensors, vol. 24, no. 5, Art. no. 1632, 2024.
[30] B. Zhao and X. Zhao, “Deep reinforcement learning resource allocation in wireless sensor networks with energy harvesting and relay,” IEEE Internet of Things Journal, vol. 9, no. 3, pp. 2330–2345, 2021.
[31] S. P. Praveen, H. Dendukuri, C. Subbarao, V. J. Manasa, M. Saritha, and S. Poddar, “A stream-processingenabled AI framework for fast and reliable cybersecurity threat identification,” in Proc. 3rd International Conference on Sustainable Computing and Smart Systems (ICSCSS), 2025, pp. 2023–2029.
[32] S. P. Praveen, P. Panguluri, U. Sirisha, D. A. Dewi, T. B. Kurniawan, and L. Efrizoni, “Stacked LSTM with multi-head attention based model for intrusion detection,” Journal of Applied Data Sciences, vol. 7, no. 1, pp. 475–488, 2026.
[33] S. P. Praveen, K. Sharma, D. Parashar, V. S. N. Murthy, U. Sirisha, and D. A. Dewi, “Design of an iterative method for adaptive federated intrusion detection for energy-constrained edge-centric 6G IoT cyber-physical systems,” Scientific Reports, vol. 15, no. 1, Art. no. 41387, 2025.
[34] S. P. Praveen, M. Kamalrudin, M. Musa, U. Harita, Y. Ayyappa, and T. Nagamani, “A unified AI framework for confidentiality preserving cyberattack detection in healthcare cyber physical networks,” International Journal of Innovative Technology and Interdisciplinary Sciences, vol. 8, no. 3, pp. 818–841, 2025.
[35] R. Tang, Y.Wu, J. Tan, et al., “Research on rechargeable agricultural wireless sensor network based on ZigBee immune routing repair algorithm,” Scientific Reports, vol. 15, Art. no. 5756, 2025. [36] G. Priscilla, B. Kumar, S. Maidin, and Z. Attarbashi, “Trust aware congestion control mechanism for wireless sensor network,” Journal of Applied Data Sciences, vol. 6, no. 2, pp. 858–870, 2025.
[37] C. Feng, A. K. Jumaah Al-Nussairi, M. H. Chyad, et al., “AI powered blockchain framework for predictive temperature control in smart homes using wireless sensor networks and time shifted analysis,” Scientific Reports, vol. 15, Art. no. 18168, 2025.
Cite This Article
Choose your preferred format