A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks

Suhasini Monga; Damandeep Kaur

doi:https://doi.org/10.54216/IJWAC.100204

A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks

Suhasini Monga ^{1
*} , Damandeep Kaur ²

1 Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, Punjab, India - (suhasini.monga@gmail.com)

2 Department of CSE, Chandigarh University, Mohali, India - ( daman03.cu@gmail.com)

Doi: https://doi.org/10.54216/IJWAC.100204

Received: January 31, 2026 Revised: March 08, 2026 Accepted: May 07, 2026

Abstract

Battery-powered sensor nodes expire when their energy reserves are depleted, terminating data collection regardless of the physical integrity of the hardware. Solar harvesting offers a viable path to perpetual operation, but only when the routing layer can continuously track the time-varying energy state of every node and steer traffic away from nodes likely to be power-starved in the near future. Classical clustering and chain-based protocols select forwarding paths without regard to harvested energy, leading to premature node death even when sufficient solar income would have been available to sustain operation. This paper presents a deep reinforcement learning framework in which each sensor node operates an independent Deep Q-Network agent that adapts its next-hop forwarding decision based on local battery state, short-horizon solar energy forecasts, link quality estimates, and the residual energy levels of candidate neighbours. A lightweight LSTM sub-model provides the solar prediction horizon that the agent uses as part of its state representation, enabling it to distinguish nodes that are temporarily depleted but will recover from those whose batteries are trending toward permanent failure. Extensive simulation across a 100-node deployment over 3,000 operational rounds confirms that the proposed approach substantially extends network lifetime, improves packet delivery, and reduces wasted harvested energy compared with five competitive baselines. Reward function ablation, scalability experiments, and an energy neutrality verification further validate the design choices and confirm stability across a wide range of deployment conditions.

Keywords :

Wireless sensor networks , Energy harvesting , Deep Q-Network , Adaptive routing , Network lifetime , Solar power , LSTM forecasting , Reinforcement learning , IoT sustainability

References

[1] X. Zhong, Y. Liang, and Y. Li, “Energy-efficient and robust QoS control for wireless sensor networks using the extended Gur game,” Sensors, vol. 25, no. 3, p. 730, 2025, doi: 10.3390/s25030730.

[2] O. A. Khashan, N. M. Khafajah, W. Alomoush, and M. Alshinwan, “Innovative energy-efficient proxy reencryption for secure data exchange in wireless sensor networks,” IEEE Access, vol. 12, pp. 23 290–23 304, 2024, doi: 10.1109/ACCESS.2024.3360488.

[3] W. R. Heinzelman, A. Chandrakasan, and H. Balakrishnan, “Energy-efficient communication protocol for wireless microsensor networks,” in Proc. 33rd Hawaii Int. Conf. System Sciences, 2000, doi: 10.1109/HICSS.2000.926982.

[4] S. Lindsey and C. S. Raghavendra, “PEGASIS: Powerefficient gathering in sensor information systems,” IEEE Aerospace Conference Proceedings, vol. 3, pp. 1125– 1130, 2002, doi: 10.1109/AERO.2002.1035242.

[5] H. Guo, R. Wu, B. Qi, and C. Xu, “Deep-Q-networksbased adaptive dual-mode energy-efficient routing in rechargeable wireless sensor networks,” IEEE Sensors Journal, vol. 22, pp. 9956–9966, 2022, doi: 10.1109/JSEN.2022.3163368.

[6] D. Prabhu, R. Alageswaran, and S. Miruna Joe Amali, “Multiple agent based reinforcement learning for energy efficient routing in WSN,” Wireless Networks, vol. 29, no. 4, pp. 1787–1797, 2023, doi: 10.1007/s11276-022- 03048-3.

[7] A. S. Balobaid, S. B. Ahamed, S. Shamsudheen, and S. Balamurugan, “Neural network clustering and swarm intelligence-based routing protocol for wireless sensor networks,” Wireless Communications and Mobile Computing, vol. 2023, p. 4758852, 2023, doi: 10.1155/2023/4758852.

[8] C. J. C. H. Watkins and P. Dayan, “Q-learning,” Machine Learning, vol. 8, no. 3–4, pp. 279–292, 1992, doi: 10.1007/BF00992698.

[9] D. Godfrey, B. Suh, B.-H. Lim, K.-C. Lee, and K.-I. Kim, “An energy-efficient routing protocol with reinforcement learning in software-defined wireless sensor networks,” Sensors, vol. 23, no. 20, p. 8435, 2023, doi: 10.3390/s23208435.

[10] A. Barat, K. J. Prabuchandran, and S. Bhatnagar, “Energy management in a cooperative energy harvesting wireless sensor network,” IEEE Communications Letters, vol. 28, pp. 243–247, 2024, doi: 10.1109/LCOMM.2023.3335143.

[11] N. S. Albalawi, Y. Alzahrani, N. Alsalmi, Y. Patidar, and M. Tolani, “Energy-efficient priority encoding strategies using machine learning based hybrid MAC protocol for wireless sensor networks,” Scientific Reports, vol. 15, p. 45054, 2025, doi: 10.1038/s41598-025-31752-1.

[12] S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735– 1780, 1997, doi: 10.1162/neco.1997.9.8.1735.

Cite This Article As :

Monga, Suhasini. , Kaur, Damandeep. A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks. International Journal of Wireless and Ad Hoc Communication, vol. , no. , 2026, pp. 27–35. DOI: https://doi.org/10.54216/IJWAC.100204

Monga, S. Kaur, D. (2026). A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks. International Journal of Wireless and Ad Hoc Communication, (), 27–35. DOI: https://doi.org/10.54216/IJWAC.100204

Monga, Suhasini. Kaur, Damandeep. A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks. International Journal of Wireless and Ad Hoc Communication , no. (2026): 27–35. DOI: https://doi.org/10.54216/IJWAC.100204

Monga, S. , Kaur, D. (2026) . A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks. International Journal of Wireless and Ad Hoc Communication , () , 27–35 . DOI: https://doi.org/10.54216/IJWAC.100204

Monga S. , Kaur D. [2026]. A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks. International Journal of Wireless and Ad Hoc Communication. (): 27–35. DOI: https://doi.org/10.54216/IJWAC.100204

Monga, S. Kaur, D. "A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks," International Journal of Wireless and Ad Hoc Communication, vol. , no. , pp. 27–35, 2026. DOI: https://doi.org/10.54216/IJWAC.100204

International Journal of Wireless and Ad Hoc Communication

Journal Menu

Journal Volumes

Volume 0

Volume 1

Volume 2

Volume 3

Volume 4

Volume 5

Volume 6

Volume 7

Volume 8

Volume 9

Volume 10

A Deep Reinforcement Learning Framework with Solar Energy Forecasting for Adaptive Routing and Lifetime Extension in Energy-Harvesting Wireless Sensor Networks

Abstract

Keywords :

References

Cite This Article As :

Article Statistics

Download