Clustering and Classification of IoT-Based Environmental Data Using Machine Learning Techniques

 

 

 

Ali Subhi Alhumaima1, Waleed Khalid Al-Zubaidi1, El-Sayed M. El-Kenawy2,3,*, Marwa M. Eid4,5

 

1Electronic Computer Centre, University of Diyala, Diyala, Iraq

 

2Department of Communications and Electronics, Delta Higher Institute of Engineering and Technology, Mansoura, 35111, Egypt

 

3Applied Science Research Center. Applied Science Private University, Amman, Jordan

 

4Faculty of Artificial Intelligence, Delta University for Science and Technology, Mansoura, Egypt

 

5Jadara Research Center, Jadara University, Irbid 21110, Jordan

 

Emails: alhumaimaali@uodiyala.edu.iq; waleed300@uodiyala.edu.iq; skenawy@ieee.org; mmm@ieee.org

 

 

 

 

 

Abstract

 

In this study, we present an integrated approach to IoT-based environmental data analysis using a collection of unsupervised-learning techniques. We employed KMeans clustering in particular to identify natural groupings in environmental and behavioral features such as air quality, noise level, temperature, stress level, sleeping hours, and mood score. We then trained a Decision Tree classifier to predict and interpret cluster membership from raw sensor readings. The data of more than 30,000 observations in indoor school environments has multifaceted relationships between environmental factors and psychological well-being. KMeans consistently detected three environmental-behavioral states, and the Decision Tree classifier performed 87% classification accuracy, which indicated extremely high predictability power in addition to interpretability. The results indicated that sleep duration, air, and stress were the main factors for cluster discrimination. The hybrid model introduces the potential of observing real-time environmental and mental states for applications in smart cities. The approach is scalable, interpretable, and usable in IoT settings for proactivity-enabled wellness management.

 

Keywords: IoT Sensor Data; Environmental Monitoring; KMeans Clustering; Decision Tree Classification; Behavioral Analysis; Air Quality; Stress Prediction; Machine Learning, Data Mining