Volume 15 , Issue 1 , PP: 11-21, 2025 | Cite this article as | XML | Html | PDF | Full Length Article
Zahraa Fadhel 1 , Hussien Attia 2 , Yossra Hussain Ali 3
Doi: https://doi.org/10.54216/JCIM.150102
Fake review detection, often known as spam review detection, is a crucial aspect of natural language processing. It involves extracting valuable information from text documents obtained from various sources. Various methodologies, such as simple rule-based approaches, lexicon-based methods, and advanced machine learning algorithms, have been extensively employed with diverse classifiers to provide accurate detection of fake reviews. Nevertheless, review classification based on lexicons continues to face challenges in achieving high accuracies, mostly because of the need for domain-specific comprehensive dictionaries. Furthermore, machine learning-driven review detection also addresses the limitations in accuracy caused by the uncertainty of features in social data. In order To address the problem of accuracy, one effective approach is to carefully choose the most optimal set of features and minimize the number of features used. The Objective of the research paper is to select a small subset of features out of the thousands of features for accurate classification of spam review detection. Therefore, a good feature selection method is needed in order to speed up the processing rate and predictive accuracy. This paper, Harris Hawks Optimization (HHO), is utilized for feature selection in sentiment analysis tasks. The performance of the selected feature subsets was evaluated using SVM, X-GBoost, ETC classifiers. Experimental results on tweet reviews for the airline dataset demonstrated superior sentiment classification capabilities, achieving an accuracy of 0.9435% with SVM and 0.9607%, 0.9635% for X-Boost, ETC, respectively.
Fake reviews detection , HHO , Feature selection , SVM , X_GBoost , ETC
[1] P. Akhtar et al., “Detecting fake news and disinformation using artificial intelligence and machine learning to avoid supply chain disruptions,” Ann. Oper. Res., pp. 633–657, 2022, doi: 10.1007/s10479-022-05015-5.
[2] A. M. Elmogy, U. Tariq, and A. Ibrahim, “Fake Reviews Detection using Supervised Machine Learning,” vol. 12, no. 1, pp. 601–606, 2021.
[3] “View of Comparative Analysis of MFO, GWO and GSO for Classification of Covid-19 Chest X-Ray Images.pdf.”
[4] A. A. Heidari, S. Mirjalili, H. Faris, I. Aljarah, M. Mafarja, and H. Chen, “Harris hawks optimization: Algorithm and applications,” Futur. Gener. Comput. Syst., vol. 97, pp. 849–872, 2019, doi: 10.1016/j.future.2019.02.028.
[5] M. Abdollahi, X. Gao, Y. Mei, S. Ghosh, and J. Li, “An Ontology-based Two-Stage Approach to Medical Text Classification with Feature Selection by Particle Swarm Optimisation,” 2019 IEEE Congr. Evol. Comput. CEC 2019 - Proc., pp. 119–126, 2019, doi: 10.1109/CEC.2019.8790259.
[6] A. Kumar and A. Jaiswal, “Swarm intelligence based optimal feature selection for enhanced predictive sentiment accuracy on twitter,” Multimed. Tools Appl., vol. 78, no. 20, pp. 29529–29553, 2019, doi: 10.1007/s11042-019-7278-0.
[7] T. Anuprathibha and C. S. Kanimozhiselvi, “Penguin search optimization based feature selection for automated opinion mining,” Int. J. Recent Technol. Eng., vol. 8, no. 3, pp. 648–653, 2019, doi: 10.35940/ijrte.B2629.098319.
[8] S. R. Priya, “Twitter Sentiment Analysis with Diabetic Drugs Using Machine Learning Techniques with Glowworm Swarm Optimization Algorithm,” vol. 9, no. 07, pp. 62–68, 2020.
[9] C. N. Dang, M. N. Moreno-García, and F. De La Prieta, “Hybrid Deep Learning Models for Sentiment Analysis,” Complexity, vol. 2021, 2021, doi: 10.1155/2021/9986920.
[10] A. M. Elmogy, U. Tariq, A. Ibrahim, and A. Mohammed, “Fake Reviews Detection using Supervised Machine Learning,” Int. J. Adv. Comput. Sci. Appl., vol. 12, no. 1, pp. 601–606, 2021, doi: 10.14569/IJACSA.2021.0120169.
[11] M. Imani and S. Noferesti, “Aspect extraction and classification for sentiment analysis in drug reviews,” J. Intell. Inf. Syst., vol. 59, no. 3, pp. 613–633, 2022.
[12] O. Y. Abdulhammed and P. J. Karim, “Sentiment Analysis using SVM-based SSO Intelligence Algorithm,” Passer J. Basic Appl. Sci., vol. 4, no. 02, pp. 17–39, 2022, doi: 10.24271/psr.2022.160801.
[13] A. M. Asri, S. R. Ahmad, and N. M. M. Yusop, “Feature Selection using Particle Swarm Optimization for Sentiment Analysis of Drug Reviews,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 5, pp. 286–295, 2023, doi: 10.14569/IJACSA.2023.0140530.
[14] S. Gouthami and N. P. Hegde, “Improving Sentiment Analysis on Imbalanced Airlines Twitter Data Using DSMOTE Technique,” SSRG Int. J. Electron. Commun. Eng., vol. 10, no. 9, pp. 38–51, 2023, doi: 10.14445/23488549/IJECE-V10I9P105.
[15] M. Periasamy, “Finding fake reviews in e-commerce platforms by using hybrid algorithms”.
[16] A. I. Saad, “Opinion Mining on US Airline Twitter Data Using Machine Learning Techniques,” 16th Int. Comput. Eng. Conf. ICENCO 2020, pp. 59–63, 2020, doi: 10.1109/ICENCO49778.2020.9357390.
[17] W. Z. H. U. Dong-, “JOURNAL OF SOUTHWEST JIAOTONG UNIVERSITY USocial SciencesR,” vol. 58, no. 1, pp. 1139–1149, 2011.
[18] M. R. Mandal, “An Advance Approach toward Sentiment Analysis using Swarm Intelligence,” Int. J. Res. Appl. Sci. Eng. Technol., vol. 9, no. VI, pp. 1710–1713, 2021, doi: 10.22214/ijraset.2021.35228.
[19] H. M. Alabool, D. Alarabiat, L. Abualigah, and A. A. Heidari, Harris hawks optimization: a comprehensive review of recent variants and applications, vol. 33, no. 15. Springer London, 2021. doi: 10.1007/s00521-021-05720-5.
[20] A. N. H. Zaied, M. Ismail, and S. El-Sayed, “A Survey on Meta-heuristic Algorithms for Global Optimization Problems,” J. Intell. Syst. Internet Things, vol. 1, no. 2, pp. 48–60, 2020, doi: 10.54216/jisiot.010104.
[21] J. Park, S. Kwon, and S. P. Jeong, “A study on improving turnover intention forecasting by solving imbalanced data problems: focusing on SMOTE and generative adversarial networks,” J. Big Data, vol. 10, no. 1, 2023, doi: 10.1186/s40537-023-00715-6.
[22] A. S. Issa, Y. H. Ali, and T. A. Rashid, “Comparative Analysis of Swarm Algorithms to Classification of covid19 on X-Rays,” 2022 Int. Conf. Data Sci. Intell. Comput. ICDSIC 2022, no. Icdsic, pp. 164–169, 2022, doi: 10.1109/ICDSIC56987.2022.10075733.
[23] S. By, “Asst. Prof. Dr. Samar Allouch 2023,” 2023.
[24] A. Patel, P. Oza, and S. Agrawal, “Sentiment Analysis of Customer Feedback and Reviews for Airline Services using Language Representation Model,” Procedia Comput. Sci., vol. 218, pp. 2459–2467, 2022, doi: 10.1016/j.procs.2023.01.221.
[25] I. Journal, F. Technological, and A. Rai, “Us Airline Twitter Data on Sentiment Analysis Using Deep Neural Network,” vol. 9, no. 8, pp. 1–9, 2022.
[26] J. R. Monalisha Sahoo, “Survey on Sentiment Analysis to Predict Twitter Data using Machine Learning and Deep Learning,” Int. J. Eng. Res. Technol., vol. 11, no. 7, pp. 506–512, 2022.
[27] P. K. Jain, V. Saravanan, and R. Pamula, “A Hybrid CNN-LSTM: A Deep Learning Approach for Consumer Sentiment Analysis Using Qualitative User-Generated Contents,” ACM Trans. Asian Low-Resource Lang. Inf. Process., vol. 20, no. 5, 2021, doi: 10.1145/3457206.
[28] M. V. K. Et.al, “Collaborative Classification Approach for Airline Tweets Using Sentiment Analysis,” Turkish J. Comput. Math. Educ., vol. 12, no. 3, pp. 3597–3603, 2021, doi: 10.17762/turcomat.v12i3.1639.
[29] K. M. Hasib, M. A. Habib, N. A. Towhid, and M. I. H. Showrov, “A Novel Deep Learning based Sentiment Analysis of Twitter Data for US Airline Service,” 2021 Int. Conf. Inf. Commun. Technol. Sustain. Dev. ICICT4SD 2021 - Proc., pp. 450–455, 2021, doi: 10.1109/ICICT4SD50815.2021.9396879.
[30] M. Umer, I. Ashraf, A. Mehmood, S. Kumari, S. Ullah, and G. Sang Choi, “Sentiment analysis of tweets using a unified convolutional neural network-long short-term memory network model,” Comput. Intell., vol. 37, no. 1, pp. 409–434, 2021, doi: 10.1111/coin.12415.