Journal of Intelligent Systems and Internet of Things

Journal DOI

https://doi.org/10.54216/JISIoT

Submit Your Paper

2690-6791ISSN (Online) 2769-786XISSN (Print)

Volume 11 , Issue 2 , PP: 22-29, 2024 | Cite this article as | XML | Html | PDF | Full Length Article

Improving Support vector machine for Imbalanced big data classification

Alaa Abdulazeez Qanbar 1 * , Zakariya Yahya Algamal 2

  • 1 Department of Statistics and Informatics, University of Mosul, Mosul, Iraq - (alaa.22csp59@student.uomosul.edu.iq)
  • 2 Department of Statistics and Informatics, University of Mosul, Mosul, Iraq - (zakariya.algamal@uomosul.edu.iq)
  • Doi: https://doi.org/10.54216/JISIoT.110202

    Received: August 17, 2023 Revised: November 11, 2023 Accepted: January 11, 2024
    Abstract

    A significant proportion of one type of pattern and a relatively small quantity of another type of pattern can be found in many unbalanced real data sets. In addition, finding significant observations and excluding influential observations is effectively accomplished through diagnostic analysis. Support vector machines (SVM), a common classification technique, perform poorly on imbalanced datasets and when influential observations exist. In this research, the pigeon optimization algorithm as a metaheuristic algorithm is employed to address the influence observation issues in SVM. Experiments are done on three real sets of data. Our approach provides higher classification accuracy compared to other widely used algorithms. This approach could be used for further biological, chemical, and medical datasets.

    Keywords :

    Pigeon optimization algorithm , meta-heuristic algorithm , imbalanced data , support vector machine.

    References

    [1] Ismael OM, Qasim OS, Algamal ZY. Improving Harris hawks optimization algorithm for hyperparameters estimation and feature selection in v‐support vector regression based on opposition‐based learning. Journal of Chemometrics. 2020;34(11). doi: 10.1002/cem.3311.

    [2] Qasim OS, Algamal ZY. A gray wolf algorithm for feature and parameter selection of support vector classification. International Journal of Computing Science and Mathematics. 2021;13(1):93-102.

    [3] Guido R, Groccia MC, Conforti D. A hyper-parameter tuning approach for cost-sensitive support vector machine classifiers. Soft Computing. 2023;27(18):12863-12881.

    [4] Wang Y, Xu Y. A non-convex robust small sphere and large margin support vector machine for imbalanced data classification. Neural Computing Applications. 2023;35(4):3245-3261.

    [5] Wang Z, Liu Q. Imbalanced Data Classification Method Based on LSSASMOTE. IEEE Access. 2023;11:32252-32260.

    [6] Vapnik V. The nature of statistical learning theory. Springer science & business media; 1999.

    [7] Widodo CE, Adi K, Gernowo R. A support vector machine approach for identification of pleural effusion. Heliyon. 2023.

    [8] Cervantes J, Li X, Yu W. Imbalanced data classification via support vector machines and genetic algorithms. Connection Science. 2014;26(4):335-348.

    [9] Benítez-Peña S, Blanquero R, Carrizosa E, et al. Cost-sensitive probabilistic predictions for support vector machines. European Journal of Operational Research. 2023.

    [10] Tang Y, Zhang Y-Q, Chawla NV, et al. SVMs modeling for highly imbalanced classification. IEEE Transactions on Systems, Man, Cybernetics, Part B. 2008;39(1):281-288.

    [11] Rocha AV, Simas AB. Influence diagnostics in a general class of beta regression models. Test. 2011;20:95-119.

    [12] Algamal ZY. Diagnostic in poisson regression models. Electronic Journal of Applied Statistical Analysis. 2012;5(2):178-186.

    [13] Al-Thanoon NA, Algamal ZY, Qasim OS. Feature selection based on a crow search algorithm for big data classification. Chemometrics and Intelligent Laboratory Systems. 2021;212. doi: 10.1016/j.chemolab.2021.104288.

    [14] Lin K-C, Chen S-Y, Hung JC. Feature Selection and Parameter Optimization of Support Vector Machines Based on Modified Artificial Fish Swarm Algorithms. Mathematical Problems in Engineering. 2015;2015:1-9. doi: 10.1155/2015/604108.

    [15] Saad Y, Shaker K. Support Vector Machine and Back Propagation Neural Network Approach for Text Classification. Journal of University of Human Development. 2017;3(2):869-876. doi: 10.21928/juhd.20170610.40.

    [16] Tharwat A, Hassanien AE. Chaotic antlion algorithm for parameter optimization of support vector machine. Applied Intelligence. 2017;48(3):670-686. doi: 10.1007/s10489-017-0994-0.

    [17] Duan H, Qiao P. Pigeon-inspired optimization: a new swarm intelligence optimizer for air robot path planning. International Journal of Intelligent Computing and Cybernetics. 2014;7(1):24-37. doi: 10.1108/ijicc-02-2014-0005.

     

    Cite This Article As :
    Abdulazeez, Alaa. , Yahya, Zakariya. Improving Support vector machine for Imbalanced big data classification. Journal of Intelligent Systems and Internet of Things, vol. , no. , 2024, pp. 22-29. DOI: https://doi.org/10.54216/JISIoT.110202
    Abdulazeez, A. Yahya, Z. (2024). Improving Support vector machine for Imbalanced big data classification. Journal of Intelligent Systems and Internet of Things, (), 22-29. DOI: https://doi.org/10.54216/JISIoT.110202
    Abdulazeez, Alaa. Yahya, Zakariya. Improving Support vector machine for Imbalanced big data classification. Journal of Intelligent Systems and Internet of Things , no. (2024): 22-29. DOI: https://doi.org/10.54216/JISIoT.110202
    Abdulazeez, A. , Yahya, Z. (2024) . Improving Support vector machine for Imbalanced big data classification. Journal of Intelligent Systems and Internet of Things , () , 22-29 . DOI: https://doi.org/10.54216/JISIoT.110202
    Abdulazeez A. , Yahya Z. [2024]. Improving Support vector machine for Imbalanced big data classification. Journal of Intelligent Systems and Internet of Things. (): 22-29. DOI: https://doi.org/10.54216/JISIoT.110202
    Abdulazeez, A. Yahya, Z. "Improving Support vector machine for Imbalanced big data classification," Journal of Intelligent Systems and Internet of Things, vol. , no. , pp. 22-29, 2024. DOI: https://doi.org/10.54216/JISIoT.110202