Journal of Cybersecurity and Information Management

Journal DOI

https://doi.org/10.54216/JCIM

ISSN (Online): 2690-6775 | ISSN (Print): 2769-7851

Volume 15, Issue 2, pp. 305-321, 2025 | Full Length Article

Text Categorization for Information Retrieval Using NLP Models

Sundws M. Mohammed 1, Vijay Madaan 2, Rajaa Daami Resen 3, Neha Sharma 4, Oday Ali Hassen 5 *, Jamal kh-madhloom 6

  • 1 Sulaimani Polytechnic University College of Health and Medical Technology Anaesthesia, Iraq - (Sundws.Mustafa@spu.edu.iq)
  • 2 Chitkara University Institute of Engineering & Technology, Chitkara University, Rajpura, Punjab, India - (Vijaymadaan1@gmail.com)
  • 3 University of Information Technology and Communication, Iraq - (rajaa.alnidway@uoitc.edu.iq)
  • 4 Chitkara University Institute of Engineering & Technology, Chitkara University, Rajpura, Punjab, India - (nehasharma0110@gmail.com)
  • 5 Ministry of Education, Wasit Education Directorate, Iraq - (oday123456789.oa@gmail.com)
  • 6 Wasit University, College of Arts, Iraq - (Jamalkh@uowasit.edu.iq)
  • Doi: https://doi.org/10.54216/JCIM.150223

    Received: May 24, 2024 Revised: July 25, 2024 Accepted: November 14, 2024
    Abstract

    This paper applies state-of-the-art natural language processing (NLP) models and methods, such as BERT and DistilBERT, to evaluate textual data and extract noteworthy insights. The presented approach comprises preprocessing of the textual input, tokenization, and deep learning architectures such as bidirectional LSTMs for classification tasks, with the goal of producing accurate prediction models with the lowest possible validation loss. The manuscript addresses several NLP areas, including sentiment analysis, language understanding, and text classification, and pairs long short-term memory (LSTM) networks with NLP throughout. The results show that the proposed models perform exceptionally well: validation accuracy often exceeded 99.85%, and the average validation loss across all tests remained low at 0.0058. These results demonstrate the value of the approach for extracting insights from unstructured text data to support better decisions and to improve applications such as chatbots, virtual assistants, and information retrieval systems. They also indicate the flexibility and generalizability of the models across a range of tasks and textual materials.
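As a rough illustration of the preprocessing and tokenization steps named in the abstract, the following minimal sketch lowercases text, splits it into word tokens, builds a vocabulary, and pads each sequence to a fixed length so it could be fed to a sequence classifier such as a bidirectional LSTM. It is not the authors' pipeline: the function names (`preprocess`, `build_vocab`, `encode`), the reserved ids for padding and unknown tokens, the regular expression, and the sample sentences are all assumptions made for this example. BERT and DistilBERT use their own subword tokenizers, which this word-level sketch does not reproduce.

```python
import re
from collections import Counter

# Reserved ids (an assumed convention for this sketch, not taken from the paper).
PAD, UNK = 0, 1


def preprocess(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_vocab(texts, max_size=10000):
    """Map the most frequent tokens to integer ids, starting after PAD and UNK."""
    counts = Counter(tok for t in texts for tok in preprocess(t))
    most_common = counts.most_common(max_size - 2)
    return {tok: i + 2 for i, (tok, _) in enumerate(most_common)}


def encode(text, vocab, max_len=8):
    """Convert text to a fixed-length list of ids, truncating or padding with PAD."""
    ids = [vocab.get(tok, UNK) for tok in preprocess(text)][:max_len]
    return ids + [PAD] * (max_len - len(ids))


# Hypothetical two-document corpus used only to exercise the functions.
corpus = ["The service was excellent!", "The delivery was late."]
vocab = build_vocab(corpus)
batch = [encode(t, vocab) for t in corpus]
```

Each row of `batch` has the same length, which is what an embedding layer followed by a recurrent classifier typically expects; tokens outside the vocabulary map to the `UNK` id rather than being dropped.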

    Keywords:

    Natural Language Processing (NLP), Long Short-Term Memory (LSTM), Text Categorization

    Cite This Article As :
    Sundws M. Mohammed, Vijay Madaan, Rajaa Daami Resen, Neha Sharma, Oday Ali Hassen, Jamal kh-madhloom. Text Categorization for Information Retrieval Using NLP Models. Journal of Cybersecurity and Information Management, vol. 15, no. 2, 2025, pp. 305-321. DOI: https://doi.org/10.54216/JCIM.150223