Volume 7 , Issue 2 , PP: 79-90, 2022 | Cite this article as | XML | Html | PDF | Full Length Article
Harsh Taneja 1 * , Abhinav 2 , Apoorv 3 , Himanshu Mangal 4 , Naman Agarwal 5
Doi: https://doi.org/10.54216/FPA.070202
Coronavirus, the pandemic due to which about 4 million have lost their lives and counting, is still on. Many scientists and researchers are trying to find ways to detect coronavirus as soon as possible in the human body so that they can start their medication and precaution as soon as possible. Still, due to lack of lab facilities, the RT-PCR is taking more than three days to give the report, and in the meanwhile, patients get serious and life in danger. So in this paper, we proposed an audio-based coronavirus detection technique in which we can get results in minutes. Coronavirus is a respiratory disease, and the sound produced while breathing can tell us about the presence of coronavirus. Audio-based detection was already used for the detection of asthma, pneumonia. So, in this paper, we implemented a combination of machine learning and deep learning techniques to find the presence of Covid-19, and the model has an accuracy of 78% and an f1 score of 74%. This technique can be used as a starting point for just audio data to diagnose diseases and save lives.
COVID-19, Cough, Voice, Machine Learning, VGG-19, Classifiers, KNN, Audio Analysis
1. Mouawad, Pauline, Tammuz Dubnov, and Shlomo Dubnov. ”Robust Detection of COVID-19 in Cough Sounds.” SN Computer Science 2.1 (2021): 1-13.
2. World Health Organization. ”World Health Organization coronavirus disease 2019 (COVID-19) situation report.” (2020).
3. Deshpande, Gauri, and Björn Schuller. ”An overview on audio, signal, speech, & language processing for covid-19.” arXiv preprint arXiv:2005.08579 (2020).
4. Sharma, Neeraj, et al. ”Coswara–A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis.” arXiv preprint arXiv:2005.10548 (2020). https://github.com/iiscleap/Coswara-Data.
5. McFee, Brian, et al. ”librosa: Audio and music signal analysis in python.” Proceedings of the 14th python in science conference. Vol. 8. 2015.
6. Dalmazzo, David, and Rafael Ramirez. ”Mel-spectrogram Analysis to Identify Patterns in Musical Gestures: a Deep Learning Approach.” MML 2020: 1.
7. Mateen, Muhammad, et al. ”Fundus image classification using VGG-19 architecture with PCA and SVD.” Symmetry 11.1 (2019): 1.
8. Cherif, Iyad Lahsen, and Abdesselem Kortebi. ”On using eXtreme gradient boosting (XGBoost) machine learning algorithm for home network traffic classification.” 2019 Wireless Days (WD). IEEE, 2019.
9. Santosh, K. C. ”Speech Processing in Healthcare: Can We Integrate?.” Intelligent Speech Signal Processing. Academic Press, 2019. 1-4.
10. Abeyratne, Udantha R., et al. ”Cough sound analysis can rapidly diagnose childhood pneumonia.” Annals of biomedical engineering 41.11 (2013): 2448-2462.
11. Swarnkar, Vinayak, et al. ”Automatic identification of wet and dry cough in pediatric patients with respiratory diseases.” Annals of biomedical engineering 41.5 (2013): 1016-1028.
12. Al-Khassaweneh, Mahmood, and Ra’ed Bani Abdelrahman. ”A signal processing approach for the diagnosis of asthma from cough sounds.” Journal of medical engineering & technology 37.3 (2013): 165-171.
13. Boyanov, Boyan, and Stefan Hadjitodorov. ”Acoustic analysis of pathological voices. A voice analysis system for the screening of laryngeal diseases.” IEEE Engineering in Medicine and Biology Magazine 16.4 (1997): 74-82.
14. Imran, Ali, et al. ”AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app.” Informatics in Medicine Unlocked 20 (2020): 100378.
15. Lella, Kranthi Kumar, and P. J. A. Alphonse. ”A literature review on COVID-19 disease diagnosis from respiratory sound data.” AIMS Bioengineering 8.2 (2021): 140-153.
16. Pan, Sinno Jialin, and Qiang Yang. ”A survey on transfer learning.” IEEE Transactions on knowledge and data engineering 22.10 (2009): 1345-1359.
17. Livingston, Frederick. ”Implementation of Breiman’s random forest machine learning algorithm.” ECE591Q Machine Learning Journal Paper (2005): 1-13.
18. Brown, Chloë, et al. ”Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data.” arXiv preprint arXiv:2006.05919 (2020).
19. Peng, Chao-Ying Joanne, Kuk Lida Lee, and Gary M. Ingersoll. "An introduction to logistic regression analysis and reporting." The journal of educational research 96.1 (2002): 3-14.
20. Feng, Ji, Yang Yu, and Zhi-Hua Zhou. "Multi-layered gradient boosting decision trees." arXiv preprint arXiv:1806.00007 (2018).
21. Vishwanathan, S. V. M., and M. Narasimha Murty. "SSVM: a simple SVM algorithm." Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No. 02CH37290). Vol. 3. IEEE, 2002.
22. Shuja, Junaid, et al. "COVID-19 open source data sets: a comprehensive survey." Applied Intelligence 51.3 (2021): 1296-1325.
23. Hwang, Yeongtae, et al. "Mel-spectrogram augmentation for sequence to sequence voice conversion." arXiv preprint arXiv:2001.01401 (2020).
24. Nanni, Loris, et al. "An ensemble of convolutional neural networks for audio classification." Applied Sciences 11.13 (2021): 5796.
25. Godino-Llorente, Juan Ignacio, et al. "Support vector machines applied to the detection of voice disorders." International Conference on Nonlinear Analyses and Algorithms for Speech Processing. Springer, Berlin, Heidelberg, 2005.
26. Hemdan, Ezz El-Din, Marwa A. Shouman, and Mohamed Esmail Karar. "Covidx-net: A framework of deep learning classifiers to diagnose covid-19 in x-ray images." arXiv preprint arXiv:2003.11055 (2020).
27. Pham, Tuan D. "A comprehensive study on classification of COVID-19 on computed tomography with pretrained convolutional neural networks." Scientific reports 10.1 (2020): 1-8.
28. Subramanian, Vishnu. Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch. Packt Publishing Ltd, 2018.
29. Ayesha, Shaeela, Muhammad Kashif Hanif, and Ramzan Talib. "Overview and comparative study of dimensionality reduction techniques for high dimensional data." Information Fusion 59 (2020): 44-58.
30. Quinlan, J. Ross. "Learning decision tree classifiers." ACM Computing Surveys (CSUR) 28.1 (1996): 71-72.
31. Ghojogh, Benyamin, and Mark Crowley. "Linear and quadratic discriminant analysis: Tutorial." arXiv preprint arXiv:1906.02590 (2019).
32. Agarap, Abien Fred. "An architecture combining convolutional neural network (CNN) and support vector machine (SVM) for image classification." arXiv preprint arXiv:1712.03541 (2017). 33. Chen, Tianqi, and Carlos Guestrin. "Xgboost: A scalable tree boosting system." Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. 2016.