Coronavirus disease 2019 (COVID-19) is an ongoing global pandemic caused by a virus that is easily transmitted and life-threatening. The number of infections and deaths is increasing in almost all affected countries. Currently, no clinically approved vaccine is available. Early prediction is necessary to help healthcare systems strategize and reduce the spread of the virus. Deciding whether a patient is infected is critical, because an infected patient is a potential threat to others. Supervised Machine Learning (SML) models have demonstrated promising performance in various prediction applications and can improve decision making. This research therefore investigates the capability of SML models to predict, based on certain symptoms, whether a patient is infected with COVID-19. A comparative analysis of seven standard SML prediction models was conducted: Adaboost, K-Nearest Neighbor, Logistic Regression, Naive Bayes, Neural Network, Random Forest, and Support Vector Machine. The research uses a publicly available dataset from kaggle.com that consists of twenty symptoms collected from eight different countries. The Random Forest results revealed that the five most important predictors are tiredness, fever, dry cough, nasal congestion, and age above 60. These predictors are consistent across all eight countries. The experimental results also indicate that Neural Network achieves the best predictive results, followed by Adaboost.
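The comparison described in the abstract can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the authors' actual pipeline: the data are synthetic, the feature names beyond the five symptoms named above are placeholders, and the choice of `BernoulliNB` for Naive Bayes, the hyperparameters, and the 5-fold cross-validated accuracy metric are all assumptions.

```python
import numpy as np
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import BernoulliNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier
from sklearn.svm import SVC

# Hypothetical binary features: the five predictors named in the abstract
# plus three placeholders (the real dataset has twenty symptom columns).
FEATURES = [
    "tiredness", "fever", "dry_cough", "nasal_congestion", "age_over_60",
    "placeholder_1", "placeholder_2", "placeholder_3",
]


def make_synthetic_data(n_samples=400, seed=0):
    """Generate synthetic 0/1 symptom indicators and an infected/not label.

    The labelling rule is an assumption made for illustration only: the
    label depends on the first five columns plus Gaussian noise.
    """
    rng = np.random.default_rng(seed)
    X = rng.integers(0, 2, size=(n_samples, len(FEATURES)))
    score = X[:, :5].sum(axis=1) + rng.normal(0.0, 1.0, size=n_samples)
    y = (score > 2.5).astype(int)
    return X, y


def build_models(seed=0):
    """Return the seven model families listed in the abstract."""
    return {
        "Adaboost": AdaBoostClassifier(random_state=seed),
        "K-Nearest Neighbor": KNeighborsClassifier(n_neighbors=5),
        "Logistic Regression": LogisticRegression(max_iter=1000),
        "Naive Bayes": BernoulliNB(),  # assumed variant for binary inputs
        "Neural Network": MLPClassifier(
            hidden_layer_sizes=(16,), max_iter=500, random_state=seed
        ),
        "Random Forest": RandomForestClassifier(
            n_estimators=200, random_state=seed
        ),
        "Support Vector Machine": SVC(),
    }


def compare_models(X, y, cv=5):
    """Mean cross-validated accuracy for each model."""
    return {
        name: float(cross_val_score(model, X, y, cv=cv, scoring="accuracy").mean())
        for name, model in build_models().items()
    }


def rank_features(X, y, seed=0):
    """Rank features by Random Forest impurity-based importance."""
    forest = RandomForestClassifier(n_estimators=200, random_state=seed)
    forest.fit(X, y)
    order = np.argsort(forest.feature_importances_)[::-1]
    return [(FEATURES[i], float(forest.feature_importances_[i])) for i in order]


if __name__ == "__main__":
    X, y = make_synthetic_data()
    for name, acc in sorted(compare_models(X, y).items(), key=lambda kv: -kv[1]):
        print(f"{name:24s} accuracy = {acc:.3f}")
    for feature, weight in rank_features(X, y)[:5]:
        print(f"{feature:18s} importance = {weight:.3f}")
```

Because the data here are synthetic, the resulting ranking of models and features says nothing about the findings reported above; the sketch only shows the shape of such a comparison.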
In-Text Citation: (Ibrahim et al., 2021)
To Cite this Article: Ibrahim, Z., Diah, N. M., Rizal, N. A., & Yuri, M. N. (2021). Prediction of Early Symptoms of COVID-19 Infected Patients using Supervised Machine Learning Models. International Journal of Academic Research in Business and Social Sciences, 11(12), 2471–2481.
Copyright: © 2021 The Author(s)
Published by Knowledge Words Publications (www.kwpublications.com)
This article is published under the Creative Commons Attribution (CC BY 4.0) license. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this license may be seen at: http://creativecommons.org/licences/by/4.0/legalcode