International Journal of Academic Research in Business and Social Sciences

Comparing MobileNet-SSD and YOLO v3 Learning Architecture for Real-time Driver’s Fatigue Detection

Open access
Convolutional Neural Networks (CNNs) are known to achieve high accuracy in classification, recognition, and detection problems. In a real-time environment, however, processing time is an important consideration. Even though most CNN-based architectures achieve considerably high accuracy, they remain slow even on high-end hardware. This paper therefore compares the time-accuracy trade-off between two recent CNN-based learning architectures, MobileNet-SSD and YOLO v3, in detecting a driver’s fatigue status. In our work, we define fatigue based on the rate of eye blinking. We developed proof-of-concept systems and evaluated them on accuracy and detection speed. Both learning architectures were trained and tested using the Closed Eyes in the Wild (CEW) dataset, which contains 1,193 closed-eye images and 1,232 open-eye images. As MobileNet-SSD and YOLO v3 were pre-trained on the general COCO dataset, they were further configured and fine-tuned to optimize the results on the CEW dataset. The results showed that YOLO v3 has a slightly higher mean Average Precision (mAP) than MobileNet-SSD but a slower detection speed (measured in ms), while MobileNet-SSD is much faster and still maintains high accuracy. The results also showed a trade-off between speed and accuracy, in which some accuracy is lost to obtain faster speed. This research also showed that the lightweight MobileNet-SSD can minimize the accuracy loss incurred to gain speed: its accuracy was still considered high, and its detection speed was far higher than that of YOLO v3. Therefore, the MobileNet-SSD learning model was selected as offering the best speed-accuracy trade-off in this research.
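As an illustration of how a blink rate might be derived from per-frame detector output, the following is a minimal sketch and not the authors' implementation. It assumes that each video frame has already been labelled as closed-eye or open-eye by a detector such as MobileNet-SSD or YOLO v3. The function names, the `min_frames` parameter, and the rule that one run of consecutive closed-eye frames counts as one blink are all hypothetical choices made for this example.

```python
from typing import Sequence


def count_blinks(closed: Sequence[bool], min_frames: int = 1) -> int:
    """Count runs of consecutive closed-eye frames at least min_frames long.

    `closed[i]` is True when the detector labelled frame i as closed-eye.
    Treating one such run as one blink is an assumption of this sketch.
    """
    blinks = 0
    run = 0  # length of the current run of closed-eye frames
    for is_closed in closed:
        if is_closed:
            run += 1
        else:
            if run >= min_frames:
                blinks += 1
            run = 0
    # A run that is still open when the clip ends is counted as well.
    if run >= min_frames:
        blinks += 1
    return blinks


def blinks_per_minute(closed: Sequence[bool], fps: float, min_frames: int = 1) -> float:
    """Convert the blink count of a clip into blinks per minute."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    if not closed:
        return 0.0
    duration_min = len(closed) / fps / 60.0
    return count_blinks(closed, min_frames) / duration_min


if __name__ == "__main__":
    # Hypothetical 30-frame clip with two closed-eye runs.
    frames = [False] * 10 + [True] * 3 + [False] * 10 + [True] * 2 + [False] * 5
    print(count_blinks(frames))
    print(blinks_per_minute(frames, fps=1.0))
```

The resulting rate could then be compared against a threshold chosen for the application. The abstract does not state the threshold used in the paper, so none is assumed here.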
In-Text Citation: (Jamil et al., 2021)
To Cite this Article: Jamil, N., Fadhil, M. H. M., Hamzah, R., & Ramli, M. I. (2021). Comparing MobileNet-SSD and YOLO v3 Learning Architecture for Real-time Driver’s Fatigue Detection. International Journal of Academic Research in Business and Social Sciences, 11(12), 2409–2419.