Journal of Information Technology in Construction
ITcon Vol. 30, pg. 1398-1430, http://www.itcon.org/2025/57
Deep Learning Enabled Computer Vision Model for Automated Safety Compliance in Construction Environments
DOI: | 10.36680/j.itcon.2025.057 | |
submitted: | May 2025 | |
revised: | August 2025 | |
published: | September 2025 | |
editor(s): | Bosché F | |
authors: | Amr A. Mohy, PhD Student, MSc, BSc, PMP, PRINCE2, PSM-1
Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt A.el-deen5146@student.aast.edu Hesham A. Bassioni, Professor of Construction Management, PhD, MSc, MBA, BSc, PMP Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt, hbassioni@aast.edu Elbadr O. Elgendi, Associate Professor of Construction Engineering and Management, PhD, MSc, BSc Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt elbadrosman@aast.edu Tarek M. Hassan, Professor of Construction Informatics, BSc., MSc., PhD, MASCE, FCIOB School of Architecture, Building and Civil Engineering, Loughborough University, United Kingdom T.Hassan@Lboro.ac.uk | |
summary: | Construction site safety demands proactive hazard detection, a challenge traditionally met with reactive measures that are often inadequate. This paper introduces a novel deep learning-based computer vision model designed for automated safety compliance monitoring, addressing critical limitations of existing approaches. The model utilizes a modified one-stage object detection algorithm, uniquely enhanced with Contextual Transformer Networks (CoTs), a Triplet Attention module, Activate or Not (ACON) activation functions, and Content-Aware Reassembly of Features (CARAFE) up-sampling, to significantly improve feature extraction, visual recognition, and contextual understanding in complex construction environments. To support this model development, a new OSHA-data-driven dataset of 55,594 images across 28 safety categories was developed. This dataset encompasses personal protective equipment (PPE), scaffolding, materials, hazards, and worker actions, ensuring comprehensive coverage of key safety domains. The Wise-Intersection over Union (IoU) loss function further refines bounding box regression, enhancing localization accuracy. Evaluations on both a benchmarking dataset and the newly developed dataset demonstrate the model's benchmark-surpassing performance (Precision: 0.89, mAP95: 0.45). This research offers a practically viable, data-driven solution for a critical industry challenge, moving towards a future of zero-accident construction sites. | |
keywords: | Construction Safety Management, Artificial Intelligence, Automated Hazard Detection, Object Detection, Computer Vision | |
full text: | (PDF file, 1.196 MB) | |
citation: | Mohy A A, Bassioni H A, Elgendi E O, Hassan T M (2025). Deep Learning Enabled Computer Vision Model for Automated Safety Compliance in Construction Environments, ITcon Vol. 30, pg. 1398-1430, https://doi.org/10.36680/j.itcon.2025.057 | |
statistics: |