ITcon Vol. 30, pg. 1398-1430, http://www.itcon.org/2025/57

Deep Learning Enabled Computer Vision Model for Automated Safety Compliance in Construction Environments

DOI:10.36680/j.itcon.2025.057
submitted:May 2025
revised:August 2025
published:September 2025
editor(s):Bosché F
authors:Amr A. Mohy, PhD Student, MSc, BSc, PMP, PRINCE2, PSM-1
Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt
A.el-deen5146@student.aast.edu

Hesham A. Bassioni, Professor of Construction Management, PhD, MSc, MBA, BSc, PMP
Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt,
hbassioni@aast.edu

Elbadr O. Elgendi, Associate Professor of Construction Engineering and Management, PhD, MSc, BSc
Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt
elbadrosman@aast.edu

Tarek M. Hassan, Professor of Construction Informatics, BSc., MSc., PhD, MASCE, FCIOB
School of Architecture, Building and Civil Engineering, Loughborough University, United Kingdom
T.Hassan@Lboro.ac.uk
summary:Construction site safety demands proactive hazard detection, a challenge traditionally met with reactive measures that are often inadequate. This paper introduces a novel deep learning-based computer vision model designed for automated safety compliance monitoring, addressing critical limitations of existing approaches. The model utilizes a modified one-stage object detection algorithm, uniquely enhanced with Contextual Transformer Networks (CoTs), a Triplet Attention module, Activate or Not (ACON) activation functions, and Content-Aware Reassembly of Features (CARAFE) up-sampling, to significantly improve feature extraction, visual recognition, and contextual understanding in complex construction environments. To support this model development, a new OSHA-data-driven dataset of 55,594 images across 28 safety categories was developed. This dataset encompasses personal protective equipment (PPE), scaffolding, materials, hazards, and worker actions, ensuring comprehensive coverage of key safety domains. The Wise-Intersection over Union (IoU) loss function further refines bounding box regression, enhancing localization accuracy. Evaluations on both a benchmarking dataset and the newly developed dataset demonstrate the model's benchmark-surpassing performance (Precision: 0.89, mAP95: 0.45). This research offers a practically viable, data-driven solution for a critical industry challenge, moving towards a future of zero-accident construction sites.
keywords:Construction Safety Management, Artificial Intelligence, Automated Hazard Detection, Object Detection, Computer Vision
full text: (PDF file, 1.196 MB)
citation:Mohy A A, Bassioni H A, Elgendi E O, Hassan T M (2025). Deep Learning Enabled Computer Vision Model for Automated Safety Compliance in Construction Environments, ITcon Vol. 30, pg. 1398-1430, https://doi.org/10.36680/j.itcon.2025.057
statistics: