Deep Learning Enabled Computer Vision Model for Automated Safety Compliance in Construction Environments

Amr A. Mohy; Hesham A. Bassioni; Elbadr O. Elgendi; Tarek M. Hassan

Journal of Information Technology in Construction

ISSN: 1874-4753

Editor-in-chief:

Robert Amor

Web of Science:

IF (2024): 3.8, Q1

Acknowledgement

Journal is partially sponsored by:

Slovenian Research and Innovation Agency

ITcon Vol. 30, pg. 1398-1430, http://www.itcon.org/2025/57

DOI:	10.36680/j.itcon.2025.057
submitted:	May 2025
revised:	August 2025
published:	September 2025
editor(s):	Bosché F
authors:	Amr A. Mohy, PhD Student, MSc, BSc, PMP, PRINCE2, PSM-1, Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt, A.el-deen5146@student.aast.edu
	Hesham A. Bassioni, Professor of Construction Management, PhD, MSc, MBA, BSc, PMP, Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt, hbassioni@aast.edu
	Elbadr O. Elgendi, Associate Professor of Construction Engineering and Management, PhD, MSc, BSc, Construction and Building Engineering Department, Arab Academy for Science and Technology and Maritime Transport, Egypt, elbadrosman@aast.edu
	Tarek M. Hassan, Professor of Construction Informatics, BSc., MSc., PhD, MASCE, FCIOB, School of Architecture, Building and Civil Engineering, Loughborough University, United Kingdom, T.Hassan@Lboro.ac.uk
summary:	Construction site safety demands proactive hazard detection, a challenge traditionally met with reactive measures that are often inadequate. This paper introduces a novel deep learning-based computer vision model designed for automated safety compliance monitoring, addressing critical limitations of existing approaches. The model utilizes a modified one-stage object detection algorithm, uniquely enhanced with Contextual Transformer Networks (CoTs), a Triplet Attention module, Activate or Not (ACON) activation functions, and Content-Aware Reassembly of Features (CARAFE) up-sampling, to significantly improve feature extraction, visual recognition, and contextual understanding in complex construction environments. To support this model development, a new OSHA-data-driven dataset of 55,594 images across 28 safety categories was developed. This dataset encompasses personal protective equipment (PPE), scaffolding, materials, hazards, and worker actions, ensuring comprehensive coverage of key safety domains. The Wise-Intersection over Union (IoU) loss function further refines bounding box regression, enhancing localization accuracy. Evaluations on both a benchmarking dataset and the newly developed dataset demonstrate the model's benchmark-surpassing performance (Precision: 0.89, mAP95: 0.45). This research offers a practically viable, data-driven solution for a critical industry challenge, moving towards a future of zero-accident construction sites.
keywords:	Construction Safety Management, Artificial Intelligence, Automated Hazard Detection, Object Detection, Computer Vision
full text:	(PDF file, 1.196 MB)
citation:	Mohy A A, Bassioni H A, Elgendi E O, Hassan T M (2025). Deep Learning Enabled Computer Vision Model for Automated Safety Compliance in Construction Environments, ITcon Vol. 30, pg. 1398-1430, https://doi.org/10.36680/j.itcon.2025.057