This article outlines a serverless, event-driven architecture for a computer vision and generative AI-based workplace safety monitoring system. It details how to scale the solution to hundreds of sites and thousands of cameras, focusing on image collection, anonymization, training pipelines, and inference. Key system design considerations include data privacy, security, and continuous model improvement through feedback loops.
Read original on AWS Architecture BlogThe article presents a robust, scalable architecture for automated workplace safety monitoring using computer vision and generative AI. This system addresses the limitations of manual safety audits by providing continuous, real-time oversight of PPE compliance and zone-based hazard detection across numerous facilities. It emphasizes a serverless, event-driven approach designed for efficiency and high scalability.
The system's workflow begins with image collection from site cameras, stored temporarily in a restricted Amazon S3 bucket for anonymization. These anonymized images are then replicated to S3 buckets across different accounts for training, inference, and the web application. This multi-stage data handling is critical for both privacy and operational efficiency.
Each safety use case has its own inference pipeline. Once an anonymized image lands in its dedicated S3 bucket, it triggers an Amazon SNS notification, initiating the hazard detection process. This continuous monitoring acts as a digital safety supervisor, distinguishing normal workflows from potential hazards.
Scalability and Maintainability
The design leverages serverless services for automatic scaling and manages model updates through a well-defined CI/CD process. Decoupling the model training and deployment cycles ensures that data scientists can iterate on models without directly impacting the application code, and engineers can manage infrastructure changes independently.