Building a Vision Maintenance AI Agent with Google Gemini
As part of the Google Gemini AI Challenge 2026, I built a multimodal AI agent that helps maintenance technicians diagnose equipment issues. Here’s the story of what I built, why, and the key technical decisions along the way. The Problem Maintenance technicians at transit companies spend significant time diagnosing equipment failures. They flip through thick manuals, search through past incident reports, and call senior colleagues for advice. All while the equipment sits broken and operations are impacted. ...