Vision Maintenance AI Agent

Overview Built for the Google Gemini AI Challenge 2026, this multimodal AI agent enables maintenance technicians to diagnose equipment issues by pointing a camera at a device and describing the problem through natural conversation. Problem Maintenance technicians often spend significant time diagnosing equipment issues — cross-referencing manuals, searching past incident reports, and consulting senior colleagues. This slows down repair times and increases downtime costs. Solution An AI agent that combines: ...

Mar 1, 2026 · 1 min · Avishek Saha

Building a Vision Maintenance AI Agent with Google Gemini

As part of the Google Gemini AI Challenge 2026, I built a multimodal AI agent that helps maintenance technicians diagnose equipment issues. Here’s the story of what I built, why, and the key technical decisions along the way. The Problem Maintenance technicians at transit companies spend significant time diagnosing equipment failures. They flip through thick manuals, search through past incident reports, and call senior colleagues for advice. All while the equipment sits broken and operations are impacted. ...

Mar 7, 2026 · 3 min · Avishek Saha