If you only track MTTR, you optimize the wrong thing. The 35-90 minute repair time is rarely the lever. The 2-3 hours of waiting around it is.
Three places the latency hides and the fix for each:
Failure-to-alarm gap. Operators discover the failure, walk to the terminal, log the event, supervisor calls maintenance. Easy 20 minutes lost.
- Fix: automated capture via PLC or Computer Vision. Stop is detected and routed to the right technician in under 90 seconds.
Spare-parts wait. The technician shows up, looks at the machine, walks to stores, finds the part missing, orders it.
- Fix: failure-mode-driven spare parts policy tied to MTBF data. Top 10% of EU plants have 95%+ on-shelf availability for critical-path parts.
Approval-to-restart gap. Repair is done. QA needs to check. Supervisor needs to sign. Line needs warm-up. 20-30 minutes more.
- Fix: pre-defined restart playbooks with QA criteria built into the CMMS. The technician hits a button and the approval flow runs in parallel with warm-up.
See the pit stop strategy for how preparation eliminates wait time.