Menu
Bereinigung eines Wartungsrückstands von über 200 Arbeitsaufträgen: Eine 30-tägige Methode

Bereinigung eines Wartungsrückstands von über 200 Arbeitsaufträgen: Eine 30-tägige Methode

Bei den meisten Backlogs mit mehr als 200 Arbeitsaufträgen bestehen 25–40 % aus Phantom- und Duplikateinträgen. Eine 30-tägige Triage-Methode, um ein echtes, funktionierendes Backlog zu erreichen, bei dem jede Zeile genau einen Verantwortlichen und eine Priorität hat.
Bereinigung eines Wartungsrückstands von über 200 Arbeitsaufträgen: Eine 30-tägige Methode
Cleaning up a 200+ work-order maintenance backlog: a 30-day method

Key takeaways

  • A 200+ work-order backlog is almost never a capacity problem. It is a classification problem: the backlog contains real work, dead work, duplicate work, and work that should never have been a work order in the first place, and nobody can tell them apart at a glance.
  • A 30-day cleanup is a triage exercise, not a sprint. The point is not to close 200 work orders; it is to leave the backlog with one status per row, one priority per row, and a real owner per row, so the next 30 days produce actual fixes instead of more triage.
  • The biggest delete category in most plants is "ghost work orders": items opened, partially worked, and abandoned because the underlying asset changed, was decommissioned, or was already fixed by someone else. These are usually the single largest delete category in an aged backlog.
  • After cleanup, the right backlog size for a 100-asset plant is usually 30-60 open items, not zero. Zero means the system is not capturing the real demand; 200+ means the system is capturing demand but not draining it.

Why backlogs explode

Maintenance backlogs grow for boring reasons. A work order opens for "vibration on Line 3 motor", the operator reports it, the technician swaps a bearing, the symptom is gone, but the original work order is never closed because closing it requires opening the CMMS, finding the right row, and clicking three things. A week later another operator reports the same symptom and opens a new work order. Now there are two. Multiply by 200 assets and 18 months and the backlog is unrecognisable.

The second source is duplicates from different reporters. Production opens "Line 3 packer slow." Maintenance opens "Line 3 packer bearing." Quality opens "Line 3 packer rejects up." All three are about the same asset, possibly the same root cause, but they sit as three separate rows. Without a deduplication pass, the backlog inflates without reflecting more actual work.

The third source is the absence of a clear close criterion. A work order is "in progress" because someone is working on it; it stays "in progress" indefinitely because nobody defined what "done" means for that work order type. The article on work order management systems covers the lifecycle states this method depends on.

Days 1-10: triage every open row

Step 1: Pull the whole backlog into one view

The whole backlog gets exported to a spreadsheet, sorted by age. Rows older than 90 days at the top. This is the only step where the CMMS UI is not the primary tool, sorting and bulk-editing in a spreadsheet is faster for triage, and the results get re-imported.

Step 2: Classify each row into one of five buckets

For every row, one of:

  • Real, work that still needs to happen.
  • Ghost, work that was already done, or that became obsolete because the asset changed or was decommissioned. Ghosts are usually the single largest delete category in an aged backlog.
  • Duplicate, the same issue as another open row. Merge.
  • Wrong type, should never have been a work order. Belongs in a PM schedule, a parts request, or an operations request. Move it.
  • Unclear, needs a floor walk before classification.

Most of the 30-day work happens here. A two-person triage team, one maintenance lead, one production representative, can move through 300 rows in a week if they are not also closing the work orders themselves. Resist the urge to merge triage with execution; they require different brains.

Step 3: Close ghosts and merge duplicates en masse

Ghosts get closed with a "post-hoc verified, already done" reason code. Duplicates get merged into the oldest row. Wrong-type rows get routed to the right system (PM schedule for recurring work, parts request system for parts-only items). At the end of day 10 the backlog should be 40-60% smaller than the starting count, before any actual fix has been done.

Days 11-20: prioritise what survives

Step 4: Assign a real priority to every remaining row

Most CMMS priorities are wrong. They were set by the original reporter, who had context only for that one work order. Now that the backlog is half its original size, the maintenance manager can re-rank the survivors against each other:

  • P1, safety / regulatory / production-stopping, same-week fix.
  • P2, significant OEE impact, recurring, within the month.
  • P3, minor / cosmetic / single-occurrence, backlog for available capacity.

A clean rule of thumb: if the plant has 100 assets, P1 should be under 10 rows. If P1 has 40 rows, "P1" no longer means anything and the whole priority system has collapsed. Re-rank until P1 fits the rule.

Step 5: Pair each row with an owner

Every surviving row gets a named owner, one person, not a department. Without this, P1 work sits because everyone assumes someone else has it. This is the second-highest leverage move in the method. See the framing in our piece on manufacturing KPIs for how ownership ties into trend metrics.

Days 21-30: drain P1 and lock the gains

Step 6: Drain P1 with a single named executor each day

For the last 10 days, every working day has one named technician responsible for closing two P1 rows. Not "the team will close", one person, two specific row IDs. This produces 20 closed P1s in 10 days, which is usually most of the surviving P1 stack.

Step 7: Install the standing rules that prevent re-explosion

This is what makes the cleanup last. Three standing rules to lock in:

  • Close-criterion per type, every work order type has a written definition of "done." No type without one.
  • Auto-close after verification, when a technician marks complete, the supervisor verifies within 24 hours and the row auto-closes. No more "in progress" forever.
  • Duplicate detection on open, when a new work order opens on an asset, the system surfaces any open rows on the same asset and asks the reporter to confirm or merge. The root cause analysis connection comes naturally here, because the duplicates point at the same underlying cause.

Step 8: Set a target backlog size and watch it weekly

For a typical 100-asset mid-market plant, a healthy backlog is 30-60 open rows. Bigger than that and demand is overwhelming capacity (or duplicates are creeping back). Smaller than that and the system is no longer capturing real demand. Watch the count weekly; the trend matters more than the absolute number. Pair the backlog count with the broader preventive maintenance schedule so the team knows what proportion of effort is reactive vs planned.

What the math usually looks like

A plant that starts at 220 open work orders typically lands at:

  • 40 ghosts closed (post-hoc verified)
  • 25 duplicates merged
  • 15 wrong-type rows moved to PM schedule or parts requests
  • 10 P1 rows drained in days 21-30
  • ~130 P2/P3 rows surviving as the new working backlog

The "real" backlog turns out to be much smaller than the open-row count suggested. The plant has not added capacity. It has just stopped pretending that 220 rows reflected 220 separate units of work.

How Fabrico fits

The triage steps above are tool-agnostic, they work in any CMMS, including a spreadsheet. What changes when the CMMS is built on a unified OEE+CMMS foundation is the prevention side: cluster threshold rules and OEE-event linking (covered in the article on work order management systems) catch the recurring causes that produce duplicates in the first place. Fabrico is built for that workflow. To see what cleanup looks like against your live data, book a demo.

Frequently asked questions

What if our backlog is over 500 rows?

Scale the triage team. The method is the same, but the day-1-to-10 phase needs more bodies. A 500-row backlog with two triagers takes ~3 weeks of triage; with four, it fits in 10 days. Do not let the method stretch to 60+ days, the freshness of context decays.

Are ghost work orders really the biggest delete category?

Yes, in plants that have not done a cleanup in 18+ months, ghosts are usually the single largest delete bucket. The longer the gap since the last cleanup, the higher the ghost share. Plants on a quarterly cleanup cadence keep it low.

What about open PM tasks, do those count toward the backlog?

Open PM tasks are a separate population. They have their own due dates and their own success measure (on-time completion rate). Mixing them into the reactive backlog hides the picture of both. Keep them separate even if they live in the same CMMS.

Should we publish backlog counts to the broader team?

Yes, with care. The number can drop dramatically in the first 30 days and look like a miracle. Then it stabilises in the 30-60 range and looks like nothing is happening. Pair the count with the trend of P1 rows specifically, which is the more meaningful signal.

What is the most common backslide cause?

Letting duplicate detection lapse. If the reporter is not prompted to confirm or merge on every new open, duplicates reappear within a quarter and the backlog re-inflates. The duplicate-detection rule is the one with the lowest engineering cost and the highest payoff over time.

Das Neueste aus unserem Blog

Definieren Sie Ihren Zuverlässigkeitsfahrplan
Überzeugen Sie sich selbst!
Definieren Sie Ihren Zuverlässigkeitsfahrplan
Indem Sie auf die Schaltfläche „Akzeptieren“ klicken, erklären Sie sich mit der Nutzung einverstanden.Cookies beim Zugriff auf diese Website und bei der Nutzung unserer Dienste. Erfahren Sie mehrWeitere Informationen zur Verwendung und Verwaltung von Cookies finden Sie in unserem Datenschutzrichtlinie und Cookie-Erklärung