
Key takeaways
Short answer: Plant floor data quality is whether the inputs to OEE, CMMS, and reporting are accurate, timely, complete, and consistent. Dirty data produces wrong analytics and wrong decisions. Five rules: capture at source, automate where possible, validate at entry, reconcile periodically, expose discrepancies. Most plants have dirty data they do not know about because they report metrics without validating inputs. See also Manufacturing Data Quality Audit.
Each item produces dirty input that pollutes downstream analytics.
Three reasons plants do not see the dirty data problem:
The problem persists until something breaks (audit failure, wrong decision, vendor exposes the dirt) — by which time correcting it is expensive.
1. Capture at source. Data should be created where the event happens, not entered later. Operator memory degrades within minutes. PLC and sensor capture is the gold standard; vision systems work for some events; operator entry is acceptable only when nothing else is feasible AND it happens at the moment.
2. Automate where possible. Every manual data entry is a chance for error and lag. Cycle counts from PLC. Downtime from machine state. Quality from in-line QC. Operator entry only for context that machines cannot capture (reason codes, observations).
3. Validate at entry. Required fields, valid ranges, consistency checks. A scrap count of 1,500 on a line that produced 800 units should not be accepted. A downtime of 25 hours on a 24-hour shift should not be accepted.
4. Reconcile periodically. PLC says 600 cycles, operator-entered count says 580. Resolve the gap. Persistent gaps mean the data flow is broken; investigate.
5. Expose discrepancies. Do not hide them in calculations. Show them on the dashboard. Make data quality visible so it gets fixed.
For most plants: automate cycle count capture from PLC. Operator-entered cycle counts are typically 5-15% wrong; PLC counts are typically under 1% wrong. The improvement in OEE Performance accuracy alone justifies the work.
1. End-of-shift batch entry. Operator enters all of the shift's data in the last 10 minutes. Times are estimated; reasons are guessed.
2. Standardized "unknown" reason codes. Operators default to "other" or "unknown" because the right code is hard to find. Pareto becomes meaningless.
3. Copy-paste from previous shift. "Same as yesterday" entries that are not actually true.
4. Round number rounding. Cycle counts ending in 0 or 5 when actual counts are random. Sign of manual entry.
5. Missing data silently filled with zeros. Downtime hours of 0 when nobody logged anything are different from confirmed-zero-downtime shifts.
A modern OEE platform validates at entry, surfaces discrepancies, exposes data quality as a tracked metric, and supports automated capture from PLC, vision, and other instruments.
Fabrico's OEE module ships with automated PLC capture, vision integration, entry-time validation, periodic reconciliation reports, and a data quality dashboard that exposes dirt instead of hiding it.
See how Fabrico captures this automatically — explore OEE for manufacturing or book a demo.
5-15% error rate on cycle counts; reason codes much higher. PLC capture is typically under 1% error.
Not in production decisions. Audit a sample to validate the data flow.
Block at entry where possible. Cleaning later usually means the dirty data is already feeding analytics.
Show them how dirty data hurts them — wrong reason codes mean wrong improvements, which means more of the same problems they are complaining about.
Ongoing. Equipment changes, processes change, operators change. Data quality maintenance is permanent.