
Key takeaways
Short answer: A data historian and a relational database are two ways to store data, optimized for different kinds. A data historian (or process historian) is software purpose-built to capture, compress, and store high-frequency time-series data from industrial processes — thousands of tag readings per second from sensors and control systems — with fast time-based retrieval. A relational database is general-purpose software optimized for structured, transactional data organized in tables, with SQL queries, relationships, and ACID transactions. Historians excel at time-stamped sensor streams; relational databases excel at structured records like orders, work orders, and inventory. They are complementary, and most plants run both.
A data historian — also called a process or operational historian — is software purpose-built to capture, store, and retrieve high-frequency time-series data from industrial processes. Its job is to log the continuous stream of time-stamped readings coming off sensors, instruments, and control systems (PLCs, SCADA, DCS) — temperatures, pressures, flows, speeds, statuses — often thousands of "tags" sampled many times per second, around the clock, for years. To handle that volume, historians use specialized time-series storage with aggressive compression (such as deadband and swinging-door algorithms that store only meaningful changes), and they are optimized for fast retrieval by time range — "show me this tag's values over the last shift" returns instantly. Historians are engineered for the realities of plant data: extreme write throughput, long retention, time-based queries, and resilience. They typically integrate directly with the control layer and provide trending, aggregation, and access for analysis. The defining strength of a historian is handling massive, continuous, time-stamped sensor data efficiently — capturing the full fidelity of how a process behaved over time, compactly and retrievably. It is not designed for complex relational queries or transactional record-keeping; it is designed to be the high-speed time-series memory of the plant floor.
A relational database (an RDBMS) is general-purpose software for storing structured data in tables, optimized for transactions, relationships, and complex queries. Data is organized into tables of rows and columns with defined schemas, related to each other by keys, and queried with SQL — allowing powerful filtering, joining, and aggregation across structured records. Relational databases provide ACID transactions (atomicity, consistency, isolation, durability), which guarantee data integrity for operations that must not be partially completed — essential for things like orders, payments, and inventory movements. They excel at structured, transactional, relational data: customer records, work orders, inventory, production records, bills of material — anything that fits neatly into related tables and needs reliable, consistent updates and rich querying. The defining strength of a relational database is managing structured data with integrity and flexible querying: it enforces consistency, models relationships, and answers complex questions across the data. What it is not optimized for is the extreme write throughput and time-based retrieval of high-frequency sensor streams — storing millions of rapid time-series readings in a general-purpose relational table is inefficient compared with a purpose-built historian. The relational database is the workhorse for the plant's structured, transactional records, not its raw sensor firehose.
The core distinction is the kind of data each is built for: a historian is optimized for high-frequency time-series sensor data, a relational database for structured transactional data. This difference in purpose drives everything else. Historians use time-series storage with heavy compression and time-indexed retrieval, tuned for enormous write rates and "value over time" queries; relational databases use tables, schemas, keys, SQL, and ACID transactions, tuned for structured records, relationships, and consistent updates. They answer different questions: a historian answers "what did this sensor read every second last week?" efficiently; a relational database answers "which work orders for this asset are overdue and what parts do they need?" efficiently. Each is poor at the other's job — a relational database bogs down storing and querying millions of rapid sensor readings, and a historian is not built to model and transact structured relational records with integrity. This is why they are complementary rather than competing: the plant generates both kinds of data — a continuous firehose of process measurements (historian) and structured operational records like orders and inventory (relational database) — and each type belongs in the store designed for it. Choosing between them is really a matter of matching the data to the right tool. (Modern time-series databases blur the line, offering historian-like time-series performance with more general query capability.)
Consider the data a single production line generates. Its machines stream high-frequency process data — vibration, temperature, motor current, cycle counts, run/stop status — hundreds of tags sampled many times a second, continuously. That torrent belongs in a data historian: it compresses and stores the full time-series cheaply, and an engineer can instantly pull any tag's trend over any time window to investigate a problem or feed analysis. Meanwhile, the same operation produces structured, transactional records: production orders, the work orders raised to maintain the line, the parts drawn from inventory, the batches produced and their quality dispositions. Those belong in a relational database: they fit into related tables, need consistent transactional updates (an inventory issue must complete fully or not at all), and are queried relationally ("show open work orders for this asset and the parts they consume"). Trying to store the sensor firehose in the relational database would overwhelm it; trying to manage transactional work orders and inventory in the historian would be unworkable. The line needs both — the historian as the high-speed memory of how the process behaved, the relational database as the system of record for structured operational data. Often the two are linked, with summarized historian data joined to the structured records for reporting.
Choose by the nature of the data. Use a data historian for high-frequency, time-stamped process and sensor data — the continuous measurements from machines and control systems where you need to capture full fidelity over time, compress massive volumes, and retrieve by time range for trending and analysis. Use a relational database for structured, transactional, relational data — orders, work orders, inventory, production and quality records, anything that fits into related tables and needs integrity, consistent updates, and complex querying. In nearly every plant the answer is both, deployed where each is strongest: the historian as the time-series store for the process firehose, the relational database as the system of record for structured operational data, often integrated so the two can be analyzed together. A modern wrinkle is the time-series database (and cloud time-series services), which offers historian-like performance for time-series with more general querying and can overlap the historian's role; and at large scale, raw historian data may also feed a data lake for advanced analytics. The decision framework is simple: match the store to the data — time-series sensor streams to a historian (or time-series database), structured transactional records to a relational database — and integrate them rather than forcing one to do the other's job.
Both stores feed OEE, in complementary ways. The high-frequency machine data a historian captures — run/stop states, cycle counts, speeds, fault signals — is the raw material from which OEE's Availability and Performance are computed; the historian is often where the granular process data behind the losses lives, and where you go to investigate why a loss happened. The structured records a relational database holds — work orders, production and quality records, downtime reasons — supply the operational context and the transactional side of OEE, like the maintenance and quality data that explain and cost the losses. A complete OEE picture usually joins both: the time-series proof of what the machine did (historian) with the structured records of orders, maintenance, and reasons (relational database). This connects to where raw versus modelled data lives in the data lake versus warehouse split, and to the control systems (SCADA and DCS) that generate the signals. But as with those, raw stored data is not OEE — a layer must turn the historian's tags and the database's records into the Availability-Performance-Quality loss model that drives improvement.
Fabrico turns the machine data — whether streamed from the control layer or stored in a historian — into a structured OEE loss picture, joining it with the operational records (downtime reasons, work orders, quality) that explain and cost the losses. Rather than leaving raw time-series tags in the historian and transactional records in a separate database, it delivers the Availability, Performance, and Quality breakdown on top of both, so the data becomes a clear view of where productive time is lost. Book a demo to see your process and operational data become actionable OEE.
A data historian is software optimized to capture, compress, and store high-frequency time-series sensor data from industrial processes, with fast time-based retrieval. A relational database is general-purpose, optimized for structured transactional data in tables with SQL and ACID transactions. Historians suit sensor streams; databases suit structured records.
Because high-frequency sensor data (thousands of readings per second, for years) overwhelms general-purpose relational tables and is slow to query by time. Historians use specialized time-series compression and time-indexed retrieval built for that volume and access pattern, which relational databases are not optimized for.
Capturing and storing the continuous, time-stamped readings from sensors and control systems (PLCs, SCADA, DCS) — temperatures, pressures, flows, speeds, statuses — at high frequency, with compression and fast trend retrieval. It is the high-speed time-series memory of the plant, used for trending, troubleshooting, and analysis.
Yes, usually. The historian stores the high-frequency process firehose; the relational database is the system of record for structured transactional data like orders, work orders, and inventory. Each is used where it is strongest, and the two are often integrated so data can be analyzed together.
The historian's high-frequency machine data (run states, counts, speeds) is the raw material for OEE's Availability and Performance, while the database's structured records (work orders, downtime reasons, quality) supply context and cost. A full OEE picture joins both, and a layer turns them into the loss model that drives improvement.