Predictive Maintenance
Preventing Equipment Failure Before It Happens
The $250,000 Breakdown
It is 2 AM at a paper mill in Wisconsin. The main drive motor on the forming section seizes. Production stops across all four machines. The maintenance team is called in, but the replacement motor winding needs to come from a specialist shop in Ohio — three days out. Total cost: lost production ($150,000), emergency procurement ($40,000 at rush pricing), overtime labour ($25,000), plus a damaged gearbox that would not have failed if the motor had been serviced on time ($35,000). One bearing worth $800 caused $250,000 in damage.
This scenario plays out thousands of times daily across North American and European factories. The average US manufacturer loses 5-7% of annual revenue to unplanned downtime. For a plant doing $50 million in revenue, that is $2.5-3.5 million disappearing every year — not because machines are bad, but because failures are not predicted.
Predictive maintenance changes this equation entirely. Instead of waiting for failure (reactive) or replacing parts on a fixed schedule regardless of condition (preventive), AI monitors actual machine health and tells you exactly when intervention is needed — not too early (wasting parts and labour) and not too late (causing cascading damage).
Three Generations of Maintenance
Reactive Maintenance (Run to Failure)
The oldest approach: run the machine until it breaks, then fix it. Still used in 40-50% of small-to-mid-size manufacturers. Advantages: zero monitoring cost, no planning needed. Disadvantages: catastrophic failures, secondary damage, emergency procurement costs, safety risks (OSHA recordable incidents increase 3x with reactive maintenance), and unpredictable production schedules.
Preventive Maintenance (Time-Based)
Replace parts on a fixed schedule — change oil every 500 hours, replace bearings every 18 months, overhaul pumps annually. Better than reactive, but wasteful. Studies show that 30% of time-based maintenance is performed too early (parts still had life remaining) and 10% too late (damage already occurring between scheduled checks). Most large US and EU plants use this as their baseline.
Predictive Maintenance (Condition-Based)
Monitor actual machine condition in real time using sensors. Replace parts only when data shows they need replacement. AI models learn what "normal" looks like for each machine and alert when behaviour changes. Result: 25-30% reduction in maintenance costs, 35-45% reduction in unplanned downtime, and 20-25% increase in machine lifespan.
Reliability-Centered Maintenance (RCM)
RCM is the framework that determines which maintenance strategy to use for each asset. Developed originally for aviation (United Airlines and the FAA in the 1970s), it is now standard practice in process industries.
The P-F Curve
The P-F curve is the foundation of predictive maintenance. "P" is the point where a potential failure becomes detectable. "F" is the point of functional failure. The distance between P and F — the P-F interval — determines how much warning you get.
| Failure Mode | Detection Method | Typical P-F Interval |
|---|---|---|
| Bearing wear | Vibration analysis | 1-9 months |
| Insulation breakdown | Thermography / partial discharge | 1-6 months |
| Seal degradation | Pressure / leak detection | 1-3 months |
| Corrosion | Ultrasonic thickness | 6-24 months |
| Belt wear | Visual / vibration | 1-4 weeks |
The key insight: your monitoring interval must be less than half the P-F interval. If a bearing has a 6-month P-F interval, you need to check at least every 3 months — or better, continuously with online sensors.
How Sensors Feed AI Models
Every machine tells you it is failing before it actually fails. The signals are just too subtle for human senses to detect early enough.
The Data Chain
Sensor Types and What They Detect
| Sensor | Measures | Detects |
|---|---|---|
| Vibration (accelerometer) | Machine oscillation in 3 axes | Imbalance, misalignment, bearing wear, looseness, gear mesh issues |
| Temperature (thermocouple/RTD) | Surface or internal temperature | Overheating, lubrication failure, electrical faults, cooling system issues |
| Current (CT sensor) | Motor current draw | Mechanical load changes, winding degradation, phase imbalance |
| Pressure (transducer) | Hydraulic/pneumatic pressure | Seal leaks, pump wear, valve degradation, blockages |
| Ultrasonic (acoustic) | High-frequency sound | Early bearing defects, steam/air leaks, electrical discharge |
| Oil analysis (particle counter) | Metal particles in lubricant | Gear wear, bearing degradation, contamination |
Open data/sensor-readings.csv in the code panel. You will find 30 days of real vibration and temperature data from an industrial motor. Notice how vibration gradually increases from Day 1 to Day 25 — the bearing was degrading. A technician checking weekly would have missed the trend. The AI model detected it on Day 12 and recommended bearing replacement by Day 20.
The Cost of Unplanned Downtime
| Industry | Typical Hourly Cost | Common Failure |
|---|---|---|
| Automotive (OEM) | $50,000-100,000/hour | Robotic cell failure, press line breakdown |
| Oil & Gas | $100,000-250,000/hour | Compressor trip, pump seal failure |
| Pulp & Paper | $20,000-50,000/hour | Drive motor, refiner plates, dryer bearings |
| Food & Beverage | $10,000-30,000/hour | Conveyor failure, refrigeration, packaging line |
| Power Generation | $50,000-500,000/hour | Turbine trip, boiler tube leak, transformer failure |
These numbers do not include secondary costs: quality rejections from unstable restart, customer penalties for delayed delivery, OSHA incidents during emergency repairs, or EPA violations from uncontrolled releases.
CMMS Integration: Where AI Meets Your Workflow
Predictive maintenance only works if it connects to how your team already operates. That means integration with your CMMS (Computerized Maintenance Management System).
Leading CMMS Platforms
The goal: AI-generated alerts automatically create work orders in your CMMS with the right priority, parts list, and estimated labour hours. No manual data entry, no alerts that get lost in email.
Open data/failure-history.json to explore a failure history log from a manufacturing plant. It contains 50 failure events over 24 months — categorized by machine, failure mode, downtime hours, and cost. Notice how 80% of the total downtime cost comes from just 6 machines — these are your priority targets for predictive maintenance.
Building Your Business Case
When proposing predictive maintenance to leadership, focus on three numbers:
If your annual downtime cost is $2 million and you can reduce it by 35%, that is $700,000 saved per year against an investment of $150,000. That is a 4.7x return in year one, improving each year as the AI model learns your machines better.
Key Takeaways
This is chapter 2 of AI for Engineers (Global).
Get the full hands-on course — free during early access. Build the complete system. Your projects become your portfolio.
View course details