Statistical and Probabilistic Analysis-Based RCM-Oriented Cost Optimization Methodology for Oil and Gas Component Assets ()
1. Introduction
Offshore oil and gas facilities operate in highly demanding environments where equipment reliability, operational safety, and production availability are critical to achieving sustainable and cost-effective operations. Failures of critical assets can result in significant production losses, increased maintenance expenditure, environmental risks, and safety concerns. Consequently, maintenance management has evolved from traditional corrective approaches toward reliability-based methodologies that support proactive asset management and lifecycle optimization. Reliability-Centered Maintenance (RCM) has become one of the most widely adopted maintenance frameworks since its development within the aerospace industry during the 1960s. The primary objective of RCM is to preserve the functional performance of assets by systematically identifying asset functions, functional failures, failure modes, and their consequences. Based on this analysis, appropriate maintenance strategies are selected to ensure reliability, safety, and operational effectiveness while minimizing maintenance costs and equipment downtime. Typical maintenance actions identified through RCM include corrective maintenance, preventive maintenance, condition-based maintenance, predictive maintenance, run-to-failure policies, and equipment redesign measures [1].
A conventional RCM study generally begins with the establishment of an equipment hierarchy and the definition of the asset operating context. Functional analysis is subsequently performed to identify the intended functions of the asset and potential functional failures. Critical assets are then prioritized through asset criticality assessment, followed by Failure Modes and Effects Analysis (FMEA), which evaluates failure mechanisms, causes, effects, and consequences at both local and system levels. The outputs of FMEA are further assessed using RCM decision logic to determine the most suitable maintenance tasks and resource allocation requirements. Although this structured methodology has demonstrated considerable success across multiple industries, its implementation remains largely qualitative and highly dependent on expert judgement and historical experience. The increasing complexity of modern industrial systems and the emergence of Industry 4.0 technologies have created a need for more quantitative and data-driven maintenance decision-making approaches. Contemporary maintenance practices increasingly incorporate predictive analytics, continuous condition monitoring, digitalization, and reliability-based optimization techniques to improve equipment performance and maintenance effectiveness. Despite these advancements, many RCM implementations continue to rely on qualitative assessments and logic-tree decisions without fully exploiting available reliability and failure data. This limitation often reduces the ability of maintenance engineers to quantify uncertainty, predict future failures, and optimize maintenance intervals based on economic considerations [2].
Several studies have demonstrated the benefits of integrating reliability analysis techniques, such as Weibull probability modeling, with maintenance planning and replacement optimization. Weibull analysis provides valuable insights into component failure behavior by characterizing failure distributions through shape and scale parameters, thereby enabling the estimation of reliability, failure probability, and remaining useful life. Similarly, age replacement and block replacement optimization models have been widely employed to identify economically optimal maintenance intervals by balancing preventive maintenance costs against the consequences of unexpected failures. However, limited research has integrated equipment criticality assessment, probabilistic reliability modeling, and maintenance cost optimization within a unified Reliability-Centered Maintenance framework for offshore oil and gas applications. To address this gap, this study proposes an integrated Reliability-Centered Maintenance optimization framework that combines Pareto-based equipment criticality screening, Weibull reliability analysis, and quantitative maintenance cost optimization. The proposed methodology extends the conventional RCM process by incorporating statistical reliability assessment and economic decision analysis to support data-driven maintenance planning. Reliability parameters are estimated using historical failure information obtained from the Offshore Reliability Data (OREDA) database, while optimal maintenance intervals are determined using age replacement and block replacement cost models [3].
The proposed framework (Figure 1) is demonstrated through a case study involving centrifugal compressor systems used in offshore oil and gas facilities. Critical valve failure modes are analyzed to evaluate failure behavior, reliability characteristics, and optimal replacement strategies. The study aims to provide maintenance engineers and asset managers with a practical decision-support methodology capable of improving reliability, reducing unplanned downtime, and minimizing lifecycle maintenance costs. The key contributions of this research are as follows:
1) Development of an integrated RCM framework incorporating probabilistic reliability analysis and maintenance cost optimization.
2) Application of Pareto-based equipment screening to identify critical offshore oil and gas assets.
3) Integration of Weibull reliability modeling for failure behavior characterization and reliability prediction.
4) Evaluation of age replacement and block replacement policies for determining economically optimal maintenance intervals.
Figure 1. Flow chart of reliability-centered maintenance (RCM).
5) Demonstration of a quantitative decision-support methodology for enhancing maintenance planning and asset management in offshore oil and gas facilities.
2. RCM Optimization Framework
The proposed Reliability-Centered Maintenance (RCM) optimization framework integrates equipment criticality assessment, probabilistic reliability analysis, and maintenance cost optimization to support quantitative maintenance decision-making for offshore oil and gas assets. The framework was developed in response to limitations identified in conventional RCM implementations, which are often characterized by extensive reliance on expert judgement, qualitative assessments, and time-intensive analysis procedures. To evaluate current industrial practices, a structured survey was conducted involving 42 professionals, including asset managers, maintenance engineers, reliability engineers, operations managers, and maintenance decision-makers from offshore oil and gas facilities across the Middle East and Asia. The survey consisted of 15 structured questions addressing maintenance strategy selection, reliability assessment practices, and challenges associated with RCM implementation. Approximately 76% of respondents identified conventional RCM as highly dependent on expert judgement and time-consuming to implement, highlighting the need for more quantitative and data-driven maintenance methodologies [4].
The proposed framework follows the equipment taxonomy defined in ISO 14224, which provides a standardized structure for the collection, classification, and exchange of reliability and maintenance data. Equipment is systematically decomposed into hierarchical levels consisting of industry, installation, system, equipment unit, subunit, component, and maintainable item. This hierarchical structure facilitates the identification of critical assets and supports reliability analysis at different levels of the breakdown structure. The first stage of the framework involves equipment criticality assessment through Pareto analysis. Historical failure records are evaluated to determine the contribution of each equipment category to overall system failures. The Pareto principle, commonly referred to as the 80/20 rule, states that a relatively small proportion of assets is often responsible for most operational failures. In the context of reliability analysis, approximately 80% of failure impacts may originate from 20% of the equipment population. Consequently, Pareto analysis enables the identification and prioritization of critical assets that warrant detailed Reliability-Centered Maintenance investigation and optimization. Following criticality screening, Failure Modes and Effects Analysis (FMEA) is conducted to identify critical failure modes and evaluate their operational consequences. The FMEA process examines failure mechanisms, causes, effects, and associated risks to establish maintenance priorities and determine the most critical maintainable components for further analysis [5].
The identified critical failure modes are subsequently subjected to probabilistic reliability assessment using Weibull distribution modeling. Weibull analysis is widely employed in reliability engineering because of its flexibility in representing different failure behaviors, including infant mortality failures, random failures, and wear-out failures. The cumulative distribution function (CDF) of the two-parameter Weibull distribution is given by:
.
The Weibull shape parameter provides valuable information regarding the underlying failure mechanism. The value of (β < 1) indicates early-life or infant mortality failures typically associated with manufacturing defects, installation errors, or commissioning issues. The value of (β = 1) represents random failures with a constant failure rate, while (β > 1) indicates age-related degradation and wear-out failures, where failure probability increases with operating time. The scale parameter (\eta) corresponds to the characteristic life of the component and represents the operating time at which approximately 63.2% of the population is expected to have failed.
The Weibull parameters were estimated using Maximum Likelihood Estimation (MLE) based on failure frequency and operational exposure data extracted from the Offshore Reliability Data (OREDA) database. Reliability indicators including Mean Time to Failure (MTTF), Mean Time Between Failures (MTBF), and Mean Time to Repair (MTTR) were subsequently calculated to quantify asset performance and failure behavior. The Weibull scale parameter expressed in years was converted into operating hours using an annual operational exposure of 8760 hours per year prior to maintenance optimization analysis.
For failure modes exhibiting wear-out behavior (β > 1), preventive replacement policies were evaluated using maintenance cost optimization techniques. The objective of the optimization process is to determine the replacement interval that minimizes the expected maintenance cost per unit operating time. Two preventive replacement strategies were investigated:
1) Age Replacement Policy—The component is replaced either upon failure or when it reaches a predetermined age (T), whichever occurs first.
2) Block Replacement Policy—The component is replaced at fixed intervals (T) regardless of its condition or previous failure history.
The expected cost rate for each strategy is calculated by considering the probability of failure, planned replacement costs, corrective maintenance costs, and operational consequences associated with unexpected failures. The replacement interval corresponding to the minimum expected cost rate is selected as the economically optimal maintenance interval [6].
The reliability and maintenance data employed in this study were obtained from the OREDA Offshore Reliability Database and organized according to ISO 14224 guidelines. The OREDA database provides industry-recognized reliability statistics for offshore oil and gas equipment, including failure frequencies, operational exposure data, and maintenance records. The integration of equipment criticality assessment, Weibull reliability modeling, and maintenance cost optimization within a unified RCM framework enables a more objective, quantitative, and economically justified maintenance decision-making process for offshore oil and gas assets [7].
3. Quantitative Demonstration and Case Study Selection
A Pareto analysis was performed using annual mean failure rate data obtained from the Offshore Reliability Data (OREDA) database to identify critical equipment within typical offshore oil and gas installations. The analysis, presented in Figure 2, follows the Pareto principle, which states that a relatively small proportion of equipment is responsible for the majority of operational failures and associated maintenance impacts. By ranking equipment according to failure frequency, the analysis facilitates the prioritization of assets that contribute most significantly to reliability degradation and maintenance costs. The results indicate that centrifugal gas compressors are among the most critical equipment categories, ranking within the highest failure-contributing assets identified during the Pareto assessment. Due to their operational importance and relatively high contribution to overall equipment failures, centrifugal compressors were selected as the representative case study for demonstrating the proposed Reliability-Centered Maintenance (RCM) optimization framework. Centrifugal compressors play a vital role in offshore oil and gas production systems by increasing the pressure of processing gas for transportation, reinjection, and downstream processing operations. During operation, gas enters the compressor through the suction inlet and is directed toward a rotating impeller. The impeller imparts kinetic energy to the gas through high-speed rotation driven by the compressor shaft. Subsequently, the diffuser converts the kinetic energy into pressure energy, resulting in compressed gas suitable for pipeline transmission and process applications. Because
![]()
Figure 2. Failure Frequency analysis of oil and gas components.
compressor availability directly influences production continuity, failures within compressor subsystems can lead to significant operational disruptions and economic losses [8].
Following equipment selection, an equipment hierarchy was established in accordance with ISO 14224 standards to systematically decompose the compressor into subunits, components, and maintainable items. Figure 3 illustrates the equipment hierarchy developed for the compressor system, highlighting valve assemblies as maintainable components within the asset structure.
Further analysis revealed that valve-related failures represented a significant proportion of compressor maintenance events. Valve components were therefore identified as critical maintainable items due to their relatively high failure frequency and their direct impact on process isolation, flow regulation, pressure control, and overall system integrity. Failures of these components can adversely affect operational continuity, equipment reliability, and production performance. Valve failure modes were selected for detailed reliability assessment and maintenance optimization. Weibull reliability modeling was applied to characterize failure behavior and determine the underlying failure mechanisms, while age replacement and block replacement optimization models were employed to identify economically optimal maintenance intervals. This case study demonstrates how the proposed integrated RCM framework can support data-driven maintenance decision-making for critical offshore oil and gas equipment [9].
The equipment boundary of the centrifugal compressor, established in accordance with ISO 14224 guidelines, is illustrated in Figure 4. Defining the equipment boundary is a critical step in Reliability-Centered Maintenance (RCM) studies because it establishes the scope of reliability assessment, maintenance planning,
Figure 3. Equipment hierarchy of oil and gas components.
Figure 4. Equipment boundary of a centrifugal compressor.
and failure analysis. The centrifugal compressor system considered in this study comprises six major subsystems: power transmission, compressor unit, lubrication system, control and monitoring system, shaft seal system, and recycle system.
The power transmission subsystem is responsible for transferring mechanical power from the driver to the compressor rotor while maintaining operating conditions within specified performance limits. The lubrication system supplies clean and temperature-controlled lubricating oil to critical rotating components, including bearings and gears, ensuring adequate lubrication, cooling, and wear protection. The control and monitoring subsystem continuously supervises compressor performance through instrumentation and sensors that measure key operating parameters such as rotational speed, rotor vibration, shaft position, bearing metal temperature, lubricating oil pressure and temperature, as well as suction and discharge pressures and flow rates. The shaft seal system provides containment of process gas and prevents leakage to the surrounding environment through the application of dry gas seals and associated auxiliary equipment. Together, these subsystems ensure safe, reliable, and efficient compressor operation under varying process conditions [10].
Table 1 summarizes the principal subunits and maintainable components located within the defined equipment boundary. The hierarchical decomposition of the compressor system facilitates systematic identification of critical components and supports subsequent reliability and maintenance analyses.
The reliability data utilized in this study was obtained from the Offshore Reliability Data (OREDA) database, 2009 edition, which is widely recognized as one of the most comprehensive reliability data sources for offshore oil and gas facilities. The database contains operational observations, maintenance records, failure
Table 1. Subunits and maintainable items of a centrifugal compressor.
Subunits |
Maintainable Items |
Power Transmission |
Gearbox, variable drive, bearings, belt/sheave, coupling to the driver, coupling to the driven unit, lubrication system, seals |
Compressor |
Casing, rotor with impellers, balance piston, inter-stage seals, radial bearings, thrust bearing, shaft seals, internal piping, valves, anti-surge system, spool, cylinder line piping |
Control and Monitoring |
Actuating devices, control unit, cables and junction boxes, internal power supply, monitoring instruments, sensors, valves, wiring, piping, seals |
Lubrication System |
Oil tank with heating system, pump, motor, check valves, coolers, filters, piping, valves, oil |
Shaft Seal System |
Oil tank with heating, reservoir, pump, motor, gear, filters, valves, seal oil system, dry gas seal, mechanical seal, scrubber |
Miscellaneous |
Base frame, piping, pipe supports and bellows, control valves, isolation valves, check valves, coolers, silencers, purge air system, magnetic bearing control system, flange joints |
frequencies, and equipment reliability statistics collected from offshore production installations over multiple years of operation. The use of OREDA data ensures consistency, standardization, and industrial relevance in reliability assessment and maintenance optimization studies.
For the present investigation, reliability records associated with offshore top side compression systems were extracted and analyzed. Attention was given to centrifugal compressor equipment due to its critical role in gas processing and transportation operations. Historical failure statistics relating to valve components were selected for detailed analysis because valves exhibited relatively higher failure occurrence rates and have a direct impact on process control, pressure regulation, isolation functions, and operational continuity. The Weibull reliability demonstration focused on three representative valve failure modes commonly reported in offshore operations: valve leakage, failure to open, and failure to close. These failure modes were selected because they represent different degradation mechanisms and operational consequences, thereby providing a suitable basis for evaluating failure behavior, reliability characteristics, and maintenance optimization strategies. The extracted failure data were subsequently used to estimate Weibull distribution parameters, evaluate failure trends, and determine optimal preventive replacement intervals through age replacement and block replacement cost models.
Weibull analysis is performed for certain failure modes of valves, as seen in Figure 5. One potential cause of valve failure to initiate may stem from early design or installation deficiencies, characterized by a shape parameter of less than one. A valve may experience failure owing to random degradation during its operational lifespan, with the form parameter being one factor. A valve may leak owing to age-related variables, including wear and tear, when the form parameter exceeds one. The parameters, β and η, are derived from the instances of failure causes and the average failure rate as specified in OREDA. To ascertain cost optimization for the ideal replacement period, β must exceed 1, and the expense of unplanned failure must surpass that of planned replacement, as seen in Table 2. The expenses associated with failures and replacements are presented solely for illustrative purposes. The expense per unit of time is illustrated in Figure 6.
The Weibull shape parameter (β) and scale parameter (η) were estimated using historical failure frequency and operational exposure data extracted from OREDA records. The parameters were determined using maximum likelihood estimation (MLE) techniques. The reliability indicators used in the analysis included Mean
Figure 5. Weibull analysis for each failure mode for a valve: (a) Early or infant-mortality failures; (b) Random failures; (c) Age-related failures.
Table 2. Parameters for optimization of age-related failure.
Parameter |
Value |
β |
4 |
η |
20.7 years |
Cost of unscheduled failure |
$6350 |
Cost of planned replacement |
$1270 |
Figure 6. Cost optimization for optimum age and block replacement of valves.
Time to Failure (MTTF), Mean Time Between Failures (MTBF), and Mean Time to Repair (MTTR), which were calculated using standard reliability engineering equations based on cumulative operating hours and observed failure occurrences [11].
Reliability Metrics
The optimal replacement interval for the valves, due to age-related wear failure, is established at 116,000 operating hours, whereas the ideal block replacement interval is set at 107,000 hours. If a failure occurs prior to 116,000 hours in the age replacement scenario, the valves must be replaced and subsequently replaced again only when a further 116,000 hours have elapsed. In a block replacement scenario, if a failure transpires before 107,000 hours, namely before 7000 hours, the valve must be replaced and subsequently replaced again at 107,000 hours. Age replacement is optimally applied to costly components, while block replacements are conducted at regular intervals primarily for less expensive components.
Age Replacement Cost Rate
Block Replacement Cost Rate
where:
= age replacement cost rate;
= block replacement cost rate;
= planned replacement cost;
= failure replacement cost;
= reliability function;
= cumulative failure function;
= replacement interval.
4. Conclusion
This study presented an integrated Reliability-Centered Maintenance (RCM) optimization framework that combines equipment criticality assessment, probabilistic reliability modeling, and maintenance cost optimization to enhance maintenance decision-making for offshore oil and gas assets. While conventional RCM methodologies provide a structured approach for identifying maintenance requirements, they remain largely qualitative and highly dependent on expert judgement. The proposed framework addresses these limitations by incorporating quantitative reliability analysis and economic optimization within the traditional RCM process. The methodology was demonstrated using a centrifugal compressor case study based on reliability data obtained from the OREDA database and equipment classification principles defined in ISO 14224. Pareto analysis was employed to identify critical equipment and maintainable items, while Weibull reliability modeling was used to characterize failure behavior and estimate reliability parameters for selected valve failure modes. Maintenance optimization was subsequently performed using age replacement and block replacement cost models to determine economically optimal preventive maintenance intervals. The results demonstrated that the selected valve failure mode exhibited age-related degradation characteristics and that preventive replacement policies could significantly improve maintenance effectiveness. The optimal age replacement interval and block replacement interval were estimated to be approximately 116,000 and 107,000 operating hours, respectively, under the assumed reliability and maintenance cost conditions. These findings illustrate the potential of integrating reliability analytics and economic evaluation into RCM decision-making to support more effective maintenance planning and lifecycle cost management.