Determining Failure-Finding Frequency for Protective Devices: A Monte-Carlo Simulation Approach

Question:

Greetings everyone, I am currently in the process of developing software designed to aid in determining the frequency of failure-finding tasks for protective devices using Monte-Carlo simulation. While I have created a model that appears to be working effectively thus far, it is crucial to validate it with real-world data. I would greatly appreciate input from fellow professionals in this field. I have extensively studied Moubray's writings on this subject and have incorporated much of his terminology into my work. However, I still have questions regarding how to calculate the availability of protective devices and the associated costs per unit of time. Let's consider the following scenario: a pump with a protected function experiences unanticipated failures every 1,000 hours, while the protective device (which is not fail-safe) fails following a Weibull distribution with parameters of Shape = 4 and Scale = 2,000 hours. The time required to repair or replace the protective device is negligible for simplicity's sake in this example. Inspections are conducted every 200 hours, with each inspection costing €100. The estimated cost of failure consequences is €10,000. During simulation iterations, various results are obtained. For instance, the protected function may fail before the protective device, or vice versa. It is essential to analyze these outcomes to understand the interaction between the two components. Calculating the availability of the protective device and evaluating the costs associated with inspections and potential failures are key aspects to consider. By running the simulator for 5,000 iterations, valuable insights have been gained. Without inspections, the probability of multiple failures is 0.1822, with an average availability of the protective device at 93.86%, leading to a mean total cost per iteration of €1,810. On the other hand, with inspections performed every 200 hours, the probability of multiple failures decreases to 0.0162, resulting in an average availability of 98.84% and a mean total cost per iteration of €537. These findings highlight the importance of inspections in reducing costs and enhancing device availability. In conclusion, it is evident that inspections yield significant benefits in this scenario. Continuously evaluating the frequency of inspections to optimize costs and device performance is essential. I welcome any feedback or recommendations for improving this methodology. Additionally, if you are aware of relevant resources or literature on this topic, please feel free to share. Thank you for your insights. Rui

Top Replies

Hello Rui, I found your post quite engaging. I have a few queries regarding the parameters you have utilized. How did you determine the shape factor of 4 for the protective device? Typically, failure parameters are deduced through periodic testing, where a constant hazard rate assumption with a shape factor of 1 is commonly employed. Additionally, for protective devices, it is generally deemed unacceptable to have a test interval exceeding 2-5% of the scale factor as it impacts system availability adversely. Therefore, I am intrigued by the use of a 200-hour test in your simulation. Lastly, when assessing protective devices, it is imperative to consider tolerable risk criteria rather than economic factors. Regards, V. Narayan.

Hello Vee, thank you for your prompt response. As someone with limited experience in inspections, I truly value your insightful comments. Let me address your specific questions: 1. The parameters used in the example were hypothetical. I believed that a protective device failure could be attributed to a tested failure mode under laboratory conditions by the manufacturer. However, I now understand that this scenario may not reflect real-world experiences. I will modify the shape factor to one. 2. The 10% scale parameter for the time between inspections was arbitrarily chosen. I appreciate your suggestion of 2-5% based on practical experience. I will review these figures to ensure the validity of the method and update you on the results. Do you find the achieved figure of 98.84% in my example acceptable? 3. The method illustrated in my example helps determine the risk of multiple failures occurring, as seen with a probability of 0.0162. This information can be compared against a tolerable threshold and used to evaluate the costs of inspections and potential consequences of a failure in an operationally-focused context. I fail to see any reason for disagreement on this matter. Best regards, Rui

After conducting several simulations with a shape factor of 1, varying times between inspections (TBI), and analyzing the results, it is evident that costs are influenced by TBI. When TBI is set at 50 hours, the cost is approximately 1,360 € with a probability of 0.007 and availability of 96.75%. Increasing TBI to 100 hours reduces cost to 810 € with a probability of 0.0176 and availability of 94.48%. The trend continues as TBI increases, with costs fluctuating and availability decreasing. The optimal TBI range appears to be between 150-300 hours based on the data. This finding is crucial as it impacts cost, availability, and the probability of multiple failures. Overall, the results indicate a correlation between TBI, cost, availability, and failure probability. Thank you for your analysis, Rui.

The required TBI for a protective device like a PSV is approximately 150 to 300 hours. Typically, PSV insitu prepop testing is planned during every scheduled shutdown, which occurs every 2 to 3 years in a petrochemical plant. Moreover, the internal passing test for a wellhead valve takes about 3 months, equivalent to 2160 hours. It is essential to ensure 100% availability for at least one year of continuous operations for offshore platforms and three years for petrochemical plants, aligning with the planned shutdown/turnaround schedule. For more information, you can refer to this useful Barringer link: http://www.barringer1.com/ar.htm.

Hello Josh, I want to clarify that the numbers mentioned are purely for an exercise in a hypothetical scenario and not reflective of real-world experiences. I am seeking insights on calculating availability in the case of hidden failures, also known as dormant or failures on demand, through simulation. This process is more complex than in cases of evident failures. Your real-world experience in the field, particularly with Pressure Safety Valves (PSV), would be greatly valued. Thank you for sharing the Barringer's link. Best regards, Rui.

Hi Rui, your work in this field seems impressive and extremely vital. From your analysis, the significance of regular inspections cannot be overstated, given its impact on cost-efficiency and device availability. When you speak of optimizing inspection frequency, much would be dependent on the industry standards, preventive measures in place, and the criticality of device failure. This could translate into an adaptive inspection schedule, perhaps influenced by real-time device performance and other factors. But this adds another layer of complexity which may, or may not, be beneficial - it's an avenue worth exploring. You might want to consult API and ISO standards concerning the maintenance and inspection of industrial equipment. They may provide valuable benchmarks for your model. Keep up the good work!

Hi Rui, your work is indeed commendable and thorough. The reduction in failure probability and total costs with frequent inspections outlines the importance of regular maintenance checks. One potential area to delve into could be to include a risk analysis module to your software. Utilizing decision tree algorithms could provide potential failure pathways and their associated costs. This would offer a more comprehensive picture to your users by including both the statistical and risk-based methodologies. As for resources, I recommend looking into the work of Karen Marais from Purdue University who did extensive research on systemic risk in engineering systems. Good luck with your software development!

More Replies →

Streamline Your Asset Management
See How Oxmaint Works!!

✅   Work Order Management

✅   Asset Tracking

✅   Preventive Maintenance

✅   Inspection Report

We have received your information. We will share Schedule Demo details on your Mail Id.

To add a comment, please sign in or register if you haven't already..   

Frequently Asked Questions (FAQ)

FAQ: 1. What is the purpose of using Monte-Carlo simulation to determine the failure-finding frequency for protective devices?

Answer: - The purpose of using Monte-Carlo simulation is to analyze the interaction between protective devices and protected functions, understand outcomes, and evaluate costs associated with inspections and potential failures.

FAQ: 2. How is the availability of protective devices calculated in the scenario described?

Answer: - The availability of protective devices is calculated by running simulations with and without inspections, monitoring failures of the protected function and protective device, and analyzing the outcomes to determine the average availability.

FAQ: 3. How do inspections impact the costs and availability of protective devices in the simulation?

Answer: - The simulation results show that inspections reduce the probability of multiple failures, increase the average availability of the protective device, and lower the mean total cost per iteration, emphasizing the importance of inspections in optimizing costs and device performance.

FAQ: 4. What are the key findings from running the simulator for 5,000 iterations?

Answer: - The key findings include the probability of multiple failures, the average availability of the protective device, and the mean total cost per iteration with and without inspections, demonstrating the benefits of inspections in reducing costs and improving device availability.

FAQ: 5. How can professionals further enhance the methodology for determining failure-finding frequency for protective devices?

Answer: - Professionals can enhance the methodology by continuously evaluating the frequency of inspections, optimizing costs, and device performance based on simulation results and feedback. Additionally, seeking recommendations and utilizing relevant literature can help improve the methodology.

Ready to Simplify Maintenance?

Join hundreds of satisfied customers who have transformed their maintenance processes.
Sign up today and start optimizing your workflow.

Request Demo  â†’