Professional Documents
Culture Documents
Leith Hitchcock
Abstract: Many Root Cause Analysis (RCA) methodologies have specific applications and limitations and
in some case for complex machinery investigations they can be combined and enhanced for better results.
Typical methodologies that can be combined effectively are Kepner Tregoe, Causal Tree Analysis
(Apollo), Fault Tree Analysis, Logic Tree Analysis, Barrier Analysis, and Human Performance Evaluation
amongst others.
The difficulty with many RCA methodologies currently available today is that by themselves they may not
result in complete and efficient analyses and effectively implemented solutions for complex machinery
problems. Most methodologies also need expert facilitation to achieve results. Many methodologies today
also have widespread general application yet can be further enhanced for asset specific investigations.
Furthermore the can be effectively combined for a more effective and efficient approach.
Using Life Cycle Management principles the concept the all root causes can be classified into one of five
categories: Design, Manufacturing, Installation, Operation and Maintenance, can be used to enhance data
gathering, problem identification, causal tree selection and management and solution design. This concept
is also applicable to both logical (KT) and graphical (CT) methodologies.
Key Words: Fault Tree Analysis, RCA, Fault Tree Analysis, Barrier Analysis, Kepner Tregoe, Life Cycle
Management, Solution Design, KT, CT.
Notification of failure
Installation File
Is there a cost- No
induced documents
effective
Preserve physical for future
Clarify the event solution?
evidence use
Operation
induced Yes
Gather support Evaluate failed
documentation components Prepare report and
recommendations
Maintenance
induced
Interview all Test system
involved personnel dynamics Submit
recommendations
Develop probable for approval
root cause(s)
Develop sequence-
of-events diagram Is root Yes
Verify cause by
cause
Verify cause by testing
evident?
testing
No File
Are
documents
recommendations
No for future
approved?
use
Barrier Analysis Use to identify barrier & Provides systematic Requires familiarity with This process is based on the
equipment failures & procedural approach. process to be effective. MORT Hazard / Target
or administrative problems. concept.
Management Use when there is a shortage of Can be used with limited May only identify areas of If this process fails to identify
Oversight & Risk experts to ask the right questions prior training. Provides a cause, not specific problem areas, seek additional
Tree Analysis & whenever the problem is a list of questions for causes. help or use Cause & Effect
(MORT) recurring one. Helpful in solving specific control & analysis.
programmatic problems. Management factors.
Human Use whenever people have been Thorough analysis. Looks None if the process is Requires training.
Performance identified as being involved in the at systemic and human closely followed.
Evaluation problem cause. aspects to failure.
Kepner-Tregoe Use for major concerns where all Highly structured More comprehensive than Requires training.
Problem Solving & aspects need thorough analysis. approach, focuses on all may be needed.
Decision Making Can be used as a general aspects of the occurrence
framework. & problem resolution.
Disciplined solution
development process.
Problem Statement
Define the problem & gather supporting
2. Problem Analysis
data. Establish RCA teams.
Data Collection
Criteria Selection
Define the criteria that the solution must
satisfy.
Solution Generation Generate alternative solutions and
5. Solution Development evaluate them against the criteria.
Solution Evaluation Select the solution/s that best meet the
criteria with minimal risk of creating new
problems.
Solution Selection
Table 2. The expanded KT methodology with DMIOM category and change analysis embedded.
4 HUMAN ERROR
Human Performance Evaluation has a key role in linking solutions to DMIOM categories where there is human
involvement. In the case of machinery failure human error is prevalent and in order to make solutions permanent a focus on the
systemic and human dimensions needs to be carried out.
A systematic view of human error is outlined in Figure 4, which details the key areas for investigation into human and
systemic error.
6 CONCLUSION
Like any maintenance activity the different root cause analysis methodologies should be viewed as tools in a toolbox and as
such any RCA program should not rely on just one method in isolation but rather have expertise in several key methodologies
that can be deployed to suit the type of RCA required.
This concept of deploying a task oriented method is the general principle behind the US Governments DOE document.
When such techniques are used for machinery root cause analysis investigations users should consider further refinements
as outlined in this paper in order improve the effectiveness and efficiency of investigations. Such processes and modification
should, however, be based on the criticality of the failed asset, the risk associated with its failure, and the return on investments
expected from the RCA.
7 REFERENCES:
[1] Mobley, R.Keith. Root cause failure analysis (1999). Butterworth-Heinemann, Woburn MA, USA. ISBN 0-
7506-7158-0.
[2] DOE-NE-STD-1004-92, DOE Guideline, Root Cause Analysis Guidance Document (1992), US Department of
Energy, Office of Nuclear Energy, Washington DC, USA.
[3] C.H. Kepner and B.B. Tregoe. The New Rational Manager (1981). Princeton Research Press, Princeton, NJ,
USA.
[4] BS5760: Part 7: 1991, Reliability of systems, equipment and components, Part 7. Guide to fault tree analysis.
[5] Barrier Analysis (1995). Technical Research and Analysis Centre, Scientech, Inc. Idaho Falls, ID, USA. SCIE-
DOE-01-TRAC-29-95.
[6] N.W. Knox and R.W.Eicher (1983). Mort Users Manual, SSDC-4, Rev.2, System Safety Development Centre,
EG&G, Idaho, ID, USA.
[7] Gano, Dean L. (1999). Apollo Root Cause Analysis. Apollonian Publications, Yakima, Washington, USA. ISBN
1-883677-01-7.