Articles: 3,583  ·  Readers: 863,895  ·  Value: USD$2,699,175

Press "Enter" to skip to content

Fault Tree Analysis (FTA)




Fault Tree Analysis (FTA) is a systematic, top-down, and deductive method of failure analysis.

It is used to analyze an undesirable top-level event—such as a system failure—by creating a graphical model that maps out all the possible combinations of failures, human errors, and environmental events that could lead to it.

Originally developed by Bell Laboratories in 1962 for the U.S. Air Force’s Minuteman missile system, FTA is a critical tool in safety and reliability engineering. It is widely used in high-risk industries such as aerospace, nuclear power, and chemical processing to proactively identify and mitigate potential hazards.

The Purpose of Fault Tree Analysis

The primary goals of FTA are to:

  • Identify Root Causes: Systematically trace back from a high-level failure to uncover the basic events (root causes) that could contribute to its occurrence.
  • Assess System Reliability: Determine the probability of an undesirable event occurring, providing a quantitative measure of system reliability and risk.
  • Prioritize Failures: Identify the most critical failure paths and components, allowing engineers to focus resources on the most impactful solutions.
  • Support Design and Decision-Making: Provide a logical framework for evaluating system designs, making informed decisions about safety enhancements, and demonstrating compliance with safety standards.

Key Components of an FTA Diagram

An FTA diagram is a logic diagram built using a combination of events and logic gates.

1. Events

Events are represented by various shapes and symbolize failures, states, or conditions in the system.

  • Top Event (Rectangle): The undesired, high-level event being analyzed. It is the single event at the top of the tree.
  • Basic Event (Circle): An initiating, low-level event that cannot be broken down further. It is a root cause that can be assigned a probability (e.g., “A specific component fails”).
  • Intermediate Event (Rectangle): An event that is caused by one or more lower-level events and contributes to higher-level events. It is a stepping stone in the failure path.
  • External Event (House): An event that is expected or guaranteed to occur in the system’s environment and is not further developed in the tree.
  • Undeveloped Event (Diamond): An event that could be further broken down, but is not, either because further analysis is not required or due to a lack of information.

2. Logic Gates

Logic gates define the relationship between events and determine how they combine to cause a higher-level event. They are based on Boolean logic.

  • OR Gate: An event occurs if any of the input events occur. It represents a scenario where multiple failure paths can lead to the same outcome.
  • AND Gate: An event occurs only if all of the input events occur simultaneously. It represents a scenario where multiple failures must coincide to cause the top event.
  • Exclusive OR Gate: An event occurs if one and only one of the input events occurs.
  • Inhibit Gate: An event occurs if all input events happen, but only if an additional conditional event also takes place.

The FTA Process: A Step-by-Step Guide

  1. Define the Top Event: Clearly and concisely define the undesired event to be analyzed. This should be specific and measurable (e.g., “Engine failure during flight” or “Power outage in the main factory”).
  2. Understand the System: A team of subject matter experts, engineers, and designers must have a comprehensive understanding of the system’s components, functions, and interdependencies.
  3. Construct the Fault Tree:
    • Start with the Top Event at the top of the diagram.
    • Identify the immediate causal events that could lead to the Top Event and connect them with the appropriate logic gate (AND or OR).
    • Continue this deductive, top-down process, breaking down each intermediate event into its contributing factors and connecting them with gates.
    • Stop when you reach basic events that cannot be broken down further.
  4. Perform Analysis (Qualitative and Quantitative):
    • Qualitative Analysis: Analyze the logical structure of the tree to identify all possible combinations of basic events that could cause the Top Event. These combinations are called “minimal cut sets” and represent the most critical failure pathways.
    • Quantitative Analysis: Assign failure probabilities or rates to each basic event. Using Boolean algebra and probability theory, calculate the overall probability of the Top Event occurring. This step provides a numerical risk assessment.
  5. Develop Solutions and Mitigations: Based on the analysis, develop corrective actions to reduce the probability of the Top Event. This may involve redesigning the system, adding redundancy, or implementing new maintenance procedures.