FMEAFailure Mode and Effects Analysis
The bottom-up risk technique that walks every component, every step, every interface — asks what could fail, how it would be detected and what would happen — and scores the resulting risk so the team can prioritise mitigations.
01What FMEA actually is
Failure Mode and Effects Analysis is a structured, bottom-up risk analysis technique. A cross-functional team walks the design, the process or the system one element at a time, identifies every plausible failure mode for that element, determines the local and end effects of each failure, identifies the current controls that prevent the failure or detect it before it reaches the patient or customer, and scores the resulting risk so the team can prioritise where to put preventive effort.
FMEA originated in US military reliability work (MIL-P-1629, 1949) and was scaled by NASA, then adopted by automotive (AIAG, VDA), aerospace (SAE ARP5580), medical devices (ISO 14971 cites FMEA as a primary technique), pharmaceuticals (ICH Q9(R1) Annex I), and software (IEC 60812 Annex C). The 2019 AIAG-VDA Handbook unified the two dominant automotive variants into a single seven-step framework, and the 2018 IEC 60812 revision aligned the international standard with the same modern practice.
02Design FMEA vs Process FMEA
| Aspect | Design FMEA (DFMEA) | Process FMEA (PFMEA) |
|---|---|---|
| Scope | Product design — components, sub-assemblies, interfaces. | Manufacturing or service process — steps, operations, controls. |
| When | During design and any design change. | During process design / PFD development and any process change. |
| Element analysed | Component or function. | Process step or sub-step. |
| Failure mode | How the component fails to perform its function. | How the process step fails to produce the intended output. |
| Cause | Design weakness, material defect, interface mismatch. | Operator error, equipment drift, material variation, environmental factor. |
| Detection | Design verification (test, analysis, inspection). | In-process inspection, SPC, gauging, end-of-line test. |
| Owner | Design engineering. | Process engineering / manufacturing engineering. |
A medical device program typically runs a DFMEA per major sub-system, a use-related FMEA (uFMEA, often called application FMEA) for use-error scenarios per IEC 62366, and a PFMEA per manufacturing line. These feed the overall ISO 14971 risk file but each has its own owner, frequency and review cadence.
03The seven-step AIAG-VDA / IEC 60812 method
- Planning and preparation — define the scope, the boundary, the team, the timeline and the assumptions. Done before any failure analysis.
- Structural analysis — break the design or process into its hierarchical structure (parts, sub-functions, steps).
- Function analysis — for each structural element define the intended function and its requirements.
- Failure analysis — for each function identify the failure modes, the effects of each failure, and the causes.
- Risk analysis — for each failure score Severity (S), Occurrence (O) and Detection (D); compute Action Priority (AP) under AIAG-VDA or RPN (S × O × D) under classical IEC 60812.
- Optimisation — for high-AP / high-RPN items, define mitigations (design change, process change, additional control), re-score and demonstrate the improvement.
- Results documentation — issue the FMEA report, link to the design history file or process validation package, and define the trigger conditions for re-evaluation.
04Scoring — RPN, AP and the limits of arithmetic
Classical FMEA computes the Risk Priority Number RPN = S × O × D, where each axis is rated 1–10 against an organisation-specific calibration scale. The team prioritises by RPN and may set a threshold above which mitigation is mandatory. The mathematical problem: an RPN of 200 from (S=10, O=4, D=5) is treated identically to (S=4, O=10, D=5), but the first is a catastrophic-effect failure with low detection and the second is a low-consequence failure that happens constantly with low detection — utterly different risk profiles.
AIAG-VDA 2019 replaced single-number RPN with an Action Priority (AP) lookup table: High / Medium / Low priority is read off a three-axis cube where Severity dominates. A Severity 10 failure is automatically High AP regardless of Occurrence and Detection. This is the same principle ISO 14971 has always used — severity is non-trade-off-able.
05Team composition and review cadence
FMEA is a team sport. Effective teams have: a process or design owner (the chair), a quality engineer (the facilitator), the operators or users who actually perform the work (the ground truth), a representative from the downstream customer of the process, and where relevant, a regulatory or risk-management lead. Single-author FMEAs miss failure modes the team would catch; FMEAs run by external consultants without operator presence miss the failure modes the operators see every day.
Re-evaluation is event-driven and time-driven. Event triggers: any design change, any process change, any deviation that exposed an unscored failure mode, any post-market signal (complaint, vigilance, recall), any change-control package. Time triggers: typically annual review for active programs, quinquennial for legacy programs in steady state.
06Common audit findings on FMEA programmes
- FMEA exists but is not living — last updated three years ago, no link to current change-control activity.
- Failure modes scoped to component-level only — system-level failure modes (interface, timing, mode-of-operation) missing.
- Detection scores assigned generously to drive RPN below threshold, no traceable evidence of detection effectiveness.
- Use-related failure modes (per IEC 62366) absent from device FMEA — only mechanical and software failures considered.
- Mitigations identified but not tracked to closure in CAPA — recommendations sit in the FMEA spreadsheet with no due date or owner.
- Severity 9 or 10 failure modes without justified preventive control — relying on Detection 1 alone is not acceptable under ISO 14971 §6.
- FMEA disconnected from the ISO 14971 risk file — duplicate but inconsistent risk artefacts.
- Re-scoring after mitigation done by the original author without independent review.
07How V5 Ultimate is built around FMEA
- FMEA is a versioned, signed object — not a spreadsheet attached to a folder.
- Each failure mode line links to the structural and function items it covers; renaming a process step propagates without losing traceability.
- AIAG-VDA Action Priority lookup is built in; classical RPN remains available for legacy programs.
- Mitigations create CAPA tasks with owners and due dates; closure of the CAPA re-opens the FMEA line for re-score.
- Change control packages auto-flag the affected FMEA lines for re-evaluation before the change can be approved.
- Post-market signals (complaints, deviations, vigilance) feed back into the FMEA — a complaint flagged as 'failure mode not previously identified' generates an FMEA-update ticket.
- Annual review cadence is tracked at the asset level — overdue FMEAs surface on the management-review dashboard.
Frequently asked questions
Q.Is FMEA mandatory?+
FMEA is not named as mandatory in any regulation. However, ISO 14971 requires risk analysis and lists FMEA as a primary technique; ICH Q9(R1) Annex I lists FMEA as a primary tool; the EU MDR GSPRs require risk management throughout the lifecycle. In practice every regulated device, drug or food submission contains FMEA evidence under one of these umbrellas.
Q.What is the difference between FMEA and FMECA?+
FMECA (Failure Mode, Effects and Criticality Analysis) adds a criticality analysis to the basic FMEA — typically a quantitative ranking by criticality number. IEC 60812 covers both. In modern practice the AIAG-VDA Action Priority approach delivers a similar prioritisation outcome without a separate criticality step.
Q.RPN versus AP — which should I use?+
AIAG-VDA 2019 Action Priority is the modern recommendation for automotive and is increasingly adopted in medical device and pharma practice because it prevents the Severity dilution problem of multiplicative RPN. Legacy programs can continue with RPN as long as the limits are understood and Severity-high items are not allowed to escape mitigation. ISO 14971 has never used RPN — it uses a Severity × Probability matrix.
Q.How is FMEA different from HAZOP?+
FMEA is bottom-up (start with the element, ask what could fail). HAZOP (Hazard and Operability Study) is top-down and deviation-driven (start with the design intent, apply guide words like 'no', 'more', 'less', 'reverse'). HAZOP is more common in process industries (chemical, oil and gas) and pharmaceutical API manufacturing; FMEA dominates device design and discrete manufacturing.
Q.Can FMEA replace ISO 14971?+
No. ISO 14971 is the risk-management framework; FMEA is one technique used inside it. A compliant ISO 14971 risk file requires a risk-management plan, hazard identification, risk evaluation, risk control, residual risk evaluation and a risk-management report — FMEA only addresses the analysis steps. The two are complementary, not substitutable.
Primary sources
Further reading
- ISO 14971 — Risk management for devicesThe standard that mandates risk analysis; FMEA is the most common technique used inside it.
- ICH Q9(R1) — Quality risk managementThe pharma equivalent — FMEA listed in Annex I as a primary technique.
- Fishbone / Ishikawa diagramThe companion technique for root-cause analysis after a failure.
- CAPA — Corrective and preventive actionWhere FMEA-identified preventive actions are tracked to closure.
- Root cause analysisPost-event sibling — FMEA is prospective, RCA is retrospective.
Explore this topic
FMEA sits inside this topic cluster in our glossary. Every neighbour is one click away.
Root-cause toolkit, SPC, capability and the rest of the QA practitioner's bench.
V5 Ultimate ships with the FMEA controls already wired in — audit trail, e-signatures, validation evidence. Free trial, no credit card, onboard in days, not months.
