Failure analysis is the process of collecting and analyzing data to determine the cause of a failure, often with the goal of determining corrective actions or liability. According to Bloch and Geitner, machinery failures reveal a reaction chain of cause and effect… usually a deficiency commonly referred to as the symptom…”. failure analysis can save money, lives, and resources if done correctly and acted upon. It is an important discipline in many branches of manufacturing industry, such as the electronics industry, where it is a vital tool used in the development of new products and for the improvement of existing products. The failure analysis process relies on collecting failed components for subsequent examination of the cause or causes of failure using a wide array of methods, especially microscopy and spectroscopy. Nondestructive testing (NDT) methods (such as industrial computed tomography scanning) are valuable because the failed products are unaffected by analysis, so inspection sometimes starts using these methods.
Forensic inquiry into the failed process or product is the starting point of failure analysis. Such inquiry is conducted using scientific analytical methods such as electrical and mechanical measurements, or by analyzing failure data such as product reject reports or examples of previous failures of the same kind. The methods of forensic engineering are especially valuable in tracing product defects and flaws. They may include fatigue cracks, brittle cracks produced by stress corrosion cracking or environmental stress cracking for example. Witness statements can be valuable for reconstructing the likely sequence of events and hence the chain of cause and effect. Human factors can also be assessed when the cause of the failure is determined. There are several useful methods to prevent product failures occurring in the first place, including failure mode and effects analysis (FMEA) and fault tree analysis (FTA), methods which can be used during prototyping to analyze failures before a product is marketed.
Failure theories can only be constructed on such data, but when corrective action is needed quickly, the precautionary principle demands that measures be put in place. In aircraft accidents for example, all planes of the type involved can be grounded immediately pending the outcome of the inquiry.
Several of the techniques used in failure analysis are also used in the analysis of no fault found (NFF) which is a term used in the field of maintenance to describe a situation where an originally reported mode of failure can't be duplicated by the evaluating technician and therefore the potential defect can't be fixed.
NFF can be attributed to oxidation, defective connections of electrical components, temporary shorts or opens in the circuits, software bugs, temporary environmental factors, but also to the operator error. A large number of devices that are reported as NFF during the first troubleshooting session often return to the failure analysis lab with the same NFF symptoms or a permanent mode of failure.
The term failure analysis also applies to other fields such as business management and military strategy.
Failure analysis engineers
A failure analysis engineer often plays a lead role in the analysis of failures, whether a component or product fails in service or if failure occurs in manufacturing or during production processing. In any case, one must determine the cause of failure to prevent future occurrence, and/or to improve the performance of the device, component or structure. Structural Engineers and Mechanical Engineers are very common for the job. More specific majors can also get into the position such as materials engineers. Specializing in metallurgy and chemistry is always useful along with properties and strengths of materials. Someone could be hired for different reasons, whether it be to further prevent or liability issues. The median salary of a failure analysis engineer, an engineer with experience in the field, is $81,647. A failure analysis engineer requires a good amount of communication and ability to work with others. Usually, the person hired, has a bachelors in engineering but there are certifications that can be acquired.
Methods of analysis
The failure analysis of many different products involves the use of the following tools and techniques:
- Plasma etcher
- Back side thinning tools
- Mechanical back-side thinning
- Laser chemical back-side etching
- Focused ion beam etching (FIB)
- Dye penetrant inspection
- Other Surface analysis tools
- Scanning electron microscope (SEM)
- Transmission electron microscope (TEM)
- Computer-controlled scanning electron microscope (CCSEM)
Laser signal injection microscopy (LSIM)
- Photo carrier stimulation
- Optical beam induced current (OBIC)
- Light-induced voltage alteration (LIVA)
- Laser-assisted device alteration (LADA)
- Thermal laser stimulation (TLS)
- Mechanical probe station
- Electron beam prober
- Laser voltage prober
- Time-resolved photon emission prober (TRPE)
Software-based fault location techniques
Two Shear Key Rods failed on the Bay Bridge
People on the Case
Visual Observation which is non-destructive examination. This revealed sign of brittleness with no permanent plastic deformation before it broke. Cracks were shown which were the final breaking point of the shear key rods. The engineers suspected hydrogen was involved in producing the cracks.
Scanning Electron Microscopy which is the scanning of the cracked surfaces under high magnification to get a better understanding of the fracture. The full fracture happened after the rod couldn’t hold under load when the crack reached a critical size.
Conclusion of the Case Study
The rods failed from hydrogen embrittlement which was susceptible to the hydrogen from the high tensile load and the hydrogen already in the material. The rods did not fail because they did not meet the requirements for strength in these rods. While they met requirements, the structure was inhomogeneous which caused different strengths and low toughness.
This study shows a couple of the many ways failure analysis can be done. It always starts with a nondestructive form of observation, like a crime scene. Then pieces of the material are taken from the original piece which are used in different observations. Then destructive testing is done to find toughness and properties of the material to find exactly what went wrong.
Failure of failure analysis
The collapse of the Oakland Nimitz Freeway was a bridge that collapsed during an earthquake even after the program to strengthen the bridge. Different engineers were asked on their take on the situation. While some don’t blame the program or the department, like James Rogers who said that the earthquake could have “a good chance the Embarcadero would do the same thing the Nimitz did.” While some said more prevention could’ve been done. Dr. Priestly says that “neither of the department’s projects to strengthen roadways addressed the problems of weakness…” in the bridges joints. Some experts agreed that more could’ve been done to prevent this disaster. The program is under fire for making “the failure more serious”.
From a design engineer's POV
A product needs to be able to work even in the hardest of scenarios. This is very important on products made for expensive builds such as buildings or aircraft. If these parts fail, they can cause serious damage and/or safety problems. A product starts to be designed "...to minimize the hazards associated with this "worst case scenario." Discerning the worst case scenario requires a complete understanding of the product, its loading and its service environment. Prior to the product entering service, a prototype will often undergo laboratory testing which proves the product withstands the worst case scenario as expected." Some of the tests done on jet engines today are very intensive checking if the engine can withstand:
- ingestion of debris, dust, sand, etc.;
- ingestion of hail, snow, ice, etc.;
- ingestion of excessive amounts of water.
These tests must be harder than what the product will experience in use. The engines are pushed to the max in order to ensure that the product will function the way it should no matter the condition. Failure analysis on both sides is about the prevention of damage and maintaining safety.
- Metallurgical failure analysis
- Acronyms in microscopy
- List of materials analysis methods
- List of materials-testing resources
- Failure mode and effects analysis (FMEA)
- Failure rate
- Forensic electrical engineering
- Forensic engineering
- Forensic materials engineering
- Forensic polymer engineering
- Forensic science
- Material science
- Sample preparation equipment
- Accident analysis
- Characterization (materials science)
- Failure reporting, analysis and corrective action systems (failure data collection)
- Bloch, Heinz; Geitner, Fred (1994). Machinery Failure Analysis and Troubleshooting. Houston, Texas: Gulf Publishing Company. p. 1. ISBN 0-87201-232-8.
- "Failure Analysis Engineer Salary". PayScale.
- Brahimi, Salim; Agiular, Rosme; Christensen, Conrad (7 May 2013). "Shear Key Rod Failure Analysis Report" (PDF) – via Bay Bridge Info.
- Bishop, Katherine (1989). "Experts Ask if Anti-Quake Steps Contributed to Highway Collapse". NY Times. Retrieved 2018. Check date values in:
- T-9 Jet Engine Test Cell. Dir. Timothy Kirchner. Defense Visual Information Distribution Services. DVIDS, 12 Aug. 2013. Web.
- Brady, Brian (1999). "Failure Analysis". State University of New York at Stony Brook: Department of Material Science and Engineering.
- Duivis, Rob (7 March 2016). "How do we Test Jet Engines?". Meanwhile at KLM. Retrieved 8 April 2018.
- Martin, Perry L., Electronic Failure Analysis Handbook, McGraw-Hill Professional; 1st edition (February 28, 1999) ISBN 978-0-07-041044-2.
- Microelectronics Failure Analysis, ASM International; Fifth Edition (2004) ISBN 978-0-87170-804-5
- Lukowsky, D., Failure Analysis of Wood and Wood-Based Products, McGraw-Hill Education; 1st edition (2015) ISBN 978-0-07-183937-2.