Risk & Reliability of Transformers
Articles & Papers Alan RossGetting experts to agree on most things is challenging at best and, often, downright impossible. Getting transformer experts to agree might even be more challenging. Despite the fact that transformers have been around for nearly a century with varying opinions on transformer design and maintenance, most experts agree that not all transformer failures are alike. Also, the life of the insulation or paper is the primary limiting factor to the life of the transformer. Since most industrial companies use power to produce something, the risks associated with the loss of that power have been misunderstood and, all too often - ignored. But risk comes in many different forms. The Insurance Industry can quantify the risk of failure as the cost to replace the transformer. But the loss of productive capacity - Business Interruption Loss - is usually much higher than the cost of transformer replacement.
Additionally, what is the risk of failure when the unit's condition has deteriorated, and routine preventative maintenance is ignored? When asked, “Why do you test the oil in your transformers?” the most common response is, “Our insurance company requires it.” The need for a reliable system must become the reason why we test transformers since every reliable system starts with reliable, uninterrupted power.
Condition Based Risks
Why do transformers fail in the first place? Many failures happen after an incident, such as a lightning strike or a failure down the line from the transformer. These anomalies can be characterized as electrical, mechanical, or thermal, but for the most part, they are separate from the condition of the unit. These may be root cause failures, but they neglect a significant contributing factor: the condition of the unit. An overloaded transformer with well-maintained dielectric fluid, which is not already weakened by poor maintenance, is much less likely to fail than an overloaded transformer that has not been maintained.
Can transformer failures be avoided? Can life extension mitigate this risk? Since my relationship began with S. D. Myers, I have been tremendously biased when it comes to maximizing a transformer's life. I have seen first-hand how effective it is to develop standards, rigorously test and track the condition of the fleet of transformers, and maintain that fleet to those standards. Quite simply, the easiest risk to mitigate is the condition-based risk.
We maintain our productive assets with increasing rigor, yet too often, we confuse the chemical or electrical testing of a transformer with the maintenance of a transformer. In one illustrative case, we were asked to develop a standard for testing and maintenance for a customer with multiple transformers and multiple sites. There was a great deal of commonality between the units, and we had done chemical testing for over seven years, tracking the condition of the oil of the units. When we asked for the electrical test data to begin to address the complete condition of each unit, we were told we only tested with you.
I think it was that moment that changed my perspective on the whole issue of risk and reliability of electrical systems, with particular emphasis on the heart of the system, the transformer. There is so much more to a complete Condition Assessment than trended oil tests. While that is a great and important first step, the next steps are as equally critical. Is recommended maintenance on oil processing standards throughout the company, or is it left up to each individual responsible for maintaining one plant? Not all oil processing is alike. While some contractors clean the oil, others may process it on the unit until they address issues in the paper and the oil.
What criteria should be used for electrical testing to create a “Best Practices” testing and maintenance protocol? What data is available from the manufacturer at assembly and installation to determine changes over time? A simple SFRA test can serve as the baseline for future condition assessment, yet too often, we do not have that data. It is a matter of developing a reliable system that can address the condition of the unit without a lot of added costs or downtime during the life of that unit, not at the end of life. It is all about assessment, planning, and systems.
The Weakest Link-Paper
While there are cases when transformers fail without warning, most failures are ones that we can see coming. Deteriorating paper and oil emit gases, which can lead to a better diagnosis of the unit's condition. Moisture content has become a tremendously valuable predictor of failure. Sadly, gassing and high moisture content are not perceived as critically important as they should be. Accelerated deterioration of the paper insulation comes about from moisture, acid, overheating, or oxidation, and it weakens the unit's ability to perform.
One of the underlying causes of this lack of preventative maintenance when it comes to transformers is that they have been too trustworthy for too long. Transformers have been overbuilt in the past, and many have gone beyond their useful life, but they continue to hum along behind the substation fence. With the onset of computer modeling for transformer design, the specifications began to be met without overbuilding the transformer. In short, we have been spoiled and have become complacent.
Transformers built today are built with much less margin for error than those of the past. If we take the same approach to reliability based on our past experiences, we are most likely going to see more unplanned outages and downtime from transformer failures than ever before.
Paper Aging
The best measure of paper aging is Furan analysis. As paper ages, it releases Furanic compounds that are good indicators of the aging of the paper. The Department of Defense Reliability Information Analysis Center (RiAC) states the following:
“Furans are a family of organic compounds which are formed by degradation of paper insulation. Overheating, oxidation and degradation by high moisture content contribute to the destruction of insulation and form furanic compounds. Changes in furans between dissolve(d) gas analysis tests are more important than individual numbers.”
The change in furans over time describes how the solid insulation is degrading, therefore reducing the reliable life of the transformer. We can also learn more about what is happening within the unit by looking at regular analysis of gases in the liquid, called Dissolved Gas Analysis (DGA).
Excuse Me, I’ve Got Gas
Gases are dissolved in the transformer oil and are formed by normal operation, aging, and anomalies or abnormal events. By analyzing the volume, types, proportions, and rate of production of dissolved gases, we can get a very good picture of what has happened or is happening inside the unit. Because these gases can reveal a transformer's faults, they are known as "fault gases." Gases are produced by partial discharge, electrolysis, localized and general overheating, and arcing.
While it is becoming increasingly common for primary units within a substation or, in the case of steel production, the furnace units, to have DGA Monitors installed on the units, the monitor will likely only detect that a gas or gases are present. An oil sample with DGA analysis can determine the amount of gas and, in the case of frequent testing, the rate of change in that gas. From that data, knowledge is developed, meaning the cause of the increased gas is determined. In extreme cases, the analysis will indicate a potential catastrophic failure. In most cases, it will indicate a weakening of the overall condition of the unit and the need to avoid overheating at higher loading.
End of Life Risks
Recently, we were asked to consider the End of Life risks for a metal processing facility. While the greatest risk was the application risk, the most costly risk was for several rectifiers that were built in the 90’s. These three rectifiers were not part of the normal inventory or production schedule of the transformer manufacturer. These units were, in essence, custom units with a 26-week lead time. If there is no planned backup, no spare, and no way to operate without one of the rectifiers, then the End of Life risk increases.
Transformer manufacturing and installation peaked in the 1960s and 1970s in America with the rapid expansion of the industry. Many of those units are no longer in production, have very tight physical footprints due to the building of productive infrastructure around them, and may not even be able to travel the roads and rails that were available years ago. In one instance, the old rail spur ended seven miles from the plant, so getting the unit out and back in again was more costly than the cost of rewinding the unit. It also took special transportation and permitting requirements, a competency the company did not have.
When the End of Life risk and the Application risk were combined, we could monetize the cost of developing a Reaction plan. That cost will likely move any decision on planning up the corporate ladder to a much higher level. Is it better to plan ahead by developing a Reaction plan, develop budgetary costs for unit rebuild or replacement, understand and budget for transportation and contractor access, or is it better to wait until something happens? Obviously, in this instance, good Reaction plans will save tremendous amounts of time and money while allowing corporate management to develop capital budget plans over years rather than within days of a failure.
Life Extension
Obviously, the first and most important takeaway from this risk analysis must be that we are doing everything possible and economically feasible to extend the life of these critical assets. Even a company with only one unit is at risk if it happens to run its data center on that one unit. How long can most of us go if our data and/or ERP systems shut down for a couple of weeks? Consider the CEO's call on that one.
The subject of transformer maintenance and Life Extension has been a term that is defined in many different ways and in many different markets. Today, the perspective on transformer maintenance and life extension is becoming a universal concern. The motivating factors behind the unification of strategy on transformer maintenance and life extension have been driven by some common factors that include the aging population of transformers and a higher-than-expected failure rate from newer replacement units.
As pointed out earlier, the general aging population of electrical power equipment, given that the infrastructure building peaked in the 1960s and 1970s, means a great potential for failure over the next decade. Prior to developing Impact assessments, Reaction Plans, and Condition Assessments, we should consider a Life Extension system to develop and maintain the necessary testing and preventive maintenance practices as a priority.
Developing a Reliable System
So where does that leave us? What should any company dependent on the safe, reliable, and uninterrupted use of electrical power do?
- Understand the short and long-term, direct and indirect impact and costs associated with unplanned power outages from the loss of a transformer
- Develop and implement a set of standards for determining the operating condition of your transformer(s) through the use of intrusive and non-intrusive testing
- Develop a Preventative Maintenance plan to implement company-wide, giving reliability professionals the tools they need to monitor and maintain this critical and often overlooked asset class
- Develop a Reaction plan for every critical transformer in your area of responsibility/control
Preventative Maintenance Plans
Obviously, any good PM Plan will start with an assessment of the condition of the unit(s). However, while most transformer oil testing labs in the U.S. are high quality and dependable, the weakest link in that testing process is often the sampling itself. Contaminated sampling leads to invalid test results or, worse, false negatives. When the sampling process is coupled with a reliable Field Inspection, the knowledge gleaned from that sample is more predictive of potential failure. It also leads to a clear maintenance profile for the unit.
Too often, transformer maintenance companies are focused on cleaning the oil, based on good inspection and sampling information. However, cleaning the oil may not clean the paper. Moisture is a perfect example. There is usually much more moisture in the paper than in the oil. processing on the oil to remove the moisture is a temporary fix. Within days or weeks, moisture will leach out of the paper and into the oil, leading to a high Karl Fisher (the proscribed test for moisture) reading. The same principle is true for acid build up in the oil and paper.
A “Best Practices” for transformer maintenance will be the result of a program for:
- Oil Testing on a regular basis with corresponding Maintenance Assessment reporting. Remember, if we test because the insurance company requires it, we are missing the point of developing a reliable electrical system.
- Electrical Testing - For more critical units - and in the event of an oil testing result that indicates a need for additional testing - standardized testing will be required.
- Mechanical Inspections - Checking the level, proper functioning gages, bushings, fans, radiators and connections.
- Infrared (IR) - While most companies provide an annual IR scan of their entire system, there is a great advantage to conducting an annual routine IR at the time of the oil test. The combined information can lead to specifics about faults, loose connections, or potential bushing failure.
- Monitoring for gases. Fault gases can lead to catastrophic failure that goes well beyond a simple unplanned outage. Monitoring for these gases creates an early warning system. Since Hydrogen is usually always present with gases, a cost-effective monitoring solution may well be a single gas monitor with a robust follow-up oil testing program.
- Why do we test? Amazingly, one of the most frequent responses we hear is, “Because our insurance company requires it.” Really? That is why you should perform frequent transformer oil testing. The reason we test is to determine the condition of the unit so we can perform PREVENTIVE maintenance. A well-maintained and service transformer can last over half a century. We test that we might gain knowledge about the condition of our equipment and that knowledge leads to wisdom to provide a low-cost, reliable electric system.
While we may not get every transformer expert to agree on everything, there is universal agreement that a healthy and robust electrical system starts with the heart of that system - the transformer.