UPS Battery Maintenance: When to Replace and How to Test
Introduction #
When this guide fits: You operate or maintain VRLA (or similar) strings behind UPS and need a test and replace discipline tied to runtime risk.
When it is not suitable: You are designing a new DC protection and grounding scheme for a substation—follow utility standards and PE-directed models instead of this UPS-focused overview.
The UPS inverter is only as good as the DC bus behind it. Valve-regulated lead-acid (VRLA) strings remain the default in many plants because they are predictable and well understood—but they age silently. Capacity fades from grid overcharging, stratification, elevated temperature, and shallow cycling. This guide summarizes how maintenance teams test batteries, when to replace them, and how to stay safe while working on live DC systems.
Think of batteries as wear items with hidden state: the charger can look “green” while energy available for your stated minutes is already gone. The sections below turn that hidden state into trendlines, tests, and purchase triggers your CMMS can enforce.
Pair this guide with UPS Battery Calculator and UPS Runtime Calculator so replacement strings still meet the original runtime acceptance test.
Why “runtime minutes” drifts before the UPS blinks #
Operators watch output alarms; batteries fail silently until internal resistance rises enough that terminal voltage collapses under load. Self-tests often use partial discharge and may not reproduce data-center shutdown stacks. Treat minutes as a distribution: new string, mid-life, and EOL scenarios belong in the same CMMS record so finance sees refresh coming, not only engineering.
What to measure (and how often) #
| Test | Typical frequency | What it tells you |
|---|---|---|
| Visual inspection (leaks, swelling, corrosion) | Monthly | Mechanical failure precursors |
| Float voltage per string/block | Quarterly | Charger alignment, weak cells |
| Inter-cell/terminal torque check | Annual | Resistance heating |
| Internal ohmic / impedance test | Semi-annual to annual | Trending toward end-of-life |
| Load test (constant current or AC load bank on UPS) | Per OEM or after events | Remaining actual minutes |
Trending beats spot checks. A cell that jumps 20–30% in impedance versus its baseline is often near failure even if float voltage looks fine. Plot monthly points; annual dots hide acceleration in the last quarter before failure when teams skip winter windows during busy shutdown seasons every fiscal year.
Temperature discipline #
For VRLA, Arrhenius-style rules of thumb: every 10°C above 20°C can roughly halve calendar life. If the battery cabinet shares a hot motor room without dedicated cooling, derate the design replacement interval and re-run UPS Runtime Calculator at elevated resistance assumptions.
When to replace #
Replace a string proactively when:
- Impedance trends cross the vendor threshold or internal policy (many sites use 130–150% of baseline).
- A load test cannot meet the required runtime at end-of-life voltage—even if the UPS still “passes” a brief self-test.
- Physical bulging, terminal corrosion, or thermal hotspots appear on IR inspection.
After any nearby fire, flood, or seismic event, perform impedance and load tests before returning the bus to service.
Root causes teams mislabel as “charger bugs” #
Loose intercell links mimic rectifier ripple. Corroded shunts change sense wiring and drive wrong float. Before swapping expensive charger boards, torque and clean first, then re-measure ripple at known load steps. Document before/after scope captures for the vendor case file.
Safe work practices #
- Use insulated tools, arc-rated PPE per site policy, and LOTO on all AC feeds to the UPS and maintenance bypass.
- Short-circuit current on large parallel strings can be extremely high; use rated disconnects and training.
- Label polarity and torque values at installation so future crews cannot mirror-wire a replacement tray.
Receipt inspection and baseline (day zero) #
Before burn-in, photograph serial numbers, terminal orientation, and inter-tier spacing. Record open-circuit voltage per block after stabilization and compare to OEM ranges. Baseline impedance after temperature equalization gives the trend anchor—do not skip because “the UPS passed self-test.” File the baseline under the string ID, not only the UPS asset ID, because strings rotate independently on some modular systems.
Float, equalize, and charger discipline #
Float voltage that wanders high cooks jars; low float invites sulfation in some designs. Follow the OEM chart for temperature-compensated float where available. Equalize cycles (where permitted) are not “free maintenance”—they consume water in flooded cells and stress VRLA if misapplied. Log equalize events beside room temperature so you can correlate capacity dips later.
| Symptom | Charger-side check | Battery-side check |
|---|---|---|
| High float current | Setpoints, calibration | Internal short risk |
| Ripple on DC bus | Rectifier health | Loose intercell links |
| Uneven block temps | Sharing harness | Weak cell |
Load testing without drama #
Use a dedicated load bank procedure when possible rather than “inventing” a test by opening random plant loads. Define stop conditions: minimum DC voltage, maximum cell temperature, and maximum test duration. Assign one radio net leader so E-stop is unambiguous. After the test, recharge to full before returning to service—returning with a depleted string is how false alarms multiply.
Worked example — impedance trend (illustrative) #
Baseline: 2.10 mΩ average on a 120-block string. Year 3: 2.45 mΩ (+17%). Year 4: 2.90 mΩ (+38% vs baseline). Policy says replace at +35% or failed load test—schedule replacement now rather than waiting for a holiday outage surprise.
Try our UPS Runtime Calculator with elevated internal resistance assumptions to see how minutes collapse at year-end.
Spill containment, seismic, and housekeeping #
VRLA jars can crack from forklift bumps or seismic motion. Spill kits belong inside the room, not in a distant warehouse. Cable dress should not lift terminals—strain shows up first as heat on IR scans. Keep debris out of air intakes on cabinet fans; hot chargers age adjacent jars unevenly.
Disposal, hazmat, and vendor returns #
Used batteries are regulated in many jurisdictions. Keep manifests aligned to serial lists. If a vendor offers credit for cores, verify who owns freight and insurance during return. Never stack unknown chemistries on the same skid.
Insurance and incident documentation #
After thermal events, preserve BMS logs for lithium systems and charger logs for VRLA. Insurers often ask for proof of scheduled tests—CMMS exports beat memory.
Training: what good looks like #
Technicians should demonstrate LOTO, insulated tool selection, and first-responder steps for DC arc risk. Pair senior mentors with contractors on the first string swap. Quiz on polarity mistakes—those cause catastrophic weekends.
Try our UPS Battery Calculator when replacement strings change Ah or count so runtime acceptance matches the original MOP.
Parallel strings, current sharing, and “one weak jar” #
When multiple strings feed a common DC bus, sharing harnesses and fuses must be symmetric. A weak jar raises float current on its neighbors, accelerating imbalance. If you see one string warmer than its twin, investigate connector torque and inter-string voltage offsets before blaming ambient alone.
Interpreting infrared without wishful thinking #
IR photos are useful, but emissivity on plastic cases can lie. Compare delta-T jar-to-jar on the same rack face under similar charge states. Hot terminals often mean resistance; hot case mid-body may mean internal degradation—flag both, but do not conflate root causes.
Alarms you should not silence permanently #
High float current, charger faults, and temperature rate-of-change alarms deserve tickets with owners. If operations masks alarms during busy season, you will discover end-of-life during the first real outage. Write a policy: which alarms may be snoozed, for how long, and who approves.
Spare parts and shelf life #
Spare jars age on the shelf too—track date codes and rotation into service. Torque hardware kits should live with the UPS asset. Fuse types should be non-substitutable without engineering review because AIC and speed matter on DC branches.
Second worked example — minutes collapse (illustrative) #
Design: 10 minutes at 200 kW on a new string. Year 5: internal resistance model suggests 18% longer voltage sag under same kW. Effective minutes drop near 8 without any SNMP warning if thresholds are naive. Update MOP minimum minutes or refresh early.
Vendor PM contracts: read the fine print #
Some contracts count a visual walk as a “test.” Define deliverables: impedance CSV, photos, torque sheet, and load test curve when required. Tie payments to evidence, not checkboxes.
Winter versus summer test windows #
Cold rooms slow chemical kinetics; hot rooms accelerate aging but can mask low float symptoms if chargers temperature-compensate aggressively. When comparing year-to-year tests, annotate ambient and charger mode so you do not misread a seasonal shift as failure.
Handover package checklist #
| Artifact | Owner | Frequency |
|---|---|---|
| Impedance CSV | Electrical | Semi-annual |
| Load test report | Contractor | Per contract |
| Torque log | Mechanical | Annual |
| Firmware rev | IT/OT | Each change |
When to escalate to the OEM early #
Escalate when parallel strings show divergent impedance slopes, when BMS logs show cell imbalance acceleration on lithium, or when charger ripple doubles versus commissioning. Waiting for smoke costs weeks of lead time.
Closing discipline #
Good battery programs are boring: predictable dates, trending graphs, and few surprises during outages. If your program feels exciting, you are probably funding heroics instead of maintenance—and heroics rarely scale across ten sites.
Try our UPS Load Calculator when protected kW changes so runtime tests target the right load step.
Related tools #
Related articles #
- Complete UPS Sizing Guide
- How to Calculate UPS Battery Size
- UPS Runtime: Common Mistakes
- Online vs Offline vs Line-Interactive UPS
Next steps you should take #
- Schedule impedance trending on the next maintenance window; baseline any new strings after burn-in.
- Re-run UPS Runtime Calculator after any string replacement to confirm minutes at design load.
- File reports under the UPS asset ID in your CMMS with next review date.
- Link photos and CSV exports to the same work order so auditors see one chain of evidence.
More UPS tools: UPS calculator hub.
Are UPS “self-tests” enough?
Built-in self-tests are short and may not reveal capacity loss. Schedule periodic impedance and occasional load tests for critical buses.
Can I replace only one jar in a string?
Usually no for VRLA strings: a new cell in an old string becomes the electrical weak point and mismatches charging. Follow OEM guidance; most sites replace entire strings.
What about lithium-ion UPS batteries?
Different BMS alarms, thermal runaway mitigation, and shipping rules. Treat as a separate program—do not assume VRLA checklists transfer.
How often should we load-test a critical string?
Follow OEM and insurance requirements; many critical sites use annual or biennial tests with semi-annual impedance trending between them.
What torque mistakes cause fires?
Under-torque creates resistance heat; over-torque cracks posts. Use the OEM chart and a calibrated wrench.