Diagnosing EV Charger Failures

Kyunghwan Kwon
Operations, tech

Introduction

Customer support responses such as "let's reboot it first," "let's go on site and check," or "it might be a relay issue" may work in the early stage of operations, but they quickly reach their limits as the number of chargers grows. Field dispatch costs increase, engineering teams have to track failures that are difficult to reproduce, and operations teams struggle to explain the exact cause to users.

To improve service quality and optimize operational resources, operators need a diagnostic environment that can visualize charger status remotely and identify problems in advance. It is not enough to simply know whether a failure occurred. The system must be able to narrow down the stage at which the failure happened.

This article explains how data-driven quantitative analysis can be applied to charging start failures and charging interruptions, two failure types that immediately affect revenue and can seriously damage the service's reputation among users. It also shows how the results of that analysis give operations and engineering teams actionable insights.

Case: Charging Start Failures and Charging Interruptions

A charging start failure is operationally difficult to handle. To the user, the general symptom "charging does not start" looks like a single problem, but in reality, there are many possible causes that can occur across different stages.

  • The vehicle is not electrically detected
  • The charger fails to provide a normal PWM duty cycle or frequency
  • CP measurements fluctuate near threshold values
  • CP state determination is unstable due to ADC calibration or measurement errors
  • The system reaches the charge-ready state, but the relay does not operate
  • The relay closes, but no actual current flows
  • Authentication or session start is delayed due to CSMS communication issues
  • The vehicle or charger stops the session during charging
  • Charging is limited due to temperature or meter issues

To distinguish these problems, we used the CP (Control Pilot) state as a key diagnostic signal.

CP is the basic control signal exchanged between the vehicle and the charger to determine whether charging is possible. By observing the CP state, operators can check whether the vehicle is connected, whether the vehicle has reached the charge-ready state, and whether the charger is providing a normal PWM signal.

When CP state is converted into data, charging failures can be quickly classified as problems before vehicle connection, problems during the charge preparation stage, or problems in the power delivery stage after CP. In other words, CP provides evidence for progressively narrowing down candidate causes rather than relying on vague assumptions.

However, CP does not determine every root cause. CP is a first diagnostic axis that rapidly reduces the set of possible causes. In real operations, CP must be analyzed together with relay, meter, ADC, temperature, and CSMS connectivity metrics.
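
As a rough illustration of using CP as the first diagnostic axis, the sketch below maps the furthest CP state reached during a failed attempt to the stage where the investigation should start. The `Stage` enum, the `narrow_stage` function, and the idea of passing the furthest state as a single letter are assumptions made for this example, not part of the charger firmware or Pulse.

```python
from enum import Enum


class Stage(Enum):
    BEFORE_CONNECTION = "before vehicle connection"
    CHARGE_PREPARATION = "charge preparation"
    POWER_DELIVERY = "power delivery after CP"
    FAULT = "CP or EVSE fault"


def narrow_stage(furthest_state: str) -> Stage:
    """Map the furthest CP state seen in a failed attempt to a starting stage.

    Hypothetical helper. State letters follow the usual A-F pilot states:
    A not connected, B connected, C/D charge-ready, E/F fault.
    """
    state = furthest_state.strip().upper()
    if state == "A":
        return Stage.BEFORE_CONNECTION
    if state == "B":
        return Stage.CHARGE_PREPARATION
    if state in ("C", "D"):
        return Stage.POWER_DELIVERY
    if state in ("E", "F"):
        return Stage.FAULT
    raise ValueError(f"unknown CP state: {furthest_state!r}")
```

The result only selects which group of metrics to look at next. It does not name a root cause.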

What We Collected

We designed and collected metrics so the following questions could be answered immediately.

  • Did the user actually plug in the connector?
  • Did the vehicle reach the charge-ready state?
  • Did the charger provide PWM duty cycle and frequency within specification?
  • Are the CP measurements reliable?
  • Are state transitions repeatedly fluctuating?
  • Is the CP voltage outside the normal range?
  • Was charging limited or stopped due to temperature conditions?
  • Is authentication or session start being delayed due to CSMS communication issues?

For this purpose, the charger collects the following CP-related metrics.

CP State and Signal Metrics

[Figure: PilotStatus, PilotDutyPct]

| Metric | Meaning | Operational interpretation |
| --- | --- | --- |
| PilotStatus | Current CP state | Distinguishes vehicle not connected, vehicle connected, charge-ready, and fault states |
| PilotMilliVolts | Measured CP voltage | Checks state determination stability, voltage abnormalities, and calibration issues |
| PilotDutyPct | CP PWM duty cycle ratio | Checks the allowable current level advertised to the vehicle |
| PilotFrequency | CP PWM frequency | Checks PWM generation, timer, and firmware abnormalities |
| PilotMeasureCount | Number of CP measurements | Denominator for measurement reliability and ratio calculations |
| PilotDutyErrorCount | Number of duty calculation or classification errors | Potential abnormality in duty measurement or calculation |
| PilotBoundaryValueCount | Number of measurements near threshold values | Possibility of unstable A/B/C state classification |
| PilotAnomalyCount | Number of abnormal CP patterns | Signal quality degradation or abnormal state |
| PilotOutlierCount | Number of outlier measurements | Potential noise, poor contact, or ADC instability |

These metrics can answer the following questions:

  • Was the vehicle connected?
  • Did the vehicle reach the charge-ready state?
  • Did the charger advertise a normal duty cycle?
  • Is the PWM frequency within the normal range?
  • Is the CP signal fluctuating near threshold values?
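
To make the duty-cycle and frequency questions concrete, the following sketch converts a PilotDutyPct value into the current limit it advertises, using the commonly cited SAE J1772 / IEC 61851-1 mapping, and checks PilotFrequency against the nominal 1 kHz. It assumes PilotDutyPct is reported in percent and PilotFrequency in hertz. The function names are hypothetical, and the frequency tolerance is left to the caller rather than taken from a standard.

```python
from typing import Optional

NOMINAL_PWM_HZ = 1000.0  # CP PWM is nominally 1 kHz


def advertised_current_amps(duty_pct: float) -> Optional[float]:
    """Return the current limit implied by the CP duty cycle, or None.

    Commonly cited J1772 / IEC 61851-1 mapping:
      10 % to 85 %       -> amps = duty * 0.6
      above 85 % to 96 % -> amps = (duty - 64) * 2.5
    Other duty values (for example the ~5 % digital-communication signal)
    are not handled by this sketch, so None is returned.
    """
    if 10.0 <= duty_pct <= 85.0:
        return duty_pct * 0.6
    if 85.0 < duty_pct <= 96.0:
        return (duty_pct - 64.0) * 2.5
    return None


def frequency_in_range(freq_hz: float, tolerance_hz: float) -> bool:
    """Check PilotFrequency against nominal; the tolerance is an assumption."""
    return abs(freq_hz - NOMINAL_PWM_HZ) <= tolerance_hz
```

Under this mapping, a 50 % duty cycle corresponds to a 30 A limit. A charger configured for 30 A that reports a noticeably different duty cycle is a candidate for the PWM-related checks above.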

CP Measurement Quality Metrics

[Figure: PilotIntervalMax, PilotMeasureTimeMax]

| Metric | Meaning | Operational interpretation |
| --- | --- | --- |
| PilotMeasureTimeMax | Maximum time taken for CP measurement | Potential measurement routine delay or MCU load |
| PilotMeasureTimeMin | Minimum time taken for CP measurement | Baseline for measurement time distribution |
| PilotIntervalMax | Maximum interval between CP measurements | Potential measurement cycle delay or scheduler issue |
| PilotIntervalMin | Minimum interval between CP measurements | Checks measurement cycle stability |
| PilotOverrunCount | Number of measurement routine overruns | Reduced real-time performance, task load, or firmware issue |
| PilotReadErrorCount | Number of CP read failures | Potential measurement path or ADC read issue |

These metrics describe how well CP is being measured, not the CP signal itself. This distinction is important because abnormal CP values can have two different causes.

  • The actual CP signal is abnormal
  • The CP measurement logic or ADC measurement value is abnormal

For example, the following pattern should first lead operators to suspect measurement logic or firmware scheduling issues rather than the connector.

  • Increase in PilotOverrunCount
  • Increase in PilotReadErrorCount
  • Increase in PilotMeasureTimeMax
  • Increase in PilotIntervalMax

Conversely, if measurement quality metrics are normal but only PilotBoundaryValueCount and PilotOutlierCount are repeatedly high on a specific charger, the CP line, connector, grounding, and circuit variation should be examined first.
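
The two patterns above can be sketched as a simple triage rule that compares a charger's current report with a baseline report. This is a minimal sketch under stated assumptions: reports are plain dictionaries keyed by metric name, "increase" means strictly greater than the baseline, and any single measurement-quality metric rising is treated as enough to suspect the measurement path. The function names and return labels are hypothetical, and a real rule would need per-metric thresholds.

```python
from typing import Mapping

# Metrics the article associates with the measurement path.
MEASUREMENT_METRICS = (
    "PilotOverrunCount",
    "PilotReadErrorCount",
    "PilotMeasureTimeMax",
    "PilotIntervalMax",
)
# Metrics the article associates with the CP signal itself.
SIGNAL_METRICS = ("PilotBoundaryValueCount", "PilotOutlierCount")


def _increased(
    name: str, baseline: Mapping[str, float], current: Mapping[str, float]
) -> bool:
    """True if the metric is strictly higher than in the baseline report."""
    return current.get(name, 0) > baseline.get(name, 0)


def triage_cp(baseline: Mapping[str, float], current: Mapping[str, float]) -> str:
    """Return 'measurement', 'signal', or 'inconclusive'."""
    # Measurement-quality problems are checked first, because they make
    # the signal metrics themselves untrustworthy.
    if any(_increased(m, baseline, current) for m in MEASUREMENT_METRICS):
        return "measurement"
    if all(_increased(m, baseline, current) for m in SIGNAL_METRICS):
        return "signal"
    return "inconclusive"
```

A "measurement" result points toward firmware scheduling and the ADC read path, while "signal" points toward the CP line, connector, grounding, and circuit variation.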

CSMS / Network Connectivity Metrics

[Figure: CSMSDowntime]

| Metric | Meaning | Operational interpretation |
| --- | --- | --- |
| CSMSDowntime | Duration of unavailable or abnormal connection to the CSMS | Time during which the charger could not communicate normally with the backend |
| PingFailureCount | Number of ping failures | Potential reduced network reachability or unstable line |
| PingTimeMax | Maximum ping response time | Potential latency spike or degraded line quality |
| PingTimeMin | Minimum ping response time | Baseline latency under normal conditions |

Even after the charger confirms vehicle connection and the charge-ready state, it still needs to communicate with the CSMS. OCPP is a protocol for standardizing communication between a charge point and a Charging Station Management System, and in real operations, flows such as authorization, sessions, status management, and remote control depend on CSMS connectivity.

Therefore, even if CP is normal, poor CSMS connectivity can still make the user experience the same symptom: "charging does not start."

Relay Operation Metrics

| Metric | Meaning | Operational interpretation |
| --- | --- | --- |
| RelayPickupCount | Number of relay pickup operations | Initial drive attempts to close the relay |
| RelayHoldCount | Number of relay hold operations | Relay hold operation |
| RelayOffCount | Number of relay off operations | Relay cutoff operation |
| RelayPickupDuty | Drive duty during pickup | Initial relay drive strength |
| RelayHoldDuty | Drive duty during hold | Relay hold drive strength |

Relay metrics are important for diagnosing the stage after CP.

If CP reaches state C, the vehicle is ready for charging. However, for actual charging to start, the relay must pick up normally and remain in the hold state.

For example, the following two situations are completely different cases:

PilotStatus = C
RelayPickupCount unchanged
MeterEnergyDeltaAccumulated = 0

In this case, the vehicle is ready, but the system did not proceed to the relay control stage. Authorization, CSMS control conditions, the state machine, and relay control logic should be examined.

PilotStatus = C
RelayPickupCount increased
RelayHoldCount increased
MeterEnergyDeltaAccumulated = 0

In contrast, this case means the relay operated, but actual energy transfer did not occur. Relay contacts, output path, vehicle OBC, and meter measurement issues should be checked.
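
The distinction between these two cases can be sketched as a comparison of two metric snapshots taken before and after a charging attempt. The `Snapshot` type, its field names, and the return labels are hypothetical. The third outcome, a relay that picked up but did not reach hold, is an extrapolation from the requirement that the relay must pick up and then remain in the hold state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Snapshot:
    pilot_status: str          # PilotStatus, e.g. "C"
    relay_pickup_count: int    # RelayPickupCount
    relay_hold_count: int      # RelayHoldCount
    meter_energy_delta: float  # MeterEnergyDeltaAccumulated


def classify_state_c_failure(before: Snapshot, after: Snapshot) -> str:
    """Classify a zero-energy attempt that reached state C."""
    if after.pilot_status != "C":
        return "not_in_state_c"
    if after.meter_energy_delta > 0:
        return "energy_delivered"
    picked_up = after.relay_pickup_count > before.relay_pickup_count
    held = after.relay_hold_count > before.relay_hold_count
    if not picked_up:
        # Check authorization, CSMS control conditions, the state
        # machine, and relay control logic.
        return "relay_not_driven"
    if not held:
        return "relay_pickup_without_hold"
    # Check relay contacts, output path, vehicle OBC, and the meter.
    return "relay_operated_no_energy"
```

Keeping the two main outcomes as separate labels lets a report route the first to firmware and backend checks and the second to hardware inspection.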

State Counters

| Metric | Meaning | Operational interpretation |
| --- | --- | --- |
| ChargerStateACount | Number of entries into state A during the reporting period | Return to vehicle-not-connected state, disconnection |
| ChargerStateBCount | Number of entries into state B during the reporting period | Vehicle connection recognition, reconnection, C-to-B return, possible user retry |
| ChargerStateCCount | Number of entries into state C during the reporting period | Event where the vehicle reached the charge-ready state |
| ChargerStateDCount | Number of entries into state D during the reporting period | Entry into ventilation-required state, exceptional in normal operations |
| ChargerStateECount | Number of entries into state E during the reporting period | Entry into CP fault-related state |
| ChargerStateFCount | Number of entries into state F during the reporting period | Entry into EVSE fault state |

These values are used in two ways.

  1. To calculate the ratio of state entries.
  2. As proxy indicators for estimating charging attempts and charging failure types.

The total number of state entry events during the reporting period is calculated as follows:

  State Sample Total = ChargerStateACount + ChargerStateBCount + ChargerStateCCount + ChargerStateDCount + ChargerStateECount + ChargerStateFCount

Based on this value, the ratio for each state X (A through F) can be calculated:

  State Ratio = ChargerStateXCount / State Sample Total

This ratio is not the state dwell-time ratio. It shows which state entries occurred most frequently among state entry events during the reporting period.
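
A minimal sketch of the ratio calculation follows, assuming the six counters arrive as a dictionary keyed by metric name. The function name is hypothetical, and a period with no state entries is reported as all zeros to avoid dividing by zero.

```python
from typing import Dict, Mapping

STATES = "ABCDEF"


def state_entry_ratios(counts: Mapping[str, int]) -> Dict[str, float]:
    """Return each state's share of all state entry events in the period."""
    per_state = {s: counts.get(f"ChargerState{s}Count", 0) for s in STATES}
    total = sum(per_state.values())  # State Sample Total
    if total == 0:
        return {s: 0.0 for s in STATES}
    return {s: per_state[s] / total for s in STATES}
```

With illustrative counts of A=10, B=25, C=12, D=0, E=2, and F=1, the total is 50 and the B ratio is 0.5, meaning half of all state entries in the period were entries into state B.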

For example, if there are many B-state entry events, the following possibilities can be considered:

  • Vehicle connection recognition is repeating
  • The user attempted connection multiple times
  • The A/B state is flapping
  • The CP signal is fluctuating near threshold values

ADC Error Metrics

| Metric | Meaning | Operational interpretation |
| --- | --- | --- |
| ADCErrorPipe | ADC pipeline error | ADC processing path issue |
| ADCErrorConversion | ADC conversion error | Conversion failure or driver issue |
| ADCErrorRead | ADC read error | Failure to read ADC data |
| ADCErrorParam | ADC parameter error | Configuration value or call parameter issue |
| ADCErrorCalibration | ADC calibration error | Calibration value application or calibration data issue |
| ADCErrorSequence | ADC sequence error | Measurement order or sequence issue |
| ADCError | General ADC error | Aggregate ADC layer error |

A common operational mistake is to immediately treat abnormal PilotMilliVolts values as a CP circuit or connector issue. However, if ADC errors are increasing at the same time, the problem may be in the measurement layer rather than the actual signal.

For example, the following pattern strongly suggests an ADC calibration or measurement path issue.

  • Increase in PilotBoundaryValueCount
  • Increase in PilotOutlierCount
  • Increase in ADCErrorCalibration
  • Increase in ADCErrorConversion

Conversely, if there are no ADC errors but PilotAnomalyCount and ChargerStateECount repeatedly occur only on a specific charger, the CP circuit, connector, grounding, and interaction with the vehicle should be examined first.

Temperature Metrics

| Metric | Meaning | Operational interpretation |
| --- | --- | --- |
| TemperatureWarningCount | Number of temperature warnings | Increase in thermal risk |
| TemperatureErrorCount | Number of temperature errors | Potential protective action or charging interruption |
| TemperatureMax | Maximum temperature | Potential installation environment, cooling, or component degradation issue |
| TemperatureSampleIntervalMax | Maximum temperature sampling interval | Potential temperature measurement delay or task issue |

Charging interruptions should not be explained by CP alone. Even if CP, relay, and meter data are normal, increasing temperature warnings or errors may indicate thermal protection or installation environment issues.

Insights for Operations Teams

Operations teams do not need raw metrics. They need to know, "So what should we do now?"

For example, a daily report like the following can be produced.

Daily CP Operations Report

Total charging attempts: 8,420
Suspected CP-related failures: 276 (3.3%)
Major failure types:
  1. B-state stall: 112
  2. Charge ready but no current: 84
  3. CP flapping: 51
  4. PWM abnormal: 21
  5. E/F fault: 8

Priority action targets:
  - CHG-1023: Repeated zero current in state C -> relay/output/CT inspection required
  - CHG-1182: High A/B flapping ranking -> connector contact inspection required
  - CHG-2041: Increased B-state stall -> PWM/vehicle compatibility/firmware check required

Once this kind of reporting becomes possible, the field dispatch process changes as well. Instead of visiting blindly, teams can go on site already knowing which parts and which logs need to be checked.
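
The summary portion of such a report can be sketched as a small aggregation step. The sketch assumes each failed attempt has already been assigned a failure-type label by an upstream classifier. The function name and the use of a `Counter` as input are choices made for this example.

```python
from collections import Counter


def summarize(total_attempts: int, failure_types: Counter) -> str:
    """Render the headline numbers and ranked failure types as text."""
    suspected = sum(failure_types.values())
    rate = 100.0 * suspected / total_attempts if total_attempts else 0.0
    lines = [
        f"Total charging attempts: {total_attempts:,}",
        f"Suspected CP-related failures: {suspected} ({rate:.1f}%)",
        "Major failure types:",
    ]
    # most_common() orders the failure types by count, highest first.
    for rank, (name, count) in enumerate(failure_types.most_common(), start=1):
        lines.append(f"  {rank}. {name}: {count}")
    return "\n".join(lines)
```

Feeding in the counts from the sample report (112, 84, 51, 21, and 8 out of 8,420 attempts) reproduces its headline figure of 276 suspected failures, or 3.3%.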

Reports for Engineering Teams

Engineering teams ask different questions from operations teams.

Operations teams ask, "Which charger should we look at first?" Engineering teams need to determine, "Is this a product issue, an environmental issue, or a version-specific issue?"

Therefore, engineering reports should be segmented not only by charger, but also by firmware version, hardware revision, site, and installation age.

An example is shown below.

Weekly CP Engineering Report

1. Firmware Regression
  - After deploying v1.8.3, the PWM abnormal rate increased from 0.2% to 1.9%
  - No increase on chargers with the previous version at the same site
  - Possible regression in PWM timer or duty calculation logic
2. Hardware Revision
  - cp_voltage_out_of_range rate is 2.4x higher on HW rev.B than rev.A
  - Voltage is measured lower by a consistent ratio across A/B/C states
  - ADC calibration table or voltage divider circuit variation should be checked
3. Site Pattern
  - Site-17 has a CP flapping rate 3.1x higher than the overall average
  - Other sites with the same firmware/hardware are normal
  - Grounding, power noise, and connector usage environment should be checked
4. Charger Aging
  - A/B flapping increases on chargers installed for more than 18 months
  - Possible connector contact wear or cable stress

This report goes beyond simple failure response and leads to product improvement. It enables early detection of CP voltage deviations in specific board revisions, PWM abnormalities in specific firmware versions, and connector contact issues related to installation age.
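
The segmentation behind such a report can be sketched as a group-by over per-charger records. The record layout (`attempts`, `failures`, plus a segment field such as `firmware`, `hw_rev`, or `site`) and the function name are assumptions made for this example. The sample figures are chosen only to mirror the 0.2% and 1.9% rates in the report above.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, Mapping


def failure_rate_by(
    records: Iterable[Mapping[str, Any]], key: str
) -> Dict[str, float]:
    """Aggregate failures / attempts per segment (firmware, hw_rev, site, ...)."""
    attempts: Dict[str, int] = defaultdict(int)
    failures: Dict[str, int] = defaultdict(int)
    for record in records:
        segment = str(record[key])
        attempts[segment] += int(record["attempts"])
        failures[segment] += int(record["failures"])
    # Segments with no attempts are skipped to avoid dividing by zero.
    return {s: failures[s] / attempts[s] for s in attempts if attempts[s] > 0}


# Hypothetical per-charger records for one site.
records = [
    {"firmware": "previous", "attempts": 600, "failures": 1},
    {"firmware": "previous", "attempts": 400, "failures": 1},
    {"firmware": "v1.8.3", "attempts": 1000, "failures": 19},
]
rates = failure_rate_by(records, "firmware")
```

Summing attempts and failures per segment before dividing keeps low-traffic chargers from skewing the rate, which matters when comparing a newly deployed firmware version against the rest of the fleet.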

Closing

In EV charger operations, charging start failures and charging interruptions directly affect revenue and user experience. However, it is difficult to identify the cause from user reports alone. Behind the same symptom, "charging does not work," there may be many possibilities: connector contact issues, CP signal abnormalities, vehicle readiness failures, PWM advertisement issues, relay control issues, output path issues, and more.

As charger operations scale, what matters is not performing more field checks, but knowing what to check before going on site. CP-based diagnostics are the starting point.

Through Pulse, Pazzk collects these metrics and turns charging failures from a matter of guesswork into a measurable operational problem.
