-
PDF
- Split View
-
Views
-
Cite
Cite
Kaushik Das, Roushan Kumar, Electric vehicle battery capacity degradation and health estimation using machine-learning techniques: a review, Clean Energy, Volume 7, Issue 6, December 2023, Pages 1268–1281, https://doi.org/10.1093/ce/zkad054
- Share Icon Share
Abstract
Lithium-ion batteries have an essential characteristic in consumer electronics applications and electric mobility. However, predicting their lifetime performance is a difficult task due to the impact of operating and environmental conditions. Additionally, state-of-health (SOH) and remaining-useful-life (RUL) predictions have developed into crucial components of the energy management system for lifetime prediction to guarantee the best possible performance. Due to the non-linear behaviour of the health prediction of electric vehicle batteries, the assessment of SOH and RUL has therefore become a core research challenge for both business and academics. This paper introduces a comprehensive analysis of the application of machine learning in the domain of electric vehicle battery management, emphasizing state prediction and ageing prognostics. The objective is to provide comprehensive information about the evaluation, categorization and multiple machine-learning algorithms for predicting the SOH and RUL. Additionally, lithium-ion battery behaviour, the SOH estimation approach, key findings, advantages, challenges and potential of the battery management system for different state estimations are discussed. The study identifies the common challenges encountered in traditional battery management and provides a summary of how machine learning can be employed to address these challenges.

Introduction
Development of emission-free electrochemical energy storage systems, along with the monitoring and optimization of their performance, has become a key factor in infrastructure development for electric transportation systems [1]. Centralized and decentralized energy storage and dynamic advancement of new technologies [2, 3] deal with rising carbon emissions and their associated climate change and energy shortages [4]. Lithium-ion batteries (LIBs) [5] excel as a prominent choice among different energy storage options [6] and are seen as a viable option due to their low self-discharge rate, high power densities [7] and longer cycle life, which triggered the new path for the electric vehicle (EV) market and enabled the wide emergence of portable electronic devices [8, 9].
LIBs are rechargeable in nature and can withstand several cycles of charging and discharging, but they are sensitive and may fail if not handled carefully [10, 11]. Ions travel from the cathode [12] terminal to the anode terminal via an electrolyte during charging and vice versa. The lithium ions that have been stored travel back to the cathode via the anode during discharge [13, 14]. The performance and safety of batteries are influenced by every component utilized, so selecting the finest possible combination will aid in producing better batteries. Future development of this technology, which has already made great strides, will be driven by advances in material science and parameter monitoring [15].
LIBs continuously linearly age and nonlinearly degrade from the moment they are manufactured [16] and from the time of first use; this is because of the electrochemistry used, which gives rise to unavoidable internal chemical reactions during storage or runtime [17, 18]. The three stages of degradation are the real processes, the modes that may be seen at the cellular level and the operational impacts of capacity fading. The major factors responsible for LIB health degradation [19, 20] are overcharge with both high current and high voltage regions, overdischarge and cycle frequency, and temperature (both low and high) during storage and in operation. It also covers the state of charge, cycle bandwidth, internal short circuits in a cell, external short circuits in a battery, overheating of a battery and accelerated degradation. However, due to certain inherited properties and limitations, LIBs require a battery management system (BMS) [21] and this includes the following multitask activity: gathering battery data, determining battery status, making predictions, controlling charging and discharging processes, providing safety protection, managing thermal conditions, ensuring balancing, exercising control and facilitating on-board communication [22, 23].
The research gap thorough studies says that the performance of these approaches is not satisfactory for electric mobility applications, even though data-driven algorithms are used in several state-of-health (SOH) estimate procedures. The estimating approaches have not been sufficiently developed and the classification methods of the pertinent study work that has previously been published are frequently fairly crude. Very little research describes different methods to estimate the SOH in EV applications in detail. The research objective is to understand the usage of a reliable battery model and the modelling strategy required to obtain the correct state estimation findings. The SOH estimation techniques for EVs must be dynamic to adapt to the operating conditions as various places have varying environmental conditions and driving habits. The selected capacity fading model still severely restricts the forecast accuracy of a machine-learning (ML) algorithm, which increases the computing cost.
The contribution in the paper covers a comprehensive indication and impact of different states in the BMS. It also covers different deep-learning (DL) methods for health management systems to understand ageing and degradation. Remaining-useful-life (RUL) and SOH prediction work through a data acquisition system, data processing, DL model development and different evaluation methods. This article also discusses various aspects related to faults, prognostics and diagnosis methodologies for different types of faults, specifically focusing on LIBs. It also covers the principles and characteristics of prognostics and diagnosis, as well as the indicators used in health management systems. It concludes the future of health management system trends and a few opinions on the forthcoming progress.
The paper organization is as follows. Section 1 deals with LIB behaviour and covers different influencing factors related to the design, production and application. It also covers degradation mechanisms and electrochemical behaviour. Section 2 explains the SOH estimation approach, key findings, advantages, challenges and the potential of the BMS for different state estimations. Section 3 summarizes the challenges and prospects of using artificial intelligence (AI)/ML as a design and optimization accelerator for the effective estimation of different states for electric mobility applications. Section 4 covers conclusions and future works on ML techniques for SOH prediction in LIBs.
1 Lithium-ion battery behaviour
LIB direct measurable indices are current, voltage, internal impedance and temperature [24]. Indirect indices also show an important role in understanding and monitoring the battery system, which is having a different dynamic state of charge (SOC), power, energy, health, function, temperature and RUL [25]. Among them, the SOC and the SOH are of foremost importance for electric mobility applications [26]. Knowing the SOH accurately is essential to extending battery life and ensuring the system operates safely and reliably. The parameter associated with the SOH covers the general condition of the battery and its capability to deliver the output through a measured impedance and capacity when it is not used conditions [27, 28].
The multiple factors responsible for the final capacity and power fade in the LIB start from the design level until its usage, which gives rise to several main and side reactions at the cell level. Understanding the physics of modes and effects of reactions, degradation modes and their resultant effects is of research interest [29, 30]. Cell influence factors responsible for main and side reactions is presented in Fig. 1 and covers the design, production and application influence factor. It leads to different side and main reactions such as electrode particle cracking, binder decomposition and corrosion that produces degradation and fading effects.

Lithium cell influence factors responsible for the main and side reactions for capacity and power fade
The physical and chemical developments that take place inside the LIB cell are described by electrochemical degradation. While mechanisms offer the most in-depth perspectives on deterioration, they are sometimes the most challenging to detect during cell-level or battery-level operation [31]. Fig. 2 explains the electrochemical degradation mechanisms in LIBs and covers the copper particle cracking mechanism, decomposition, precipitation and dendrite formation; it also explains the copper current collector, an aluminium current collector and degradation mechanisms [32]. It shows the most widely known deterioration processes in LIBs working with a physics-based model only to pay attention to particle cracking and the solid electrolyte interphase.

Fig. 3 illustrates the complex electrochemical behaviour of LIBs that enables their energy storage capabilities. Understanding the electrochemical processes within LIBs is crucial for optimizing their performance and addressing challenges related to capacity fade, degradation and safety concerns. The electrochemical behaviour covers an overview of cell chemistry, lithium intercalation, ion transport, charge/discharge reaction, solid electrolyte interphase (SEI), side reaction, degradation, capacity and power fade. The process of ageing and degradation is an electrochemical phenomenon that involves the LIB material and its chemical properties. LIB storage, charging and discharging along with external factors are primarily due to the sophistication and complexities of modern mechanical systems, thus SOH estimation is a very difficult and perplexing issue for prognostics and health management [33, 34]. The modelling enables us to predict electrical parameters on terminals and to anticipate temperatures and the SOC considering a wide range of environmental and operating conditions [35]. It is essential to model and analyse the performance of LIBs under various operating conditions.

Therefore, accurate estimation of the SOH under wide environmental and operating factors is an important issue to be talked about. In academics and research, the estimation of the SOH accurately and efficiently improves the estimation and prediction of the RUL, and analytical alterations to the use of the energy storage system can be made in advance so that the system operation is safe and stable, leading to extended service life. However, because of the complicated internal workings and unpredictable operating circumstances of the battery, developing a precise battery model is challenging. ML technologies [36] are becoming more popular because of their adaptability and easily available battery data [37]. The importance of path dependence degradation, or the order in which the different degradation mechanisms are triggered, has also been taken into account. This refers to the various degradation mechanisms triggered by calendar ageing, which takes place while the battery is at rest or being used or charged.
2 SOH estimation approach and key findings
The SOH is customarily designated for battery performance and its estimation methods are used with several indirect health indicators. It is divided into different model categories, namely model-based, differential analysis, data-driven and hybrid methods.
2.1 SOH estimation
The quantitative definition of the SOH is based on the rise in internal impedance of the LIB over a specified period and current capacity decliner. The total beginning-of-life charge capacity, the total maximum charge volume and the total rated capacity are represented by , and and represented through Equations (1) and (2):
Several parameters are required for depth of discharge (DOD) estimation: it is the ratio of the released charge capacity rating with a total rated capacity and it is the discharge capacity through discharge current , represented through Equations (3–5):
Without considering battery ageing and operating efficiency, the SOC is represented through Equation (6). Direct measuring techniques measure QAged and impedance directly using impedance testing, capacity testing or other testing equipment and then the SOH is computed using Equation (7) [38]:
However, these techniques are not appropriate for actual online applications and may only be used in restricted experimental settings. Indirect analysis methods are of high interest, as these are the most suitable for practical purposes and they are now widely adopted through various methods. The flowchart for the improved coulomb counting method is shown in Fig. 4. The linked memory is first accessed to get the historical battery usage statistics.

SOH estimation flowchart through SOC and DOD data and a coulomb counting algorithm
2.2 Classification of capacity degradation methods
The SOH is presumed to be in good health and has a value of 100% in the absence of information for a recently used battery; the SOC is first approximated by measuring the loaded voltage based on the starting conditions or open-circuit voltage [39]. The approximating procedure is based on keeping an eye on the Id and Vd of the LIB voltage. The quantity and direction of the operating current can be used to determine the battery operation mode. Model-based approaches are equivalent circuit models (ECMs) and electrochemical models (EMs). However, due to the complex degradation mechanism and the non-linear behaviour, it is challenging for ECMs or EMs to accurately simulate the static and dynamic characteristics throughout the life of the battery.
It extracts one or more topographies that describe the degradation from the measured battery data, such as current, voltage and temperature, collectively called direct health indicators, whereas environmental and operating parameters are called indirect health indicators (iHIs). It is used to create a connection between health indicators and the SOH using different ML methods. Fig. 5 illustrates the classification of capacity degradation estimation methods, which cover direct measurement; indirect measurement; data-driven and knowledge-based estimation; and co-estimation techniques. Fuzzy logic, AI, signal processing and linear and non-linear models are used in the estimation approach. When predicting the SOH of EV batteries, it is often useful to consider indirect health indicators in addition to key health indicators. Internal resistance, voltage response, capacity fade, cycling efficiency, self-discharge rate, temperature sensitivity and charge acceptance are indirect indicators that provide comprehensive summarizations of the overall health of the battery [40].

The impedance of the LIB increases as it ages or degrades, so measuring the internal resistance of the battery can serve as an indirect health indicator. Higher internal resistance values typically indicate reduced battery performance and capacity [41]. The voltage profile of a battery during charging and discharging can provide insights into its health. As batteries degrade, their voltage response may deviate from the expected behaviour. Monitoring voltage patterns and comparing them with a healthy reference can help in estimating the SOH indirectly. Capacity fade refers to the reduction in the energy storage capacity of a battery over time. By periodically measuring the available capacity and comparing it to the initial capacity, one can estimate the SOH of the battery indirectly. Capacity fades can be a comprehensive summarization of overall battery health. Cycling efficiency represents the ratio of the energy discharged during a cycle to the energy required to recharge the battery [36]. As batteries degrade, their cycling efficiency tends to decrease due to increased internal losses. Monitoring the changes in cycling efficiency can provide an indirect measure of battery health. A healthy battery should exhibit low self-discharge rates when not in use. Higher self-discharge rates can indicate internal issues or degradation. Monitoring the self-discharge rate over time can provide an indirect assessment of battery health.
Battery health is often affected by temperature, humidity, dust and vibration. Monitoring how the performance and capacity of a battery change with temperature and other parameter variations can serve as an indirect health indicator. Higher temperature sensitivity and larger deviations from the expected behaviour may indicate a reduced SOH. The ability of the battery to accept charge and to discharge efficiently can be an indirect indicator of health. If the charge and discharge acceptance of a battery deteriorate over time, it suggests a reduced SOH. Monitoring the charge acceptance rate during charging sessions can help in estimating battery health indirectly. By considering these indirect health indicators alongside direct health indicators such as capacity and internal resistance, a more comprehensive assessment of battery health and an SOH estimation can be achieved. ML techniques can leverage these health indicators along with historical data to develop accurate models for estimating battery health and predicting the SOH. Table 1 lists the different iHIs for SOH estimation indicators and covers different state estimations using model-based or data-driven approaches. Most indirect iHIs are available in the literature and are based on discharging data and much less on charging data.
Different indirect health indicators for the SOH estimation approach with key health indicators
Studies . | Indirect health indicators . | State . | Type . | State estimation . |
---|---|---|---|---|
[10] | Anticipated charge storage and internal resistance for k-th cycle Total cycle count for k-th cycle | SOH | Cell | Particle filter |
[42] | Variability coefficients of voltage and current curves observed during the partial charging phase | SOH | Cell | Multi-kernel Gaussian process regression model |
[25] | iHI1 is the time needed to get to the lowest discharge point iHI2 is the battery temperature rise at the highest point iHI3 is a time decrement from 3.8 to 3.5 V iHI4 is the first maximum slope of the voltage characteristic curve during the discharge operation iHI5 is the point at which the temperature of the battery peaks iHI6 is the first maximum curve of the current curve when charging at a constant voltage iHI7 is the charging time iHI8 is the area under the current characteristics | SOH and RUL | Cell | Gaussian process regression model |
[43] | Ohmic internal resistance Polarized internal resistance | SOH | Cell | Extreme learning machine (ELM)-based feedforward neural networks |
[44] | Initial discharge voltage drop Time interval to equal discharging voltage difference | SOH | Cell | Long short-term memory neural network |
[45] | Required time for a consistent voltage disparity during charging | SOH and RUL | Cell | ELM and random vector functional link network |
[46] | Time-domain and frequency-domain condition indicators | SOH | Cell | Incremental support vector regression model |
Studies . | Indirect health indicators . | State . | Type . | State estimation . |
---|---|---|---|---|
[10] | Anticipated charge storage and internal resistance for k-th cycle Total cycle count for k-th cycle | SOH | Cell | Particle filter |
[42] | Variability coefficients of voltage and current curves observed during the partial charging phase | SOH | Cell | Multi-kernel Gaussian process regression model |
[25] | iHI1 is the time needed to get to the lowest discharge point iHI2 is the battery temperature rise at the highest point iHI3 is a time decrement from 3.8 to 3.5 V iHI4 is the first maximum slope of the voltage characteristic curve during the discharge operation iHI5 is the point at which the temperature of the battery peaks iHI6 is the first maximum curve of the current curve when charging at a constant voltage iHI7 is the charging time iHI8 is the area under the current characteristics | SOH and RUL | Cell | Gaussian process regression model |
[43] | Ohmic internal resistance Polarized internal resistance | SOH | Cell | Extreme learning machine (ELM)-based feedforward neural networks |
[44] | Initial discharge voltage drop Time interval to equal discharging voltage difference | SOH | Cell | Long short-term memory neural network |
[45] | Required time for a consistent voltage disparity during charging | SOH and RUL | Cell | ELM and random vector functional link network |
[46] | Time-domain and frequency-domain condition indicators | SOH | Cell | Incremental support vector regression model |
Different indirect health indicators for the SOH estimation approach with key health indicators
Studies . | Indirect health indicators . | State . | Type . | State estimation . |
---|---|---|---|---|
[10] | Anticipated charge storage and internal resistance for k-th cycle Total cycle count for k-th cycle | SOH | Cell | Particle filter |
[42] | Variability coefficients of voltage and current curves observed during the partial charging phase | SOH | Cell | Multi-kernel Gaussian process regression model |
[25] | iHI1 is the time needed to get to the lowest discharge point iHI2 is the battery temperature rise at the highest point iHI3 is a time decrement from 3.8 to 3.5 V iHI4 is the first maximum slope of the voltage characteristic curve during the discharge operation iHI5 is the point at which the temperature of the battery peaks iHI6 is the first maximum curve of the current curve when charging at a constant voltage iHI7 is the charging time iHI8 is the area under the current characteristics | SOH and RUL | Cell | Gaussian process regression model |
[43] | Ohmic internal resistance Polarized internal resistance | SOH | Cell | Extreme learning machine (ELM)-based feedforward neural networks |
[44] | Initial discharge voltage drop Time interval to equal discharging voltage difference | SOH | Cell | Long short-term memory neural network |
[45] | Required time for a consistent voltage disparity during charging | SOH and RUL | Cell | ELM and random vector functional link network |
[46] | Time-domain and frequency-domain condition indicators | SOH | Cell | Incremental support vector regression model |
Studies . | Indirect health indicators . | State . | Type . | State estimation . |
---|---|---|---|---|
[10] | Anticipated charge storage and internal resistance for k-th cycle Total cycle count for k-th cycle | SOH | Cell | Particle filter |
[42] | Variability coefficients of voltage and current curves observed during the partial charging phase | SOH | Cell | Multi-kernel Gaussian process regression model |
[25] | iHI1 is the time needed to get to the lowest discharge point iHI2 is the battery temperature rise at the highest point iHI3 is a time decrement from 3.8 to 3.5 V iHI4 is the first maximum slope of the voltage characteristic curve during the discharge operation iHI5 is the point at which the temperature of the battery peaks iHI6 is the first maximum curve of the current curve when charging at a constant voltage iHI7 is the charging time iHI8 is the area under the current characteristics | SOH and RUL | Cell | Gaussian process regression model |
[43] | Ohmic internal resistance Polarized internal resistance | SOH | Cell | Extreme learning machine (ELM)-based feedforward neural networks |
[44] | Initial discharge voltage drop Time interval to equal discharging voltage difference | SOH | Cell | Long short-term memory neural network |
[45] | Required time for a consistent voltage disparity during charging | SOH and RUL | Cell | ELM and random vector functional link network |
[46] | Time-domain and frequency-domain condition indicators | SOH | Cell | Incremental support vector regression model |
2.2.1 Advantages and disadvantages of different ML methods
Discharging data are affected widely by operating and environmental conditions; the data are widely varied with a low level of accuracy. The charging and discharging data are normally defined as a pattern with a low level of variance widely explored for indirect health indicator parameters for the estimation and co-estimation of the SOH and RUL. Fig. 6 illustrates the advantages and disadvantages of ML techniques for SOH prediction. Advantages deal with easily identifying trends and patterns, less human involvement and handling multidimensional data whereas disadvantages cover data acquisition complexity, high error susceptibility and interpretation of results.

From material science to engineering application, the lithium-ion battery poses a complex multivariable challenge with a variety of properties including cost, environmental effects, life-cycle analyses, safety, performance and abundant resource issues [47]. These properties generate a large set of data that has been growing exponentially in recent times [15, 48]. AI methods that are learning from training experience and applied to any system have come up exponentially with the advent of associated hardware and software development over the period and are ubiquitous in our modern world.
2.2.2 Different ML methods for SOH estimation
The massive data generated for LIB to manage, monitor, understand and assimilate are a huge challenge, where AI and its subset, ML, is handy to assist researchers in efficiently solving the parameterization and data challenges on the user and manufacturing side. A strong collaboration between experimentalists, modelling specialists and AI/ ML experts is needed. Fig. 7 explains different ML methods for SOH estimation. It covers regression, classification, clustering, association and control methods. The primary data-driven approaches at this level are classified into four categories: unsupervised, supervised, semi-supervised and reinforced learning, which is further categorized according to the differences in data-mining methods as the support vector machine (SVM), relevance vector machine (RVM), long short-term memory (LSTM), Gaussian process regression (GPR) and DL [5]. In supervised learning, we have a labelled data set in which each example is associated with a corresponding target label. In unsupervised learning, we have an unlabelled data set and identify patterns or structures within the data. Semi-supervised learning is a ML paradigm that lies between supervised learning and unsupervised learning. Approaches for data-driven methods are further classified into four domains: statistical data, artificial neural networks (ANNs), filtration and vector machines [49].

Semi-supervised learning [36] combines the elements of supervised and unsupervised approaches by utilizing a small amount of labelled data along with a larger amount of unlabelled data. The functionality of the labelled data provides valuable information for training a model, while the functionality of unlabelled data helps in leveraging the underlying structure of the data. In EVs, lithium-ion battery ML and its algorithms depend on the quantity, quality and reliability of multidimensional data sets, and uncover patterns in data and open applications that are challenging to deal with using other methods. This has high relevance for material discoveries, manufacturing, performance optimization, diagnostics, and prognostics and health management (PHM) in which multitudes of input parameters are to be considered simultaneously.
2.2.3 Assessment of SOH estimation approach
Table 2 lists different assessments of the SOH estimation approach through ML and key observations, scope, content and limitations. In its traditional form, ML relies on data and its physics agnostic. This implies that instead of providing a physical explanation for a link, ML might seek to find it through interpolation of training data [11, 50, 51].
Assessment of SOH estimation approach through ML and key observations and findings
Studies . | Scope and contents . | Observations . | Limitations . |
---|---|---|---|
[27] | The principle of each algorithm is rigorously derived followed by flow charts with a unified form | SVM, ANN and DL have high potential and are hotspots for research | SOH features from short-term measurement data for facilitating practical applications are not explored |
[30] | Data-driven methods with real-world EV battery data for performance evaluation, advantages and limitations | SVM, RVM and LSTM are medium to high in accuracy, robustness and generalization | A comprehensive study on ML methods is not evaluated on various parameters |
[49] | Summarizes advantages and disadvantages of mainstream SOH estimation and prediction methods | The fusion method, having high accuracy but also high algorithm complexity and algorithm compatibility, remains to be proven | Methodologies were not elaborated on nor reviewed for errors in estimation |
[52] | Available cell technologies, various methods and models, their advantages and disadvantages and quantitative analysis of various methods for SOH | A general review of SOH estimation methods is conducted | Measurement errors are not notified. Non-destructive methods for battery condition monitoring are proposed but no details of methods or methodologies were presented |
[53] | Charging–discharging mode, discharging mode and charging mode as three parallel experiment methods | Different methods are not extensively explained in the literature | Defined only based on certain available data sets and any validation of the results not carried out |
[54] | Battery models such as electrochemical models, mathematical models, circuit-oriented models and combination models and ML and metaheuristic methods for SOx estimation | The necessity for different state estimations and modelling for LIB is established | Detailed descriptions of methods and methodologies were not carried out for different chemistry, form factors and operating and environmental conditions |
[55] | Various models for SOH, publicly available battery data sets and estimation methods | ML is dependent on data characteristics and is computationally complex. Estimation methods are ideal for an experimental environment but are not suitable for practical engineering applications | Error in measurements such as MAF, root mean square error, etc. is not reported or reviewed. Comparative analysis of different chemistries of LIBs is not discussed |
[56] | Performance prognostics and characteristics on physics-based, data-driven and hybrid classes | PHM based on SOH and RUL is very important for the application | Comparative analysis of different ML algorithms for SOH is not reviewed or introduced comprehensively |
[57] | Compared feature-based classification of SOH estimation methods and issues and challenges in predicting SOH for LIBs with possible solutions. Also explained internal and external challenges in SOH estimation | Efficient filtering is required in ML whereas model-based features are efficient for laboratory analysis and take care of dynamic performance | Comparative analysis of different ML algorithms for SOH is not reviewed or not fully involved |
[58] | Discussed challenges of real-time PHM, feasibility and cost-effective ML-based SOH with different stress factors | SOH is often used as an input for ageing model predictors and is heavily correlated and co-estimation is required | The health feature generation process is neglected |
[59] | ML-based SOH and health feature extraction methods with accuracy and execution process | Multi-kernel relevance vector machine (MK–RVM) method has higher SOH prediction accuracy than other relevant methods such as support vector regression and GPR models | |
[60] | Different data-driven approaches to SOH modelling | Comparative review of a different approach with advantages and disadvantages | Algorithms for SOH were not reviewed |
[61] | SOC and SOH joint estimation methods | Effective feature selection and prediction methods based on less sample data should be discovered to improve prediction accuracy and efficiency | ML-based algorithms for SOC–SOH co-estimation were not explored in depth |
[62] | SOH estimation with iHI, its advantages and restrictions in terms of real-time automotive applications | Real-time estimation of battery SOH is important regarding battery fault diagnosis | ML-based SOH was not elaborated on and explained |
[63] | SOH estimation methods according to signals that are used to extract iHIs | Thorough comparison among the existing methods to provide a comprehensive understanding of SOH estimation | ML-based SOH was not elaborated on and explained |
Studies . | Scope and contents . | Observations . | Limitations . |
---|---|---|---|
[27] | The principle of each algorithm is rigorously derived followed by flow charts with a unified form | SVM, ANN and DL have high potential and are hotspots for research | SOH features from short-term measurement data for facilitating practical applications are not explored |
[30] | Data-driven methods with real-world EV battery data for performance evaluation, advantages and limitations | SVM, RVM and LSTM are medium to high in accuracy, robustness and generalization | A comprehensive study on ML methods is not evaluated on various parameters |
[49] | Summarizes advantages and disadvantages of mainstream SOH estimation and prediction methods | The fusion method, having high accuracy but also high algorithm complexity and algorithm compatibility, remains to be proven | Methodologies were not elaborated on nor reviewed for errors in estimation |
[52] | Available cell technologies, various methods and models, their advantages and disadvantages and quantitative analysis of various methods for SOH | A general review of SOH estimation methods is conducted | Measurement errors are not notified. Non-destructive methods for battery condition monitoring are proposed but no details of methods or methodologies were presented |
[53] | Charging–discharging mode, discharging mode and charging mode as three parallel experiment methods | Different methods are not extensively explained in the literature | Defined only based on certain available data sets and any validation of the results not carried out |
[54] | Battery models such as electrochemical models, mathematical models, circuit-oriented models and combination models and ML and metaheuristic methods for SOx estimation | The necessity for different state estimations and modelling for LIB is established | Detailed descriptions of methods and methodologies were not carried out for different chemistry, form factors and operating and environmental conditions |
[55] | Various models for SOH, publicly available battery data sets and estimation methods | ML is dependent on data characteristics and is computationally complex. Estimation methods are ideal for an experimental environment but are not suitable for practical engineering applications | Error in measurements such as MAF, root mean square error, etc. is not reported or reviewed. Comparative analysis of different chemistries of LIBs is not discussed |
[56] | Performance prognostics and characteristics on physics-based, data-driven and hybrid classes | PHM based on SOH and RUL is very important for the application | Comparative analysis of different ML algorithms for SOH is not reviewed or introduced comprehensively |
[57] | Compared feature-based classification of SOH estimation methods and issues and challenges in predicting SOH for LIBs with possible solutions. Also explained internal and external challenges in SOH estimation | Efficient filtering is required in ML whereas model-based features are efficient for laboratory analysis and take care of dynamic performance | Comparative analysis of different ML algorithms for SOH is not reviewed or not fully involved |
[58] | Discussed challenges of real-time PHM, feasibility and cost-effective ML-based SOH with different stress factors | SOH is often used as an input for ageing model predictors and is heavily correlated and co-estimation is required | The health feature generation process is neglected |
[59] | ML-based SOH and health feature extraction methods with accuracy and execution process | Multi-kernel relevance vector machine (MK–RVM) method has higher SOH prediction accuracy than other relevant methods such as support vector regression and GPR models | |
[60] | Different data-driven approaches to SOH modelling | Comparative review of a different approach with advantages and disadvantages | Algorithms for SOH were not reviewed |
[61] | SOC and SOH joint estimation methods | Effective feature selection and prediction methods based on less sample data should be discovered to improve prediction accuracy and efficiency | ML-based algorithms for SOC–SOH co-estimation were not explored in depth |
[62] | SOH estimation with iHI, its advantages and restrictions in terms of real-time automotive applications | Real-time estimation of battery SOH is important regarding battery fault diagnosis | ML-based SOH was not elaborated on and explained |
[63] | SOH estimation methods according to signals that are used to extract iHIs | Thorough comparison among the existing methods to provide a comprehensive understanding of SOH estimation | ML-based SOH was not elaborated on and explained |
Assessment of SOH estimation approach through ML and key observations and findings
Studies . | Scope and contents . | Observations . | Limitations . |
---|---|---|---|
[27] | The principle of each algorithm is rigorously derived followed by flow charts with a unified form | SVM, ANN and DL have high potential and are hotspots for research | SOH features from short-term measurement data for facilitating practical applications are not explored |
[30] | Data-driven methods with real-world EV battery data for performance evaluation, advantages and limitations | SVM, RVM and LSTM are medium to high in accuracy, robustness and generalization | A comprehensive study on ML methods is not evaluated on various parameters |
[49] | Summarizes advantages and disadvantages of mainstream SOH estimation and prediction methods | The fusion method, having high accuracy but also high algorithm complexity and algorithm compatibility, remains to be proven | Methodologies were not elaborated on nor reviewed for errors in estimation |
[52] | Available cell technologies, various methods and models, their advantages and disadvantages and quantitative analysis of various methods for SOH | A general review of SOH estimation methods is conducted | Measurement errors are not notified. Non-destructive methods for battery condition monitoring are proposed but no details of methods or methodologies were presented |
[53] | Charging–discharging mode, discharging mode and charging mode as three parallel experiment methods | Different methods are not extensively explained in the literature | Defined only based on certain available data sets and any validation of the results not carried out |
[54] | Battery models such as electrochemical models, mathematical models, circuit-oriented models and combination models and ML and metaheuristic methods for SOx estimation | The necessity for different state estimations and modelling for LIB is established | Detailed descriptions of methods and methodologies were not carried out for different chemistry, form factors and operating and environmental conditions |
[55] | Various models for SOH, publicly available battery data sets and estimation methods | ML is dependent on data characteristics and is computationally complex. Estimation methods are ideal for an experimental environment but are not suitable for practical engineering applications | Error in measurements such as MAF, root mean square error, etc. is not reported or reviewed. Comparative analysis of different chemistries of LIBs is not discussed |
[56] | Performance prognostics and characteristics on physics-based, data-driven and hybrid classes | PHM based on SOH and RUL is very important for the application | Comparative analysis of different ML algorithms for SOH is not reviewed or introduced comprehensively |
[57] | Compared feature-based classification of SOH estimation methods and issues and challenges in predicting SOH for LIBs with possible solutions. Also explained internal and external challenges in SOH estimation | Efficient filtering is required in ML whereas model-based features are efficient for laboratory analysis and take care of dynamic performance | Comparative analysis of different ML algorithms for SOH is not reviewed or not fully involved |
[58] | Discussed challenges of real-time PHM, feasibility and cost-effective ML-based SOH with different stress factors | SOH is often used as an input for ageing model predictors and is heavily correlated and co-estimation is required | The health feature generation process is neglected |
[59] | ML-based SOH and health feature extraction methods with accuracy and execution process | Multi-kernel relevance vector machine (MK–RVM) method has higher SOH prediction accuracy than other relevant methods such as support vector regression and GPR models | |
[60] | Different data-driven approaches to SOH modelling | Comparative review of a different approach with advantages and disadvantages | Algorithms for SOH were not reviewed |
[61] | SOC and SOH joint estimation methods | Effective feature selection and prediction methods based on less sample data should be discovered to improve prediction accuracy and efficiency | ML-based algorithms for SOC–SOH co-estimation were not explored in depth |
[62] | SOH estimation with iHI, its advantages and restrictions in terms of real-time automotive applications | Real-time estimation of battery SOH is important regarding battery fault diagnosis | ML-based SOH was not elaborated on and explained |
[63] | SOH estimation methods according to signals that are used to extract iHIs | Thorough comparison among the existing methods to provide a comprehensive understanding of SOH estimation | ML-based SOH was not elaborated on and explained |
Studies . | Scope and contents . | Observations . | Limitations . |
---|---|---|---|
[27] | The principle of each algorithm is rigorously derived followed by flow charts with a unified form | SVM, ANN and DL have high potential and are hotspots for research | SOH features from short-term measurement data for facilitating practical applications are not explored |
[30] | Data-driven methods with real-world EV battery data for performance evaluation, advantages and limitations | SVM, RVM and LSTM are medium to high in accuracy, robustness and generalization | A comprehensive study on ML methods is not evaluated on various parameters |
[49] | Summarizes advantages and disadvantages of mainstream SOH estimation and prediction methods | The fusion method, having high accuracy but also high algorithm complexity and algorithm compatibility, remains to be proven | Methodologies were not elaborated on nor reviewed for errors in estimation |
[52] | Available cell technologies, various methods and models, their advantages and disadvantages and quantitative analysis of various methods for SOH | A general review of SOH estimation methods is conducted | Measurement errors are not notified. Non-destructive methods for battery condition monitoring are proposed but no details of methods or methodologies were presented |
[53] | Charging–discharging mode, discharging mode and charging mode as three parallel experiment methods | Different methods are not extensively explained in the literature | Defined only based on certain available data sets and any validation of the results not carried out |
[54] | Battery models such as electrochemical models, mathematical models, circuit-oriented models and combination models and ML and metaheuristic methods for SOx estimation | The necessity for different state estimations and modelling for LIB is established | Detailed descriptions of methods and methodologies were not carried out for different chemistry, form factors and operating and environmental conditions |
[55] | Various models for SOH, publicly available battery data sets and estimation methods | ML is dependent on data characteristics and is computationally complex. Estimation methods are ideal for an experimental environment but are not suitable for practical engineering applications | Error in measurements such as MAF, root mean square error, etc. is not reported or reviewed. Comparative analysis of different chemistries of LIBs is not discussed |
[56] | Performance prognostics and characteristics on physics-based, data-driven and hybrid classes | PHM based on SOH and RUL is very important for the application | Comparative analysis of different ML algorithms for SOH is not reviewed or introduced comprehensively |
[57] | Compared feature-based classification of SOH estimation methods and issues and challenges in predicting SOH for LIBs with possible solutions. Also explained internal and external challenges in SOH estimation | Efficient filtering is required in ML whereas model-based features are efficient for laboratory analysis and take care of dynamic performance | Comparative analysis of different ML algorithms for SOH is not reviewed or not fully involved |
[58] | Discussed challenges of real-time PHM, feasibility and cost-effective ML-based SOH with different stress factors | SOH is often used as an input for ageing model predictors and is heavily correlated and co-estimation is required | The health feature generation process is neglected |
[59] | ML-based SOH and health feature extraction methods with accuracy and execution process | Multi-kernel relevance vector machine (MK–RVM) method has higher SOH prediction accuracy than other relevant methods such as support vector regression and GPR models | |
[60] | Different data-driven approaches to SOH modelling | Comparative review of a different approach with advantages and disadvantages | Algorithms for SOH were not reviewed |
[61] | SOC and SOH joint estimation methods | Effective feature selection and prediction methods based on less sample data should be discovered to improve prediction accuracy and efficiency | ML-based algorithms for SOC–SOH co-estimation were not explored in depth |
[62] | SOH estimation with iHI, its advantages and restrictions in terms of real-time automotive applications | Real-time estimation of battery SOH is important regarding battery fault diagnosis | ML-based SOH was not elaborated on and explained |
[63] | SOH estimation methods according to signals that are used to extract iHIs | Thorough comparison among the existing methods to provide a comprehensive understanding of SOH estimation | ML-based SOH was not elaborated on and explained |
The research community unanimously agreed that ML-based SOH is one of the best among all other types of SOH estimation methodologies with specific advantages and limitations. Online estimation of SOH and RUL is not a trivial assignment due to its non-linear behaviour and the computing exhaustive processes it requires [64, 65]. Table 3 lists different algorithms and methods used for SOH estimation with different data sets such as own experimental data, National Aeronautics and Space Administration (NASA), Oxford Battery Degradation Dataset (OBDD), Center for Advanced Life Cycle Engineering (CALCE) and Massachusetts Institute of Technology (MIT). While mathematical models suffer from significant estimation errors when attempting to precisely estimate internal battery dynamics, the combination of mathematical and EMs yields improved results, albeit at the cost of increased complexity and a higher demand for computational resources. In contrast, data-driven methods aim to address the limitations of both mathematical and EMs, each of which has its respective drawbacks.
Different methods and algorithms used for SOH estimation with different categories of data sets
Studies . | Algorithms/methods used . | Category . |
---|---|---|
[17] | Random forest regressor (RFR) | Own experiment |
[37] | Gaussian process regression (GPR) | Own experiment |
[66] | Support vector regression (SVR) | Own experiment |
[67] | Cross D-Markov Machine and Fast Wavelet Transform | Own experiment |
[68] | Temporal convolution network (TCN) and long short-term memory (LSTM) network | Experimental and NASA—Prognostics Center of Excellence (PCOE) |
[69] | Convolutional neural network (CNN) | Experimental and NASA–PCOE |
[70] | Metabolic extreme learning machine (MELM) | Own experiment |
[23] | Secondary structural ensemble learning (SSEL) | OBDD |
[71] | Second-order central difference particle filter (SCDPF) | CALCE and NASA–PCOE |
[72] | Support vector machine (SVM) with radial basis function as kernel function | NASA–PCOE |
[73] | LSTM recurrent neural network | MIT data set |
[74] | Fusion of Autoregressive Moving Average Model and Elman Neural Network | Own experiment |
[75] | Gated Recurrent (GR) Unit Neural Network (UNN) | OBDD |
[76] | Backpropagation long short-term memory (B-LSTM) neural network (NN) | CALCE |
[28] | Attention depth-wise temporal convolutional network | NASA–PCOE |
[77] | Least squares support vector machine (LSSVM) | NASA–PCOE |
[78] | GPR | Sandia National Lab |
[3] | Multiple linear regression (MLR), sparse Gaussian process regression (SGPR), Deep convolutional neural network (DCNN) method | Own experiment |
[21] | Artificial neural network (ANN) | Own experiment |
[79] | Parallel layer ELM | NASA–PCOE |
[80] | Gate recurrent unit CNN | NASA–PCOE and OBDD |
[81] | SVM | MIT data set |
[82] | SVM | Own experiment |
[83] | SVM | MIT data set |
[84] | bat algorithm- Extreme learning machine (BA- ELM) | NASA–PCOE |
[85] | SVR | CALCE |
[38] | LSTM | Own experiment |
[86] | LSTM | SMC–EV, China |
[87] | SVR | NASA–PCOE |
[17] | Gradient boosting ELM | NASA–PCOE |
[88] | Deep extreme learning machine (DELM) | NASA–PCOE |
[89] | ELM–LSTM | NASA–PCOE |
[33] | CNN–LSTM | NASA–PCOE |
[90] | Feedforward Neural Network (FNN)–CNN–LSTM | Own experiment |
[91] | Recurrent neural networks (RNN) | Own experiment |
[92] | Bidirectional LSTM (BiLSTM) | NASA–PCOE |
[93] | Deep Neural Network (DNN) | NASA–PCOE |
[2] | NN–LSTM | NASA–PCOE |
[94] | LSTM | NASA–PCOE |
[95] | Improved ant lion optimization algorithm (IALO)–SVR | NASA–PCOE |
Studies . | Algorithms/methods used . | Category . |
---|---|---|
[17] | Random forest regressor (RFR) | Own experiment |
[37] | Gaussian process regression (GPR) | Own experiment |
[66] | Support vector regression (SVR) | Own experiment |
[67] | Cross D-Markov Machine and Fast Wavelet Transform | Own experiment |
[68] | Temporal convolution network (TCN) and long short-term memory (LSTM) network | Experimental and NASA—Prognostics Center of Excellence (PCOE) |
[69] | Convolutional neural network (CNN) | Experimental and NASA–PCOE |
[70] | Metabolic extreme learning machine (MELM) | Own experiment |
[23] | Secondary structural ensemble learning (SSEL) | OBDD |
[71] | Second-order central difference particle filter (SCDPF) | CALCE and NASA–PCOE |
[72] | Support vector machine (SVM) with radial basis function as kernel function | NASA–PCOE |
[73] | LSTM recurrent neural network | MIT data set |
[74] | Fusion of Autoregressive Moving Average Model and Elman Neural Network | Own experiment |
[75] | Gated Recurrent (GR) Unit Neural Network (UNN) | OBDD |
[76] | Backpropagation long short-term memory (B-LSTM) neural network (NN) | CALCE |
[28] | Attention depth-wise temporal convolutional network | NASA–PCOE |
[77] | Least squares support vector machine (LSSVM) | NASA–PCOE |
[78] | GPR | Sandia National Lab |
[3] | Multiple linear regression (MLR), sparse Gaussian process regression (SGPR), Deep convolutional neural network (DCNN) method | Own experiment |
[21] | Artificial neural network (ANN) | Own experiment |
[79] | Parallel layer ELM | NASA–PCOE |
[80] | Gate recurrent unit CNN | NASA–PCOE and OBDD |
[81] | SVM | MIT data set |
[82] | SVM | Own experiment |
[83] | SVM | MIT data set |
[84] | bat algorithm- Extreme learning machine (BA- ELM) | NASA–PCOE |
[85] | SVR | CALCE |
[38] | LSTM | Own experiment |
[86] | LSTM | SMC–EV, China |
[87] | SVR | NASA–PCOE |
[17] | Gradient boosting ELM | NASA–PCOE |
[88] | Deep extreme learning machine (DELM) | NASA–PCOE |
[89] | ELM–LSTM | NASA–PCOE |
[33] | CNN–LSTM | NASA–PCOE |
[90] | Feedforward Neural Network (FNN)–CNN–LSTM | Own experiment |
[91] | Recurrent neural networks (RNN) | Own experiment |
[92] | Bidirectional LSTM (BiLSTM) | NASA–PCOE |
[93] | Deep Neural Network (DNN) | NASA–PCOE |
[2] | NN–LSTM | NASA–PCOE |
[94] | LSTM | NASA–PCOE |
[95] | Improved ant lion optimization algorithm (IALO)–SVR | NASA–PCOE |
Different methods and algorithms used for SOH estimation with different categories of data sets
Studies . | Algorithms/methods used . | Category . |
---|---|---|
[17] | Random forest regressor (RFR) | Own experiment |
[37] | Gaussian process regression (GPR) | Own experiment |
[66] | Support vector regression (SVR) | Own experiment |
[67] | Cross D-Markov Machine and Fast Wavelet Transform | Own experiment |
[68] | Temporal convolution network (TCN) and long short-term memory (LSTM) network | Experimental and NASA—Prognostics Center of Excellence (PCOE) |
[69] | Convolutional neural network (CNN) | Experimental and NASA–PCOE |
[70] | Metabolic extreme learning machine (MELM) | Own experiment |
[23] | Secondary structural ensemble learning (SSEL) | OBDD |
[71] | Second-order central difference particle filter (SCDPF) | CALCE and NASA–PCOE |
[72] | Support vector machine (SVM) with radial basis function as kernel function | NASA–PCOE |
[73] | LSTM recurrent neural network | MIT data set |
[74] | Fusion of Autoregressive Moving Average Model and Elman Neural Network | Own experiment |
[75] | Gated Recurrent (GR) Unit Neural Network (UNN) | OBDD |
[76] | Backpropagation long short-term memory (B-LSTM) neural network (NN) | CALCE |
[28] | Attention depth-wise temporal convolutional network | NASA–PCOE |
[77] | Least squares support vector machine (LSSVM) | NASA–PCOE |
[78] | GPR | Sandia National Lab |
[3] | Multiple linear regression (MLR), sparse Gaussian process regression (SGPR), Deep convolutional neural network (DCNN) method | Own experiment |
[21] | Artificial neural network (ANN) | Own experiment |
[79] | Parallel layer ELM | NASA–PCOE |
[80] | Gate recurrent unit CNN | NASA–PCOE and OBDD |
[81] | SVM | MIT data set |
[82] | SVM | Own experiment |
[83] | SVM | MIT data set |
[84] | bat algorithm- Extreme learning machine (BA- ELM) | NASA–PCOE |
[85] | SVR | CALCE |
[38] | LSTM | Own experiment |
[86] | LSTM | SMC–EV, China |
[87] | SVR | NASA–PCOE |
[17] | Gradient boosting ELM | NASA–PCOE |
[88] | Deep extreme learning machine (DELM) | NASA–PCOE |
[89] | ELM–LSTM | NASA–PCOE |
[33] | CNN–LSTM | NASA–PCOE |
[90] | Feedforward Neural Network (FNN)–CNN–LSTM | Own experiment |
[91] | Recurrent neural networks (RNN) | Own experiment |
[92] | Bidirectional LSTM (BiLSTM) | NASA–PCOE |
[93] | Deep Neural Network (DNN) | NASA–PCOE |
[2] | NN–LSTM | NASA–PCOE |
[94] | LSTM | NASA–PCOE |
[95] | Improved ant lion optimization algorithm (IALO)–SVR | NASA–PCOE |
Studies . | Algorithms/methods used . | Category . |
---|---|---|
[17] | Random forest regressor (RFR) | Own experiment |
[37] | Gaussian process regression (GPR) | Own experiment |
[66] | Support vector regression (SVR) | Own experiment |
[67] | Cross D-Markov Machine and Fast Wavelet Transform | Own experiment |
[68] | Temporal convolution network (TCN) and long short-term memory (LSTM) network | Experimental and NASA—Prognostics Center of Excellence (PCOE) |
[69] | Convolutional neural network (CNN) | Experimental and NASA–PCOE |
[70] | Metabolic extreme learning machine (MELM) | Own experiment |
[23] | Secondary structural ensemble learning (SSEL) | OBDD |
[71] | Second-order central difference particle filter (SCDPF) | CALCE and NASA–PCOE |
[72] | Support vector machine (SVM) with radial basis function as kernel function | NASA–PCOE |
[73] | LSTM recurrent neural network | MIT data set |
[74] | Fusion of Autoregressive Moving Average Model and Elman Neural Network | Own experiment |
[75] | Gated Recurrent (GR) Unit Neural Network (UNN) | OBDD |
[76] | Backpropagation long short-term memory (B-LSTM) neural network (NN) | CALCE |
[28] | Attention depth-wise temporal convolutional network | NASA–PCOE |
[77] | Least squares support vector machine (LSSVM) | NASA–PCOE |
[78] | GPR | Sandia National Lab |
[3] | Multiple linear regression (MLR), sparse Gaussian process regression (SGPR), Deep convolutional neural network (DCNN) method | Own experiment |
[21] | Artificial neural network (ANN) | Own experiment |
[79] | Parallel layer ELM | NASA–PCOE |
[80] | Gate recurrent unit CNN | NASA–PCOE and OBDD |
[81] | SVM | MIT data set |
[82] | SVM | Own experiment |
[83] | SVM | MIT data set |
[84] | bat algorithm- Extreme learning machine (BA- ELM) | NASA–PCOE |
[85] | SVR | CALCE |
[38] | LSTM | Own experiment |
[86] | LSTM | SMC–EV, China |
[87] | SVR | NASA–PCOE |
[17] | Gradient boosting ELM | NASA–PCOE |
[88] | Deep extreme learning machine (DELM) | NASA–PCOE |
[89] | ELM–LSTM | NASA–PCOE |
[33] | CNN–LSTM | NASA–PCOE |
[90] | Feedforward Neural Network (FNN)–CNN–LSTM | Own experiment |
[91] | Recurrent neural networks (RNN) | Own experiment |
[92] | Bidirectional LSTM (BiLSTM) | NASA–PCOE |
[93] | Deep Neural Network (DNN) | NASA–PCOE |
[2] | NN–LSTM | NASA–PCOE |
[94] | LSTM | NASA–PCOE |
[95] | Improved ant lion optimization algorithm (IALO)–SVR | NASA–PCOE |
The necessity for battery ageing data is an integral part of the data-driven ML method, although fetching authentic data is a time-consuming and cost-intensive process. Table 4 explains the SOH estimation findings with different algorithms/methods used with feature source, index and precision. A big share of researchers had adopted raw and processed data from internationally acclaimed labs and companies for the development of their ML-based algorithm and their findings are well adopted. NASA–PCOE is very popular among several data repositories. Table 5 lists the advantages and disadvantages of the multiple mainstream ML approach used for SOH estimation.
Key findings with different algorithms/methods used for SOH estimation with feature source, index and precision
Studies . | Method . | Feature sources . | Index . | Precision . |
---|---|---|---|---|
[7] | Classification and regression trees | Charging and discharging profile | MSE | 0.03 |
[67] | FC–FNN | Impedance measurements | MAE | 0.9% |
[68] | LSTM–TCN | Charging and discharging profile | RMSE | 0.0273% (Ah) 0.0547% (IR) |
[96] | genetic algorithms - wavelet neural networks (GA-WNN) | Charging and discharging profile | MAE | 1.53–1.81% |
[69] | CNN | Charging profile | MAE | 1.6% |
[23] | Secondary structural ensemble learning (SSEL) | Charging profile | RMSE | 0.3556–0.5718% |
[73] | LSTM–RNN model | Impedance measurements | RMSE | 0.25–0.50% |
[28] | AD–TCN | Charging and discharging profile | RMSE | 0.014–0.083% |
[85] | SVR | Charging profile | MAE | 0.0039 |
[97] | Single hidden layer feedforward neural network | Charging and discharging profile | MAE | 1.87–2.65% |
[98] | SVM | Current pulse test | MAE | 0.89% |
[99] | ANN | Charging profile | R2 | 0.9943 |
[34] | Online sequential ELM | Charging and discharging | R2 | 0.99 |
[91] | RNN | Charging profile | R2 | 0.994–0.997 |
Studies . | Method . | Feature sources . | Index . | Precision . |
---|---|---|---|---|
[7] | Classification and regression trees | Charging and discharging profile | MSE | 0.03 |
[67] | FC–FNN | Impedance measurements | MAE | 0.9% |
[68] | LSTM–TCN | Charging and discharging profile | RMSE | 0.0273% (Ah) 0.0547% (IR) |
[96] | genetic algorithms - wavelet neural networks (GA-WNN) | Charging and discharging profile | MAE | 1.53–1.81% |
[69] | CNN | Charging profile | MAE | 1.6% |
[23] | Secondary structural ensemble learning (SSEL) | Charging profile | RMSE | 0.3556–0.5718% |
[73] | LSTM–RNN model | Impedance measurements | RMSE | 0.25–0.50% |
[28] | AD–TCN | Charging and discharging profile | RMSE | 0.014–0.083% |
[85] | SVR | Charging profile | MAE | 0.0039 |
[97] | Single hidden layer feedforward neural network | Charging and discharging profile | MAE | 1.87–2.65% |
[98] | SVM | Current pulse test | MAE | 0.89% |
[99] | ANN | Charging profile | R2 | 0.9943 |
[34] | Online sequential ELM | Charging and discharging | R2 | 0.99 |
[91] | RNN | Charging profile | R2 | 0.994–0.997 |
MSE, mean squared error; MAE, mean absolute error; RMSE, root mean square error.
Key findings with different algorithms/methods used for SOH estimation with feature source, index and precision
Studies . | Method . | Feature sources . | Index . | Precision . |
---|---|---|---|---|
[7] | Classification and regression trees | Charging and discharging profile | MSE | 0.03 |
[67] | FC–FNN | Impedance measurements | MAE | 0.9% |
[68] | LSTM–TCN | Charging and discharging profile | RMSE | 0.0273% (Ah) 0.0547% (IR) |
[96] | genetic algorithms - wavelet neural networks (GA-WNN) | Charging and discharging profile | MAE | 1.53–1.81% |
[69] | CNN | Charging profile | MAE | 1.6% |
[23] | Secondary structural ensemble learning (SSEL) | Charging profile | RMSE | 0.3556–0.5718% |
[73] | LSTM–RNN model | Impedance measurements | RMSE | 0.25–0.50% |
[28] | AD–TCN | Charging and discharging profile | RMSE | 0.014–0.083% |
[85] | SVR | Charging profile | MAE | 0.0039 |
[97] | Single hidden layer feedforward neural network | Charging and discharging profile | MAE | 1.87–2.65% |
[98] | SVM | Current pulse test | MAE | 0.89% |
[99] | ANN | Charging profile | R2 | 0.9943 |
[34] | Online sequential ELM | Charging and discharging | R2 | 0.99 |
[91] | RNN | Charging profile | R2 | 0.994–0.997 |
Studies . | Method . | Feature sources . | Index . | Precision . |
---|---|---|---|---|
[7] | Classification and regression trees | Charging and discharging profile | MSE | 0.03 |
[67] | FC–FNN | Impedance measurements | MAE | 0.9% |
[68] | LSTM–TCN | Charging and discharging profile | RMSE | 0.0273% (Ah) 0.0547% (IR) |
[96] | genetic algorithms - wavelet neural networks (GA-WNN) | Charging and discharging profile | MAE | 1.53–1.81% |
[69] | CNN | Charging profile | MAE | 1.6% |
[23] | Secondary structural ensemble learning (SSEL) | Charging profile | RMSE | 0.3556–0.5718% |
[73] | LSTM–RNN model | Impedance measurements | RMSE | 0.25–0.50% |
[28] | AD–TCN | Charging and discharging profile | RMSE | 0.014–0.083% |
[85] | SVR | Charging profile | MAE | 0.0039 |
[97] | Single hidden layer feedforward neural network | Charging and discharging profile | MAE | 1.87–2.65% |
[98] | SVM | Current pulse test | MAE | 0.89% |
[99] | ANN | Charging profile | R2 | 0.9943 |
[34] | Online sequential ELM | Charging and discharging | R2 | 0.99 |
[91] | RNN | Charging profile | R2 | 0.994–0.997 |
MSE, mean squared error; MAE, mean absolute error; RMSE, root mean square error.
Advantages and disadvantages of the multiple mainstream ML approach used for SOH estimation
Methods . | Advantages . | Disadvantages . |
---|---|---|
Supervised | Allows collecting ageing data or producing a SOH data output and helps in optimizing performance criteria from the previous experience | Computation time is vast, unwanted data down efficiency, preprocessing of data is a big challenge and always-in need for updates |
Semi-supervised | Easy to understand and has a simple, stable and highly efficient algorithm. Also reduces the amount of annotated data used | Iteration results are not stable and not applicable to network-level data. It has low accuracy |
Unsupervised | High costs and time are involved in selecting training data. Varying consistency in classes may not match spectral classes | Opportunity for human error is minimized and is relatively easy and fast to carry out |
Reinforced | High capability to solve complex problems and is capable of achieving long-term results for SOH. It is very useful for achieving perfection, especially in non-linear health profiles | Only useful for complex situations such as SOH affected by environmental and operational circumstances. The algorithm needs a lot of ageing data and a lot of computation work |
Methods . | Advantages . | Disadvantages . |
---|---|---|
Supervised | Allows collecting ageing data or producing a SOH data output and helps in optimizing performance criteria from the previous experience | Computation time is vast, unwanted data down efficiency, preprocessing of data is a big challenge and always-in need for updates |
Semi-supervised | Easy to understand and has a simple, stable and highly efficient algorithm. Also reduces the amount of annotated data used | Iteration results are not stable and not applicable to network-level data. It has low accuracy |
Unsupervised | High costs and time are involved in selecting training data. Varying consistency in classes may not match spectral classes | Opportunity for human error is minimized and is relatively easy and fast to carry out |
Reinforced | High capability to solve complex problems and is capable of achieving long-term results for SOH. It is very useful for achieving perfection, especially in non-linear health profiles | Only useful for complex situations such as SOH affected by environmental and operational circumstances. The algorithm needs a lot of ageing data and a lot of computation work |
Advantages and disadvantages of the multiple mainstream ML approach used for SOH estimation
Methods . | Advantages . | Disadvantages . |
---|---|---|
Supervised | Allows collecting ageing data or producing a SOH data output and helps in optimizing performance criteria from the previous experience | Computation time is vast, unwanted data down efficiency, preprocessing of data is a big challenge and always-in need for updates |
Semi-supervised | Easy to understand and has a simple, stable and highly efficient algorithm. Also reduces the amount of annotated data used | Iteration results are not stable and not applicable to network-level data. It has low accuracy |
Unsupervised | High costs and time are involved in selecting training data. Varying consistency in classes may not match spectral classes | Opportunity for human error is minimized and is relatively easy and fast to carry out |
Reinforced | High capability to solve complex problems and is capable of achieving long-term results for SOH. It is very useful for achieving perfection, especially in non-linear health profiles | Only useful for complex situations such as SOH affected by environmental and operational circumstances. The algorithm needs a lot of ageing data and a lot of computation work |
Methods . | Advantages . | Disadvantages . |
---|---|---|
Supervised | Allows collecting ageing data or producing a SOH data output and helps in optimizing performance criteria from the previous experience | Computation time is vast, unwanted data down efficiency, preprocessing of data is a big challenge and always-in need for updates |
Semi-supervised | Easy to understand and has a simple, stable and highly efficient algorithm. Also reduces the amount of annotated data used | Iteration results are not stable and not applicable to network-level data. It has low accuracy |
Unsupervised | High costs and time are involved in selecting training data. Varying consistency in classes may not match spectral classes | Opportunity for human error is minimized and is relatively easy and fast to carry out |
Reinforced | High capability to solve complex problems and is capable of achieving long-term results for SOH. It is very useful for achieving perfection, especially in non-linear health profiles | Only useful for complex situations such as SOH affected by environmental and operational circumstances. The algorithm needs a lot of ageing data and a lot of computation work |
To get the appropriate state estimate results, a trustworthy battery model and modelling approach must be used. The classification methods of the applicable research work published are often rather rudimentary and there has not been sufficient work done on the estimation techniques. Very little effective research describes different methods to estimate the SOH in EV applications in detail. For EVs operating in different locations with diverse environmental conditions and driving patterns, the SOH estimate methods must be dynamic to adopt the operating conditions [57].
ML amplifies the computational demands, while the effectiveness of its predictions remains significantly constrained by the employed capacity fading model [58].
3 Challenges and prospects
The current research discusses the ideas, methods, tools, results and difficulties of applying AI/ML as a design and optimization accelerator for the effective estimation of SOH, which is the latest research issue in current LIBs for electric mobility application. Despite all the expectations and promises around AI and ML, there is still a long way to go before data-driven techniques are widely used in the SOH estimation area. Table 6 lists the different challenges in the implementation of ML in the SOH. The thorough explanation of the SOH, its estimate and the application still present some significant challenges in the BMS.
Studies . | Challenges . |
---|---|
[15] | Descriptors, user-friendly tools, lack of standards, immature presentations, data scarcity, error determination and bridging scales |
[37] | Inadequate selection of functions can result in limited knowledge generalization and the emergence of overfitting |
[52] | Studying battery chemistry is necessary to associate it with the SOH |
[100] | Filtering and denoising, further research on the SOH estimation of EVs |
[93] | Higher computational time |
[101] | Wide choice of methods and standards |
[102] | Data scarcity and cost safety concern |
Studies . | Challenges . |
---|---|
[15] | Descriptors, user-friendly tools, lack of standards, immature presentations, data scarcity, error determination and bridging scales |
[37] | Inadequate selection of functions can result in limited knowledge generalization and the emergence of overfitting |
[52] | Studying battery chemistry is necessary to associate it with the SOH |
[100] | Filtering and denoising, further research on the SOH estimation of EVs |
[93] | Higher computational time |
[101] | Wide choice of methods and standards |
[102] | Data scarcity and cost safety concern |
Studies . | Challenges . |
---|---|
[15] | Descriptors, user-friendly tools, lack of standards, immature presentations, data scarcity, error determination and bridging scales |
[37] | Inadequate selection of functions can result in limited knowledge generalization and the emergence of overfitting |
[52] | Studying battery chemistry is necessary to associate it with the SOH |
[100] | Filtering and denoising, further research on the SOH estimation of EVs |
[93] | Higher computational time |
[101] | Wide choice of methods and standards |
[102] | Data scarcity and cost safety concern |
Studies . | Challenges . |
---|---|
[15] | Descriptors, user-friendly tools, lack of standards, immature presentations, data scarcity, error determination and bridging scales |
[37] | Inadequate selection of functions can result in limited knowledge generalization and the emergence of overfitting |
[52] | Studying battery chemistry is necessary to associate it with the SOH |
[100] | Filtering and denoising, further research on the SOH estimation of EVs |
[93] | Higher computational time |
[101] | Wide choice of methods and standards |
[102] | Data scarcity and cost safety concern |
Since exterior behaviours are a composite of many interconnected internal responses, it is challenging to monitor and simulate the internal condition and ageing mechanism of LIBs since they are still poorly understood and cannot be directly monitored. In the minor ageing conditions (C-rate@1C, T@25 °C), the major ageing cause is believed to be the development of the SEI layer. But even in moderate cases of ageing, ‘abnormal’ phenomena such as abrupt capacity decline have been noted, suggesting an ageing process distinct from SEI growth.
The heterogeneity of ageing processes makes SOH and RUL prediction difficult. Diverse ageing circumstances exhibit the same capacity/resistance at one stage but differ dramatically at the next, validating the need for a multi-parameter SOH model. Periods of fast discharging may be alternated with slower discharging or periods of inactivity in real-world usage patterns; consequently, further research into this component is required for more precise health evaluation and lifetime prediction. Voltage, current and temperature are frequent measures that are effects of the interaction of several internal electrochemical processes. It is still difficult to extract and estimate the right characteristics to fully define the SOH using the working data that are currently available. EV battery capacity degradation and health estimation pose several challenges, and ML methods can help to address these challenges, as shown in Fig. 8. The challenges include lack of standards, improper selection of functions, battery ageing and degradation, battery chemistry, variability and uncertainty, lack of real-time monitoring, limited training data, cost safety system and model interpretability.

General challenges of electric vehicle battery capacity degradation and health estimation using ML methods
The precise threshold for end-of-life (EOL) parameters based on the ageing mechanism is presently undisclosed and, unlike capacity or impedance, they have clear EOL values; substantial trials are required to corroborate the EOL parameters based on the ageing mechanism. The SOH mapping from the cell level to the battery pack level is complicated and the disparity in the number of cells in the LIB pack design and industrial and laboratory scales make it difficult to accurately and practically monitor the SOH at the pack level. For battery packs, SOH estimates are challenging to reconcile due to structural complications and computing demands. Due to its sluggish variations, the SOH does not always need to be updated immediately like the SOC, but this does not imply that the computing cost is negligible.
The authentication of the suggested SOH prediction techniques is currently inadequate and only a small number of ageing tests are conducted under wide environmental and operating conditions dealing at the battery pack level with varying degrees of uncertainty. The majority of the work is verified at the cell level, charged or discharged under distinct current profiles with set ambient temperatures. The majority of research methods are created utilizing laboratory data, leading to a substantial shortfall between laboratory test results and real-world applications. To increase the prediction accuracy and effectiveness using data-driven methodologies, it is recommended to investigate effective feature selection and estimation techniques based on small sample data. Ideally, a SOH estimation model should start estimating the SOH right after the first charge–discharge cycle.
4 Conclusions
AI/ML techniques provide promising results for improving lithium-ion battery SOH and RUL estimation by leveraging vast amounts of battery data and capturing complex relationships. However, challenges such as data availability, interpretability and model generalization must be carefully addressed to ensure the effective and reliable application of AI/ML in the electric mobility domain. Continued research and collaboration are required to overcome limitations and completely unlock the potential of AI/ML in lithium-ion battery management for EVs. Different kinds of EVs employ LIBs and, in related fields, these applications call for a safe, dependable and thermally suitable selection of batteries. Battery prognostics techniques are commonly misused and their usefulness is underappreciated due to a lack of adequate communication tools between material scientists possessing a comprehensive understanding of LIB internals and engineers responsible for guaranteeing optimal battery management for EVs.
The cycle life of a lithium-ion battery is increasing from a few hundred to thousands of cycles and all state computations that previously required a smaller data set must now accommodate more data. The selection of an efficient DL strategy must handle a sample of data with numerous dimensions. Currently, the majority of research is based on laboratory data, although certain public data sets, such as NASA, MIT, Oxford, Panasonic and CALCE, were captured under different dynamic operating conditions. To increase the prediction accuracy and effectiveness using data-driven methodologies, it is recommended to investigate effective feature selection and estimation techniques based on small sample data. The SOH estimation of LIBs has been published and addresses how to apply the technique in real time in EV applications. A SOH estimate model should ideally begin estimating the SOH immediately following the initial charge–discharge cycle. DL technique interpretability in the LIB for SOH and RUL estimation uses a supervised and unsupervised learning approach and it is a type of black box. Investigating the underlying significance of computed findings can provide more insightful information and can indicate the SOH of a battery and improve performance in health prediction and evaluation.
Acknowledgements
The authors were able to develop, implement and assess the performance of the system in a cutting-edge electric and hybrid vehicle laboratory facility at the University of Petroleum and Energy Studies, Dehradun, India.
Conflict of interest statement
None declared.
Data Availability
In this study data sharing is not applicable as no new data was generated or old data sets are analysed.