Stability and uncertainty assessment of geoelectrical resistivity model parameters: a new hybrid metaheuristic algorithm and posterior probability density function approach

Sarkar, Kuldeep; Tiwari, Jit V.; Singh, Upendra K.

doi:https://doi.org/10.5194/npg-31-7-2024

Articles | Volume 31, issue 1

Research article

10 Jan 2024

Abstract

Estimating a reliable subsurface resistivity structure using conventional techniques is challenging due to the nonlinear nature of the inverse problem. Although traditional methods have proven to be quite effective, judging the performance of an inversion technique from the optimal error alone can be ambiguous. In this work, the impact of constraints available from a borehole is examined for further assessment and to enhance the effectiveness of the algorithm. The vPSOGWO strategy is a new approach that searches the model space without any prior information; it hybridizes particle swarm optimization (PSO) with the Grey Wolf Optimizer (GWO). To assess the efficiency and novelty of the algorithm, it has been validated on two different kinds of synthetic resistivity data with various levels of noise and, subsequently, applied to three field datasets from different geological terrains. The analyzed results suggest that the subsurface resistivity model shows considerable uncertainty. Thus, it is preferable to examine the histograms and posterior probability density functions (PDFs) of such solutions to characterize the global solution. A PDF with a 68.27 % confidence interval (CI) selects a region of higher probability. Therefore, the inverted models are used to estimate the mean global solution with the smallest uncertainties, where the mean global solution represents the best solution. Our vPSOGWO-inverted outcomes prove to be more accurate than those of classic PSO, GWO, and state-of-the-art variants of the classic approaches. As a result, this novel method can play a vital role in vertical electrical sounding (VES) data inversion.

Received: 27 Aug 2022 – Discussion started: 23 Nov 2022 – Revised: 25 Oct 2023 – Accepted: 26 Oct 2023 – Published: 10 Jan 2024

1 Introduction

The vertical electrical resistivity sounding (VES) technique is an economical and simple method that has been used to determine the layered parameters in a wide range of applications in the hydrogeological, groundwater, mineral, geothermal, hydrocarbon, engineering, and environmental fields, among others (Sen et al., 1993; Sharma, 2012; Panda et al., 2018). VES data interpretation is challenging due to its unstable, nonunique solution and algorithm sensitivity (Narayan et al., 1994; Oldenburg and Li, 1994; Singh et al., 2005, 2013). Therefore, many researchers have developed inversion algorithms to improve accuracy and stability and to reduce uncertainty in the solutions. These inversion techniques are grouped into local and global optimization techniques. In the local inversion techniques, a reasonable initial guess is required to obtain the solution. This has led researchers to consider alternative methods that can search a broad range of parameters. Researchers have developed various metaheuristic optimization algorithms to solve real-world problems. These algorithm types, inspired by natural phenomena, include ant colony optimization (ACO; Colorni et al., 1991), the Bat Algorithm (Yang, 2010), biogeography-based optimization (Simon, 2008), differential evolution (DE; Storn and Price, 1997), the Firefly Algorithm (Yang, 2010), the Genetic Algorithm (GA; Whitley, 1994; Mitchell, 1998), the Gravitational Search Algorithm (GSA; Rashedi et al., 2009), the Grey Wolf Optimizer (GWO; Mirjalili et al., 2014), and particle swarm optimization (PSO; Kennedy and Eberhart, 1995). These optimization techniques aim for an optimum solution and a fast convergence rate to the global minimum. However, each global optimization algorithm strikes its own balance between two characteristics, namely exploration and exploitation. For example, PSO has a very high potential regarding exploitation, implying that the algorithm performs well with respect to a local search (Şenel et al., 2019), but it is inferior regarding exploration, which means that the algorithm is less able to establish a starting position near the global minimum and, due to its low exploration characteristics, gets trapped in local minima (Eiben and Schippers, 1998; Mirjalili and Hashim, 2010). Therefore, integrating two algorithms with opposite characteristics is an effective way to balance exploration and exploitation and to provide a more accurate and reliable solution than the results obtained with an individual algorithm. Many authors have developed hybrid metaheuristic algorithms, such as PSOGA for fundamental function analysis, PSOACO for data mining, PSODE for global optimization using the standard function, and PSOGSA using the standard function (Esmin et al., 2013; Lai and Zhang, 2009; Rashedi et al., 2009).

This study focuses on a variable-weight hybrid algorithm, known as vPSOGWO (Şenel et al., 2019), that fuses the exploitation ability of PSO with the exploration ability of GWO. In this algorithm, some random particles of PSO are replaced with new ones obtained from GWO. In prior work, the constant-weight hybrid technique of PSO and GWO, known as HPSOGWO, has been used by some authors for different applications, such as single-area-unit commitment problems (Kamboj, 2015), mathematical problems (Singh and Singh, 2017), and benchmark functions and real-world issues (Şenel et al., 2019). However, to the best of our knowledge, none of these researchers has tested these methods on geophysical data inversion. Thus, in this study, the applicability of the vPSOGWO algorithm is demonstrated on synthetic data with noise, synthetic data without noise, and various field resistivity sounding data to estimate the resistivity distribution in a 1D model of the Earth's subsurface. This work also calculates the posterior probability density functions (PDFs) with a 68.27 % confidence interval (CI) and the correlation matrix for all accepted models to determine the mean global model and its uncertainty. We then analyze and compare the effectiveness of the proposed algorithm with classic PSO, GWO, and state-of-the-art variants of the classic methods. Our analysis indicates that the vPSOGWO algorithm produces a more accurate and reliable model with excellent stability, the least model uncertainty, and the ability to successfully resist noise.

2 Forward modeling algorithm

With the help of the kernel function (Koefoed, 1979) and the Schlumberger resistivity configuration (Fig. 1), the forward code is developed, and synthetic resistivity datasets are produced from known parameters, such as the current electrode spacing, the number of geological layers, and the true resistivities and corresponding thicknesses of these layers. The apparent resistivity is expressed as follows:

\begin{matrix} (1) & ρ_{a} (s, m) = ρ_{1} + s^{2} ρ_{1} \int_{0}^{\infty} T_{1} (λ, m) J_{1} (λ s) d λ, \end{matrix}

where J₁ is the first-order Bessel function, λ is the integration variable, s is the current electrode spacing, and m is the model.

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f01

Figure 1. Schlumberger array configuration for the three-layer case: C1 and C2, through which the current is injected, are current electrodes with spacing s, while P1 and P2 are potential electrodes with spacing b.

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f02

Figure 2. Three-layer synthetic data: (a) observed data (*) and best-fitted calculated apparent resistivity curve (>68.27 % PDF); (b) the 1D mean model (>68.27 % PDF) for the true model (black), vPSOGWO (red), GWO (blue), and PSO (green).

For each layer, the resistivity transform T_k is determined by the recurrence relation of Pekeris (1940) given below; the apparent resistivity is then computed from the transform by convolution using linear filter theory:

\begin{matrix} (2) & T_{k} (λ) = ρ_{k} \times \frac{T_{k + 1} (λ) + ρ_{k} \tanh (λ t_{k})}{ρ_{k} + T_{k + 1} (λ) \tanh (λ t_{k})} . \end{matrix}
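The recurrence in Eq. (2) can be illustrated with a minimal Python sketch. It is not the authors' forward code: the function name `resistivity_transform` is hypothetical, t_k is taken to be the thickness of layer k, and the recursion is assumed to start from the bottom half-space with T_n = ρ_n, a starting value that the text does not state explicitly.

```python
import numpy as np

def resistivity_transform(lam, rho, thick):
    """Top-layer resistivity transform T_1(lambda) from the recurrence of Eq. (2).

    lam   : values of the integration variable lambda (> 0)
    rho   : layer resistivities, top to bottom (length n)
    thick : layer thicknesses, top to bottom (length n - 1)
    """
    lam = np.asarray(lam, dtype=float)
    rho = np.asarray(rho, dtype=float)
    thick = np.asarray(thick, dtype=float)
    # Assumption: the bottom half-space has T_n = rho_n.
    T = np.full_like(lam, rho[-1])
    # Walk upward from layer n-1 to layer 1, applying Eq. (2) at each layer.
    for k in range(len(thick) - 1, -1, -1):
        th = np.tanh(lam * thick[k])
        T = rho[k] * (T + rho[k] * th) / (rho[k] + T * th)
    return T
```

Two limits serve as a quick sanity check: for large λ the transform tends to the resistivity of the top layer, and for very small λ it tends to the resistivity of the bottom half-space.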

3 Inverse modeling algorithm

The geophysical inverse problem can be formulated through a forward modeling operator/functional with the aim of finding the geophysical model/solution that best explains the observed data. This operator describes the physics of the geophysical problem and maps the solution x to the observed data y as follows:

\begin{matrix} (3) & y = f (x) . \end{matrix}

The inversion techniques minimize the cost functional/misfit functional, which generally measures the discrepancy between the N observed data (y_o) and the calculated data (y_c). This misfit functional is introduced here as the mean square error (MSE) and is defined as follows:

\begin{matrix} (4) & MSE = \frac{1}{N} \sum_{i = 1}^{N} {(y_{o, i} - y_{c, i})}^{2} . \end{matrix}
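As a brief illustration, the misfit of Eq. (4) can be written in a few lines of Python. The function name `mse` and the use of NumPy are choices made for this sketch only.

```python
import numpy as np

def mse(y_obs, y_calc):
    """Mean square error between observed and calculated data, Eq. (4)."""
    y_obs = np.asarray(y_obs, dtype=float)
    y_calc = np.asarray(y_calc, dtype=float)
    # Average of the squared residuals over the N data points.
    return float(np.mean((y_obs - y_calc) ** 2))
```

In an inversion, `y_calc` would come from the forward operator of Eq. (3) evaluated for a candidate model, and the optimizer would search for the model that minimizes this value.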

3.1 Particle swarm optimization

Particle swarm optimization (PSO) is based on the social behavior of animals, such as the schooling of fish or the flocking of birds (Kennedy and Eberhart, 1995). When birds search for food, they scatter randomly within a search space until the position of the food is determined. During the search, there is always a bird that is aware of the position of the food, and it shares this information with the others. In this method, each bird is referred to as a particle and represents a geophysical solution/model (here, a particle is a set of resistivity layer parameters). The fitness of each particle is estimated using Eq. (4) from the N observed data (y_o), which play the role of the distance between the swarm and the food, and the computed data (y_c), which play the role of the distance between the swarm and the estimated position of the food (the resistivity layer parameters/solution).

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f03

Figure 3. Convergence curve for the best-fitted model parameters for the vPSOGWO algorithm.

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f04

Figure 4. Convergence curve, also known as the error versus iteration curve, for three-layer noiseless synthetic data.

The best position found among the particles is stored in memory at each iteration. The new velocity and position of the population are accepted if their possibility is large; otherwise, they are rejected. In that case, the particles are randomly redistributed in the search space in order to escape local optima. The search continues until it attains the maximum possibility or reaches the maximum number of iterations. In the global search space, the position of each particle is updated by the following two equations:

\begin{matrix} (5) & \begin{aligned} v_{i} (t + 1) & = v_{i} (t) + c_{1} \times rand \times (x_{p} (t) - x_{i} (t)) \\ & \quad + c_{2} \times rand \times (x_{g} - x_{i} (t)), \end{aligned} \\ (6) & x_{i} (t + 1) = x_{i} (t) + v_{i} (t + 1) . \end{matrix}

Here, v_i represents the velocity of the ith particle with position x_i, x_p is the best position obtained by the ith particle, x_g is the best position obtained by the whole swarm, t is the iteration number, i is the index of the model ($i = 1, 2, 3, \dots, N$), rand represents random values in the range [0, 1], and the coefficients c₁ and c₂ are the optimization parameters. The disadvantage of the PSO algorithm is that, while directing particles to random positions, it has only a small possibility of escaping local minima.
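One iteration of Eqs. (5) and (6) can be sketched in Python as follows. This is an illustrative sketch only: the function name `pso_step` is hypothetical, and the default values c1 = c2 = 2.0 are assumed for illustration because the text does not give the values used for classic PSO.

```python
import numpy as np

def pso_step(x, v, x_pbest, x_gbest, c1=2.0, c2=2.0, rng=None):
    """One PSO update for all particles, following Eqs. (5) and (6).

    x, v     : positions and velocities, shape (n_particles, n_parameters)
    x_pbest  : best position found so far by each particle, same shape as x
    x_gbest  : best position found so far by the swarm, shape (n_parameters,)
    c1, c2   : optimization coefficients (assumed values, not from the text)
    """
    rng = np.random.default_rng() if rng is None else rng
    r1 = rng.random(x.shape)  # rand in [0, 1) for the personal-best term
    r2 = rng.random(x.shape)  # rand in [0, 1) for the global-best term
    v_new = v + c1 * r1 * (x_pbest - x) + c2 * r2 * (x_gbest - x)  # Eq. (5)
    x_new = x + v_new                                              # Eq. (6)
    return x_new, v_new
```

A particle that already sits at both its personal best and the swarm best with zero velocity does not move, which shows how the swarm can stall once it has collapsed onto a local minimum.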

3.2 The Grey Wolf Optimizer (GWO) algorithm

The GWO algorithm mimics the leadership hierarchy and hunting mechanics of grey wolves and is used to solve both standard and real-life problems. In the grey wolf community, animals are divided into four groups: (i) alpha animals, (ii) beta animals, (iii) delta animals, and (iv) omega animals. Alpha, beta, and delta animals are the fittest wolves, and they guide omega animals towards promising areas of the search space. The alpha is the pack leader and generally makes the important and final decisions for all of the wolves; thus, the alpha represents the fittest solution. The betas are subordinates and help the alphas in their decision-making. However, betas cannot force alphas into any decision; they can only command the lower-ranked wolves. The beta group takes orders from the alpha group, enforces these orders on the other groups, and sends feedback to the alpha group. All of the groups are dominant with respect to the omega group. Nevertheless, the omega group is an important component of the pack during hunting, as its members play the role of the scapegoat and are only allowed to eat at the end. If a wolf is not part of the alpha, beta, or omega group, it is known as a delta and only submits to the alpha and beta groups. In the GWO algorithm, the alpha group represents the best position, i.e., the best geophysical model/solution. In our case, the geophysical model is the set of resistivity layer parameters. The beta and delta groups are the second- and third-best solutions, respectively, and the omega group comprises the remaining solutions, which always follow the other groups. The fitness of each wolf is estimated using Eq. (4) from the observed data (which measure the distance between the wolf and the prey) and the computed data (which measure the distance between the wolf and the estimated position of the prey).

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f05

Figure 5. (a) Histogram and (b) posterior PDF of all 10 000 solutions corresponding to the output of each run for the three-layer synthetic Earth model.

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f06

Figure 6. Correlation plot between model parameters (off-diagonal) and the posterior PDF curve (diagonal) for those models whose PDF exceeds 68.27 % of the confidence interval (CI).

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f07

Figure 7. Four-layer synthetic data: (a) observed data (*) and the best-fitted calculated apparent resistivity curve (>68.27 % PDF); (b) the 1D mean model (>68.27 % PDF) for the true model (black), vPSOGWO (red), GWO (blue), and PSO (green).

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f08

Figure 8. Convergence curve, also known as the error versus iteration curve, for four-layer noiseless synthetic resistivity sounding data.

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f09

Figure 9. Histogram of the logarithmic mean square error for vPSOGWO, GWO, and PSO over 10 000 models. The x axis of the three histograms represents the misfit error corresponding to 10 000 models.

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f10

Figure 10. (a) Histogram and (b) posterior PDF of all 10 000 solutions corresponding to the output of each run for four-layer synthetic resistivity sounding data.

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f11

Figure 11. Correlation plot between the model parameters (off-diagonal) and posterior PDF curve (diagonal) for models whose PDF exceeds 68.27 % of the confidence interval (CI).

Hunting in the grey wolf community is divided into three phases: searching for prey, encircling the prey, and attacking the prey. The encircling behavior of the wolves is defined by the following equations:

\begin{matrix} (7) & d = | c \times x_{p} (t) - x_{i} (t) |, \\ (8) & x_{i} (t + 1) = x_{p} (t) - a \times d . \end{matrix}

Here, x_p is the prey position, x_i is the position of the ith grey wolf, and a and c are vectors formulated as follows:

\begin{matrix} (9) & a = a_{1} \times (2 \times rand - 1), \\ (10) & c = 2 \times rand . \end{matrix}

Here, $a_{1} = 2 \times (1 - t / l)$ decreases linearly from 2 to 0 as the iteration number (t) increases, l represents the maximum number of iterations, and rand is a random number in the range [0, 1].

In the grey wolf community, the alpha group leads, the beta and the delta groups search for the prey location, and the omega group follows the other groups. Therefore, the alpha group gives the best solution, while the second- and third-best solutions are provided by the beta and the delta groups, respectively. The remaining wolves, i.e., the omega group, follow the best solutions (the other wolf groups) to obtain the best location. This is expressed mathematically as follows:

\begin{matrix} (11) & d_{α, β, δ} = | c_{1, 2, 3} \times x_{α, β, δ} - x | . \end{matrix}

The best location/position for alpha, beta, and delta wolves in each iteration is given by x_α, x_β, and x_δ, respectively:

\begin{matrix} (12) & x_{1, 2, 3} = | x_{α, β, δ} - a_{1, 2, 3} \times d_{α, β, δ} | . \end{matrix}

Here, x_p(t+1) describes the updated position of the prey in the (t+1) iteration and is obtained from the mean position of three best wolves in the population; thus,

\begin{matrix} (13) & x_{p} (t + 1) = (x_{1} + x_{2} + x_{3}) / 3 . \end{matrix}

The value of a controls whether the wolves move away from or towards the prey. When |a|≥1, the attack is abandoned and the wolves diverge from the prey in order to find a better solution; in contrast, when |a|<1, the wolves are forced to attack the prey. From Eq. (9), a varies in the range of $[- a_{1}, a_{1}]$, which shrinks from [−2, 2] towards 0 as the iterations proceed.
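The position update described by Eqs. (9)–(13) can be sketched in Python as shown below. This is a minimal sketch, not the authors' implementation: the function name `gwo_step` is hypothetical, and the absolute value in Eq. (12) is kept exactly as printed, which restricts the candidate positions to non-negative values.

```python
import numpy as np

def gwo_step(x, x_alpha, x_beta, x_delta, t, l, rng=None):
    """One GWO position update for all wolves, following Eqs. (9)-(13).

    x                       : wolf positions, shape (n_wolves, n_parameters)
    x_alpha, x_beta, x_delta: three best positions, shape (n_parameters,)
    t, l                    : current iteration and maximum number of iterations
    """
    rng = np.random.default_rng() if rng is None else rng
    a1 = 2.0 * (1.0 - t / l)  # decreases linearly from 2 to 0
    candidates = []
    for leader in (x_alpha, x_beta, x_delta):
        a = a1 * (2.0 * rng.random(x.shape) - 1.0)  # Eq. (9)
        c = 2.0 * rng.random(x.shape)               # Eq. (10)
        d = np.abs(c * leader - x)                  # Eq. (11)
        candidates.append(np.abs(leader - a * d))   # Eq. (12), as printed
    # Eq. (13): mean of the three leader-guided positions.
    return (candidates[0] + candidates[1] + candidates[2]) / 3.0
```

At the final iteration (t = l), a₁ is zero, so every wolf is placed at the mean of the three leader positions; early in the run, the larger range of a allows wolves to move away from the leaders and explore.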

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f12

Figure 12. Three-layer field data over Mount Turner, northern Queensland, Australia: (a) observed data (*) and the best-fitted calculated apparent resistivity curve (>68.27 % PDF); (b) the 1D mean model (>68.27 % PDF) for the true model (black), vPSOGWO (red), GWO (blue), and PSO (green).

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f13

Figure 13. Five-layer field data: (a) observed data (*) and the best-fitted calculated apparent resistivity curve (>68.27 % PDF); (b) the 1D mean model (>68.27 % PDF) for the true model (black), vPSOGWO (red), GWO (blue), and PSO (green).

https://npg.copernicus.org/articles/31/7/2024/npg-31-7-2024-f14

Figure 14. Six-layer field data over Keshiari (Kharagpur), India: (a) observed data (*) and the best-fitted calculated apparent resistivity curve (>68.27 % PDF); (b) the 1D mean model (>68.27 % PDF) for the true model (black), vPSOGWO (red), GWO (blue), and PSO (green).

3.3 Variable-weight hybrid PSOGWO (vPSOGWO)

Despite the usefulness of the PSO technique in achieving successful results for real-world problems, it tends to fall into local minima, which keeps the solution away from the global minimum. This tendency to stagnate in local minima is countered by the explorative ability of the GWO algorithm. Therefore, the variable-weight hybrid PSOGWO, known as vPSOGWO, fuses the exploitation potential of PSO with the exploration potential of GWO, so that each compensates for the other's weakness through the implementation of a varying weight. Because two distinct variants run together to solve the problem, this hybrid vPSOGWO is called a coevolutionary hybrid algorithm. The encircling behavior of each wolf is updated by the following:

\begin{matrix} (14) & d_{α, β, δ} = | c_{1, 2, 3} \times x_{α, β, δ} - w \times x |, \\ (15) & \text{where } w = w_{\max} - (w_{\max} - w_{\min}) \times t / l . \end{matrix}

Here, w_max=0.9 and w_min=0.2 were found to be the most appropriate values after tuning for our study.

The best location/position (geophysical model) for alpha, beta, and delta wolves in each iteration is given by x_α, x_β, and x_δ, respectively.

\begin{matrix} (16) & x_{1, 2, 3} = | x_{α, β, δ} - a_{1, 2, 3} \times d_{α, β, δ} |, \end{matrix}

where

\begin{matrix} (17) & a_{1, 2, 3} = a_{1} \times (2 \times rand - 1), \\ (18) & c_{1, 2, 3} = 0.5 \ \text{(chosen after tuning)}, \\ (19) & a_{1} = 2 \times (1 - t / l) . \end{matrix}

The updated velocity and position of vPSOGWO are as follows:

\begin{matrix} (20) & \begin{aligned} v_{i} (t + 1) & = w \times v_{i} (t) + c_{1} \times rand \times (x_{1} - x_{i} (t)) \\ & \quad + c_{2} \times rand \times (x_{2} - x_{i} (t)) \\ & \quad + c_{3} \times rand \times (x_{3} - x_{i} (t)), \end{aligned} \\ (21) & x_{i} (t + 1) = x_{i} (t) + v_{i} (t + 1) . \end{matrix}

Here, the value of 1.5 is found to be more suitable for each of the coefficients (c₁, c₂, and c₃) after tuning the parameters in the present study (Roshan and Singh, 2017).
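A single vPSOGWO update built from Eqs. (14)–(21) might look like the following Python sketch. The function names `inertia_weight` and `vpsogwo_step` are hypothetical. Because the text uses the symbol c both for the encircling coefficient of Eq. (18) and for the velocity coefficients of Eq. (20), the sketch assumes they are distinct quantities and names them `c_enc` (0.5) and `c_vel` (1.5).

```python
import numpy as np

def inertia_weight(t, l, w_max=0.9, w_min=0.2):
    """Linearly decreasing weight of Eq. (15)."""
    return w_max - (w_max - w_min) * t / l

def vpsogwo_step(x, v, x_alpha, x_beta, x_delta, t, l,
                 c_enc=0.5, c_vel=1.5, rng=None):
    """One vPSOGWO update for all particles, following Eqs. (14)-(21).

    x, v  : positions and velocities, shape (n_particles, n_parameters)
    c_enc : encircling coefficient c_{1,2,3} of Eq. (18)
    c_vel : velocity coefficients c1 = c2 = c3 of Eq. (20)
    """
    rng = np.random.default_rng() if rng is None else rng
    w = inertia_weight(t, l)
    a1 = 2.0 * (1.0 - t / l)                          # Eq. (19)
    v_new = w * v                                     # inertia term of Eq. (20)
    for leader in (x_alpha, x_beta, x_delta):
        a = a1 * (2.0 * rng.random(x.shape) - 1.0)    # Eq. (17)
        d = np.abs(c_enc * leader - w * x)            # Eq. (14)
        x_k = np.abs(leader - a * d)                  # Eq. (16)
        v_new = v_new + c_vel * rng.random(x.shape) * (x_k - x)  # Eq. (20)
    return x + v_new, v_new                           # Eq. (21)
```

Compared with the plain GWO update, the leader-guided positions x₁, x₂, and x₃ are not averaged directly; instead, they act as attractors in a PSO-style velocity update whose inertia decreases from 0.9 to 0.2 over the run.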

Algorithm 1. The vPSOGWO algorithm.

Max_Iter: maximum number of iterations
Pop_no: population size
Para: number of parameters
Fitness = infinite: initial value (already set)
Lb and Ub: lower bound (Lb) and upper bound (Ub) for the different parameters
Initialize particles randomly

Procedure
for t=1 to Max_Iter do
    for i=1 to Pop_no do
        for j=1 to Para do
            Check the Lb and Ub for randomly created particles
        end for
    end for
    for i=1 to Pop_no do
        Calculate the fitness from the cost function
        Update the wolves' fitness and position
    end for
    Update a1, a, c, and w using Eqs. (15) and (17)–(19)
    for i=1 to Pop_no do
        for j=1 to Para do
            Update position of x₁, x₂, and x₃ using Eqs. (14) and (16)
            Update best particle velocity and position using Eqs. (20)–(21)
        end for
    end for
end for
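The steps of Algorithm 1 can be assembled into a compact, runnable Python sketch. It is an illustration under stated assumptions rather than the authors' code: the function name `vpsogwo` and its arguments are hypothetical, the bound check is implemented by clipping, velocities are assumed to start at zero, and the three leaders are kept as the three best models found so far.

```python
import numpy as np

def vpsogwo(cost, lb, ub, pop_no=30, max_iter=200, seed=0,
            w_max=0.9, w_min=0.2, c_enc=0.5, c_vel=1.5):
    """Minimize `cost` within [lb, ub], following the outline of Algorithm 1."""
    rng = np.random.default_rng(seed)
    lb = np.asarray(lb, dtype=float)
    ub = np.asarray(ub, dtype=float)
    para = lb.size
    x = lb + rng.random((pop_no, para)) * (ub - lb)  # random initial particles
    v = np.zeros((pop_no, para))                     # assumed zero start velocity
    lead_x = np.zeros((3, para))                     # alpha, beta, delta positions
    lead_f = np.full(3, np.inf)                      # fitness starts at infinity
    for t in range(1, max_iter + 1):
        x = np.clip(x, lb, ub)                       # enforce Lb and Ub
        for i in range(pop_no):
            f = cost(x[i])
            for k in range(3):                       # insert into the leader list
                if f < lead_f[k]:
                    lead_f[k + 1:] = lead_f[k:-1].copy()
                    lead_x[k + 1:] = lead_x[k:-1].copy()
                    lead_f[k] = f
                    lead_x[k] = x[i]
                    break
        w = w_max - (w_max - w_min) * t / max_iter   # Eq. (15)
        a1 = 2.0 * (1.0 - t / max_iter)              # Eq. (19)
        v_new = w * v
        for k in range(3):
            a = a1 * (2.0 * rng.random(x.shape) - 1.0)               # Eq. (17)
            d = np.abs(c_enc * lead_x[k] - w * x)                    # Eq. (14)
            x_k = np.abs(lead_x[k] - a * d)                          # Eq. (16)
            v_new = v_new + c_vel * rng.random(x.shape) * (x_k - x)  # Eq. (20)
        v = v_new
        x = x + v                                    # Eq. (21)
    return lead_x[0].copy(), float(lead_f[0])
```

For a resistivity inversion, `cost` would wrap a forward model and the misfit of Eq. (4), with `lb` and `ub` holding the search bounds for each layer resistivity and thickness; here, any callable that maps a parameter vector to a scalar can be used.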

Table 1. Optimization of the mean model result for three-layer synthetic resistivity sounding data.
