Bayesian inference based on algorithms: MH, HMC, MALA and Lip-MALA for prestack seismic inversion

Perez-Roa, Richard; Infante, Saba; Barragan, Gabriel; Manzanilla, Raul

doi:10.5194/npg-33-173-2026

Articles | Volume 33, issue 2

https://doi.org/10.5194/npg-33-173-2026

Articles | Volume 33, issue 2

Research article

20 Apr 2026

Research article |

| 20 Apr 2026

Bayesian inference based on algorithms: MH, HMC, MALA and Lip-MALA for prestack seismic inversion

Richard Perez-Roa, Saba Infante, Gabriel Barragan, and Raul Manzanilla

Abstract

Seismic inversion for estimating elastic properties is a key technique for reservoir characterization after drilling. The choice of inversion method strongly influences the accuracy, efficiency, and reliability of results. Bayesian inference based on Markov Chain Monte Carlo (MCMC) algorithms provides a robust framework for incorporating data uncertainty and prior geological knowledge. In this study, we compare the performance of four inversion methods – Metropolis-Hastings (MH), Hamiltonian Monte Carlo (HMC), the Metropolis-Adjusted Langevin Algorithm (MALA), and its variant Lip-MALA – in prestack seismic inversion using both synthetic models and real data from an eastern Venezuelan hydrocarbon reservoir. Results indicate that gradient-based methods (HMC, MALA, Lip-MALA) outperform MH in velocity estimation, while density inversion remains more challenging. MH and MALA achieve shorter execution times, whereas HMC and Lip-MALA improve accuracy at higher computational cost. This analysis evaluates mean values and standard deviation (SD) estimates for P-wave velocity, S-wave velocity, and density, with quality assessed through correlation metrics, objective function behavior, seismic traces, and Root Mean Square Error (RMSE). A two-dimensional inversion with real data further demonstrates algorithms performance under complex geological conditions. The findings highlight trade-offs between accuracy and efficiency, providing practical guidelines for selecting inversion method in seismic reservoir characterization.

Download & links

How to cite.

Received: 28 Aug 2024 – Discussion started: 21 Oct 2024 – Revised: 19 Dec 2025 – Accepted: 04 Apr 2026 – Published: 20 Apr 2026

1 Introduction

The accurate characterization of hydrocarbon reservoirs is fundamental for effective exploration and production decisions. This requires integrating complementary sources of information: general geological knowledge from analogous reservoirs and rock physics principles, and reservoir-specific observations from well logs, seismic surveys, and production history.

Seismic data play a central role due to their extensive spatial coverage, providing a continuous description of the subsurface beyond individual well locations. To translate seismic amplitudes into meaningful elastic and petrophysical properties, such as P-wave velocity (V_p), S-wave velocity (V_s), and density (ρ), seismic inversion is employed. This geophysical inverse problem aims to infer subsurface properties from observed seismic data, typically expressed as (Tarantola, 2005):

\begin{matrix} (1) & d_{obs} = g (m) + ξ \end{matrix}

where d_obs are the observed data, g is the forward operator linking the model parameters m to the data, and ξ represents measurement and modeling errors.

Traditional seismic inversion methods can be broadly categorized into deterministic and stochastic approaches. Deterministic methods, such as least-squares inversion (Tarantola, 2005) and gradient-based optimization (Buland and Omre, 2003), provide single best-fit models but often lack robust uncertainty quantification. Stochastic methods, including simulated annealing (Ma, 2002) and genetic algorithms, explore the model space more broadly but can be computationally intensive and may not guarantee convergence to the posterior distribution.

In prestack amplitude versus offset (AVO) inversion (Helland-Hansen et al., 1997; Ma, 2002; Buland and Omre, 2003), the problem is ill-posed: solutions are non-unique and highly sensitive to data noise and modeling errors (Landa and Treitel, 2016). The elastic parameters $(V_{p}, V_{s}, ρ)$ , from which Lamé parameters can be derived. These are sensitive to rock fluid content and saturation (Clochard et al., 2009) and can be further related to petrophysical parameters such as porosity, sand $/$ shale ratio, and gas saturation (Goodway, 2001). Accurately estimating these parameters is therefore crucial for identifying hydrocarbon accumulations.

To address these challenges, Bayesian inference has emerged as a powerful framework, as it explicitly incorporates prior geological knowledge and quantifies uncertainty through the posterior distribution. Markov Chain Monte Carlo (MCMC) methods are particularly well-suited for sampling complex, high-dimensional posterior distributions in nonlinear inverse problems (Bosch et al., 2007).

Several MCMC algorithms have been applied in geophysical inversion. The Metropolis–Hastings (MH) algorithm (Metropolis et al., 1953; Hastings, 1970) is a foundational method but can suffer from slow convergence in high-dimensional spaces. Hamiltonian Monte Carlo (HMC) (Duane et al., 1987; Neal, 2012) leverages gradient information to propose more efficient moves, making it suitable for high-dimensional problems, as demonstrated in full-waveform inversion (Gebraad et al., 2020) and acoustic inversion (de Lima et al., 2023). Langevin dynamics-based methods, such as the Metropolis-Adjusted Langevin Algorithm (MALA) (Roberts and Tweedie, 1996) and its adaptive variant Lip-MALA (Nemeth and Fearnhead, 2021; Izzatullah et al., 2021), use gradient information to guide proposals, improving mixing and convergence in complex posteriors.

Despite these advances, a systematic comparison of these MCMC algorithms in the context of prestack seismic inversion – particularly using both synthetic and real data in 1D and 2D settings – remains limited. Most studies focus on a single algorithm or synthetic cases, leaving a gap in understanding their practical performance under realistic geological complexity and computational constraints.

This study aims to fill this gap by conducting a comprehensive evaluation of four MCMC algorithms – MH, HMC, MALA, and Lip-MALA – for Bayesian prestack seismic inversion. We assess their performance in estimating elastic parameters (V_p, V_s, ρ) using both noise-free synthetic data and real data from an eastern Venezuelan reservoir. Our analysis includes:

Quantitative metrics: mean, standard deviation, correlation, and RMSE.
Computational efficiency: acceptance rates and execution times.
Convergence diagnostics: multivariate effective sample size (mESS).
Practical applicability: extension to 2D inversion with real data.

Our contributions are threefold:

We provide a benchmark comparison of MCMC algorithms in seismic inversion, highlighting trade-offs between accuracy, uncertainty quantification, and computational cost.
We demonstrate the scalability of these methods from 1D to 2D real-data scenarios, validating their robustness in geologically complex settings.
We offer practical guidelines for algorithm selection based on project requirements, data characteristics, and available computational resources.

In Prestack AVO, Bayesian approaches with MCMC have been previously applied to estimate impedance or elastic parameters (e.g., Bosch et al., 2007; Wu et al., 2019), but these studies focus on a single algorithm or mainly on synthetic cases, but we go further and test it on real data.

The remainder of this paper is organized as follows: Sect. 2 formulates the seismic inversion problem within a Bayesian framework and describes the forward modeling approach based on AVO theory. Section 3 details the theoretical background of the MCMC algorithms. Section 4 presents result from synthetic and real data experiments. Section 5 discusses the implications and practical recommendations. Section 6 concludes the study.

2 The Seismic Inversion Problem

Having established the importance of Bayesian MCMC methods for prestack seismic inversion in Sect. 1, this section provides a comprehensive formulation of the inverse problem within a rigorous statistical framework. We detail the mathematical foundations, model parameterization, forward modeling approach, and inference workflow that form the basis of our comparative study. The complete Bayesian inference flow implemented in this study is visually summarized in Fig. 1, which illustrates the steps from input data to subsequent uncertainty quantification.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f01

Figure 1Bayesian seismic inversion workflow implemented in this study, integrating prior information, seismic data, forward modelling, MCMC sampling, and posterior analysis to produce probabilistic estimates of elastic parameters with quantified uncertainties.

Download

2.1 Bayesian Formulation of the Inverse Problem

Within the Bayesian paradigm, the solution to the inverse problem is expressed probabilistically through the posterior distribution (Tarantola, 2005; Bosch et al., 2007) p(m|d_obs), which combines prior knowledge with information from the observed data. Applying Bayes' theorem, we have:

\begin{matrix} (2) & p (m | d_{obs}) = \frac{p (d_{obs} | m) p (m)}{p (d_{obs})} \propto c L (m) ρ (m), \end{matrix}

where:

ρ(m) is the prior distribution, representing geological knowledge independent of the seismic data.
L(m) is the likelihood function, quantifying the probability of observing the data given a specific model.
c is a normalization constant.
And $p (d_{obs}) = \int p (d_{obs} | m) p (m) d m$

2.1.1 Prior Distribution Specification

The prior distribution encodes constraints on the model parameters based on independent information such as well logs, regional geology, and rock physics relationships. We employ a Gaussian prior distribution $ρ (m) \sim N (m_{prior}, C_{m})$ (Bosch, 2004; Buland and Omre, 2003):

\begin{matrix} (3) & ρ (m) = c_{1} \exp [- \frac{1}{2} {(m - m_{prior})}^{T} C_{m}^{- 1} (m - m_{prior})], \end{matrix}

where N denotes a Gaussian probability distribution, with mean $m_{prior} \in R^{N_{m}}$ , typically derived from low-frequency interpolation of well data or seismic velocities, covariance matrix given by $C_{m} \in R^{N_{m} \times N_{m}}$ , which imposes spatial correlation and smoothness constraints, effectively regularizing the ill-posed inverse problem and c₁ an appropriate normalization constant.

2.1.2 Likelihood Function and Data Errors

Assuming Gaussian-distributed errors in Eq. (1) with zero mean and covariance matrix C_d, the likelihood function takes the form:

\begin{matrix} (4) & L (m) = \exp [- \frac{1}{2} {(d_{obs} - g (m))}^{T} C_{d}^{- 1} (d_{obs} - g (m))], \end{matrix}

The likelihood function assumes Gaussian-distributed errors, consistent with standard formulations in Bayesian seismic inversion (Tarantola, 2005; Izzatullah et al., 2021), $g : m \to d_{obs}$ , being the function solving the seismic forward problem and the data covariance matrix C_d accounts for measurement noise and uncertainties in the forward model. While often assumed diagonal (uncorrelated errors), it can incorporate more complex error structures when available.

2.1.3 Posterior Distribution and MCMC Sampling

Combining Eqs. (3) and (4), the posterior distribution can be expressed through the negative log-posterior (misfit function) S(m):

\begin{matrix} (5) & \begin{aligned} S (m) & = - \log p (m | d_{obs}) \\ = \frac{1}{2} {(d_{obs} - g (m))}^{T} C_{d}^{- 1} (d_{obs} - g (m)) \\ + \frac{1}{2} {(m - m_{prior})}^{T} C_{m}^{- 1} (m - m_{prior}), \end{aligned} \end{matrix}

Thus,

\begin{matrix} (6) & p (m | d_{obs}) \propto \exp [- S (m)], \end{matrix}

In practice, the posterior distribution defined above is analytically intractable. Markov Chain Monte Carlo (MCMC) methods are therefore employed to generate samples ${\{m^{(i)}\}}_{i = 1}^{N}$ from Eq. (6), enabling comprehensive uncertainty quantification through estimation of posterior means, variances, and marginal distributions (Robert and Casella, 2004; Neal, 2012; Gebraad et al., 2020).

2.2 Model Parameterization and Forward Modelling

2.2.1 Logarithmic Parameterization

To ensure positivity of physical parameters (V_p>0, V_s>0, ρ>0) and improve numerical stability, we adopt a logarithmic parameterization. For a 1D model with N layers, the parameter vector is (Buland and Omre, 2003):

\begin{matrix} (7) & \begin{aligned} m & = [\ln (V_{p, 1}), \dots, \ln (V_{p, N}), \ln (V_{s, 1}), \dots, \\ \ln (V_{s, N}), \ln (ρ_{1}), \dots, \ln (ρ_{N})]^{T}, \end{aligned} \end{matrix}

This transformation also aligns naturally with the logarithmic form of the Aki-Richards reflectivity approximation used in our forward model.

2.2.2 AVO Forward Modeling Formulation

The forward operator g(m) implements the Aki-Richards linearized approximation for PP reflectivity. The AVO method was created in the early 1980s to analyze the amplitudes of seismic CMP gathers as a function of angle to find hydrocarbons. The Aki-Richards equation (Aki and Richards, 2002) is the foundation of AVO analysis. The original form of the equation can be rewritten for a weak-contrast interface to give (Buland and Omre, 2003; Niu et al., 2020):

\begin{matrix} (8) & R_{pp} (θ) = c_{1} (θ) \frac{Δ V_{p}}{{\overline{V}}_{p}} + c_{2} (θ) \frac{Δ V_{s}}{{\overline{V}}_{s}} + c_{3} (θ) \frac{Δ ρ}{\overline{ρ}}, \end{matrix}

where

\begin{matrix} (9) & c_{1} (θ) = \frac{1}{2} (1 + \tan^{2} θ), \\ (10) & c_{2} (θ) = - 4 \frac{{\overline{V}}_{s}}{{\overline{V}}_{p}} \sin^{2} θ, \\ (11) & c_{3} (θ) = \frac{1}{2} (1 - 4 \frac{{\overline{V}}_{s}}{{\overline{V}}_{p}} \sin^{2} θ), \end{matrix}

In Eqs. (8)–(11), the incident angle θ is the angle at which a wave reflected off a surface. V_p, V_s and ρ represent the velocities of P-waves, S-waves, and the density of a material, respectively. ΔV_p, ΔV_s and Δρ are the changes in V_p, V_s and ρ across a reflective interface. $\overline{V_{p}}$ , $\overline{V_{s}}$ and $\overline{ρ}$ are the average values of V_p, V_s and ρ, respectively.

To obtain the seismic trace for a certain theta angle we can use the approximation for small reflectivity (Russell et al., 2006),

\begin{matrix} (12) & \begin{aligned} T (θ) & = \frac{1}{2} c_{1} W (θ) D L_{V_{p}} + \frac{1}{2} c_{2} W (θ) D L_{V_{s}} \\ + \frac{1}{2} c_{3} W (θ) D L_{ρ}, \end{aligned} \end{matrix}

where $L_{V_{p}} = \ln (V_{p})$ , $L_{V_{s}} = \ln (V_{s})$ , L_ρ=ln (ρ), W is the wavelet matrix and D is the derivative matrix. Equation (12) can be implemented in matrix form as

\begin{matrix} (13) & \begin{aligned} [\begin{array}{c} T (θ_{1}) \\ T (θ_{2}) \\ ⋮ \\ T (θ_{n}) \end{array}] = \\ \frac{1}{2} [\begin{array}{ccc} c_{1} W (θ_{1}) D & c_{2} W (θ_{1}) D & c_{3} W (θ_{1}) D \\ c_{1} W (θ_{2}) D & c_{2} W (θ_{2}) D & c_{3} W (θ_{2}) D \\ ⋮ & ⋮ & ⋮ \\ c_{1} W (θ_{n}) D & c_{2} W (θ_{n}) D & c_{3} W (θ_{n}) D \end{array}] [\begin{array}{c} L_{V_{p}} \\ L_{V_{s}} \\ L_{ρ} \end{array}] \end{aligned} \end{matrix}

In discrete matrix form, for multiple angles $\{θ_{1}, θ_{2}, \dots, θ_{n}\}$ , the complete forward modeling operation can be expressed compactly as:

\begin{matrix} (14) & d_{syn} = g (m) = G m, \end{matrix}

where:

\begin{matrix} (15) & \begin{aligned} d_{syn} = [\begin{array}{c} T (θ_{1}) \\ T (θ_{2}) \\ ⋮ \\ T (θ_{n}) \end{array}]; \\ G = \frac{1}{2} [\begin{array}{ccc} c_{1} W (θ_{1}) D & c_{2} W (θ_{1}) D & c_{3} W (θ_{1}) D \\ c_{1} W (θ_{2}) D & c_{2} W (θ_{2}) D & c_{3} W (θ_{2}) D \\ ⋮ & ⋮ & ⋮ \\ c_{1} W (θ_{n}) D & c_{2} W (θ_{n}) D & c_{3} W (θ_{n}) D \end{array}] \end{aligned} \end{matrix}

and m was defined in Eq. (7). For how to construct W and D, known as the wavelet matrix and the derivative matrix respectively, the reader is advised to review Hampson et al. (2005).

2.3 Inference Workflow and Uncertainty Quantification

The complete Bayesian inversion workflow implemented in this study is visually summarized in Fig. 1. The process integrates prior geological knowledge, seismic data, physical modelling, and statistical sampling to produce probabilistic estimates of subsurface elastic properties.

Bayesian inversion process implemented in this study follows a systematic workflow:

Prior Model Construction: Define m_prior and C_m using available well-log data, geological interpretation, and spatial correlation models.
Data Preparation and Uncertainty Quantification: Extract angle-stacked seismic traces d_obs, estimate corresponding wavelets for multiple angles $\{θ_{1}, θ_{2}, \dots, θ_{n}\}$ , and characterize data uncertainties through the covariance matrix C_d
Forward Operator Assembly: Construct the matrix G according to Eqs. (8)–(15) to implement the forward modelling operator g(m).
MCMC Sampling: Apply one of the four sampling algorithms (MH, HMC, MALA, or Lip-MALA) to generate a Markov chain of model samples from the posterior distribution p(m|d_obs)
Posterior Analysis: Compute statistical summaries (posterior means, standard deviations, correlation coefficients, RMSE) and marginal probability distributions from the sampled models to assess the inverted elastic parameters and their associated uncertainties.

This comprehensive framework provides a rigorous approach to seismic inversion where solutions are characterized not as single models but as probability distributions that fully quantify uncertainty, directly addressing the ill-posed nature of the inverse problem.

3 Theoretical Background for Metropolis-Hastings, Hamiltonian Monte Carlo and Langevin Diffusion

This section presents the theoretical foundations of the four MCMC algorithms compared in this study. A visual summary of their computational workflows and key characteristics is provided in Fig. 2, followed by detailed mathematical descriptions and implementation considerations.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f02

Figure 2Comparative schematic of MCMC algorithms evaluated in this study. All methods generate samples from the posterior distribution but differ in proposal mechanisms, gradient requirements, and parameter tuning strategies.

Download

3.1 Algorithm Overview and Comparative Framework

Figure 2 provides a schematic comparative view of the four MCMC algorithms, highlighting their proposal mechanisms, gradient requirements, and tuning strategies. All methods share the common goal of generating samples from the posterior distribution p(m|d_obs) but employ different strategies for proposing and accepting new states.

3.2 Metropolis–Hastings Algorithm

MH algorithm is one of the most widely used MCMC methods for sampling from complex probability distributions. Originally proposed by Metropolis et al. (1953) and generalized by Hastings (1970), MH defines a transition probability that ensures ergodicity, detailed balance, and reversibility of the chain (Chib and Greenberg, 1995).

3.2.1 Mathematical Formulation

Given the current state m^(t) at iteration t, MH proposes a new candidate m^∗ from a proposal distribution q(m^∗|m^(t)). The candidate is accepted with probability:

\begin{matrix} (16) & α_{MH} = min \{1, \frac{L (m^{*})}{L (m^{(t)})} \cdot \frac{q (m^{(t)} | m^{*})}{q (m^{*} | m^{(t)})}\}, \end{matrix}

where L(⋅) represents the likelihood function (Eq. 4). For symmetric proposal distributions where $q (m^{*} | m^{(t)}) = q (m^{(t)} | m^{*})$ , this simplifies to the Metropolis acceptance ratio.

Implementation Details

In our seismic inversion context, we employ a Gaussian random walk proposal:

\begin{matrix} (17) & q (m^{*} | m^{(t)}) = N (m^{*} | m^{(t)}, σ^{2} I), \end{matrix}

Where σ is a tuning parameter controlling the proposal step size. The algorithm proceeds as follows:

Step 1. Initialization: Start with an initial model m⁽⁰⁾=m_prior and set t=0.
Step 2. Propose a new state: Generate $m^{*} \sim N (m^{(t)}, σ^{2} I)$ .
Step 3. Compute Acceptance: Calculate α_MH using Eq. (16).
Step 4. Accept/Reject: Draw $u \sim Uniform (0, 1)$ . If u≤α_MH, set $m^{(t + 1)} = m^{*}$ ; otherwise $m^{(t + 1)} = m^{(t)}$
Step 5. Iterate: Increment t and repeat from step 2 until convergence.

3.2.2 Tuning Considerations and Limitations

The performance of MH critically depends on the choice of σ. Optimal scaling theory suggests aiming for acceptance rates of approximately 23.4 % for random walk proposals in high dimensions (Roberts et al., 1997). In our implementation, σ was tuned during a 1000-iteration burn-in period to achieve this target.

While MH is conceptually simple and requires no gradient computations, its random walk behavior leads to slow exploration of high-dimensional parameter spaces. The autocorrelation between samples remains high, requiring longer chains to achieve effective sample sizes comparable to gradient-based methods.

3.3 Hamiltonian Monte Carlo (HMC)

HMC is a sampling algorithm that was first introduced in molecular dynamics (Duane et al., 1987) and later adapted to Bayesian inference problems (Neal, 2012; Fichtner and Zunino, 2019). It is particularly effective for high-dimensional posteriors when gradient information of the σ is available and it is more efficient than standard MH. It introduces an auxiliary momentum variable to simulate Hamiltonian dynamics in an augmented parameter space.

3.3.1 Hamiltonian Dynamics Formulation

HMC is an MCMC algorithm that uses classical Hamiltonian mechanics (Landau and Lifshitz, 1976) to sample from an arbitrary n-dimensional probability density function (PDF). HMC introduces momentum variables $p ϵ R^{N_{m}}$ and samples from the joint distribution:

\begin{matrix} (18) & p (m, p) = p (m) p (p) \propto \exp (- U (m) - K (p)) \end{matrix}

where $U (m) = - \log (p (m))$ , which is defined as the negative logarithm of the PDF (Gebraad et al., 2020), and $K (p) = \frac{1}{2} p^{T} M^{- 1} p$ is the kinetic energy with mass matrix M.

The Hamiltonian equations of motion are:

\begin{matrix} (19) & \frac{d m}{d τ} = \frac{\partial H}{\partial p} = M^{- 1} p, \frac{d p}{d τ} = - \frac{\partial H}{\partial m} = - \nabla U (m) \end{matrix}

where $H (m, p) = U (m) + K (p)$ is the Hamiltonian and τ is fictitious time.

3.3.2 Leapfrog Integration and Algorithm

Since analytical solutions to Eq. (19) are unavailable for nonlinear problems, HMC employs the symplectic leapfrog integrator with step size ϵ and L steps. For our seismic inversion, the algorithm proceeds as (Gebraad et al., 2020):

Step 1. Start with an initial model m⁽⁰⁾=m_prior, random momenta p⁽⁰⁾∼N(M). and set t=0.
Step 2. Evaluate the Hamiltonian H(m^(t)p^(t)) starting with t=0.
Step 3. Leapfrog integration (repeated L times): Calculate:
$\begin{matrix} (20) & p^{*} = p^{(t)} - \frac{ϵ}{2} \nabla U (m^{(t)}) \\ (21) & m^{*} = m^{(t)} + ϵ p^{*} \\ (22) & p^{*} = p^{*} - \frac{ϵ}{2} \nabla U (m^{*}) \end{matrix}$
Step 4. Calculate the Hamiltonian $H (m^{*} p^{*})$ .
Step 5. Compute Acceptance: Calculate α_HMC using:
$\begin{matrix} (23) & α_{HMC} = min (1, \exp (H (m^{(t)}, p^{(t)}) - H (m^{*}, p^{*}))), \end{matrix}$
Step 6. Accept/Reject: Draw $u \sim Uniform (0, 1)$ . If u≤α_HMC, set $m^{(t + 1)} = m^{*}$ and $H (m^{(t + 1)}, p^{(t + 1)}) = H (m^{*} p^{*})$ ; otherwise $m^{(t + 1)} = m^{(t)}$ and $p^{(t + 1)} \sim N (0, M)$
Step 7. Iterate: Increment t and repeat from step 2 until convergence.

The acceptance rate of the leapfrog integration algorithm commonly used in Step 3 is largely influenced by how well it conserves energy in the trajectory. If the time steps are too large or the gradients of the fitting function are incorrectly calculated, the algorithm will save less energy, and the acceptance rate will decrease. Simply put, the leapfrog integration algorithm works by bouncing model parameters back and forth across the simulated energy landscape. The acceptance rate determines how often the algorithm accepts a new proposed model parameter. If the time steps are too large or the gradients are calculated incorrectly, the algorithm cannot follow the energy landscape accurately and will likely reject the proposed model parameters. This leads to slower convergence and increases computational cost.

3.3.3 Parameter Tuning for Seismic Inversion

HMC requires careful tuning of two parameters: the step size ϵ and trajectory length L. We employed a dual adaptation strategy:

Step size (ϵ): Tuned during burn-in to achieve acceptance rates optimal for HMC.
Trajectory length (L): Fixed at L=10 based on preliminary experiments showing adequate exploration without excessive computational cost.

3.4 Langevin Diffusion-Based Algorithms

3.4.1 Langevin Dynamics

Langevin dynamics are a mathematical model of Brownian motion, named after the French physicist Paul Langevin (Lemons and Gythiel, 1997) who developed them in 1908. Langevin dynamics is a simplification of Albert Einstein's approach to Brownian motion, which is based on Newton's second law of motion. The Langevin dynamics for target distribution σ(m_t), is a continuous-time stochastic process ${(m_{t})}_{t \geq 0}$ in $R^{N_{m}}$ that evolves following the stochastic differential equation (SDE) (Roberts and Stramer, 2002; Nemeth et al., 2016; Izzatullah et al., 2021) and (Sánchez et al., 2016; Infante et al., 2019):

\begin{matrix} (24) & d m (τ) = - Σ \nabla \log [p (m (τ))] d τ + \sqrt{2} Σ^{\frac{- 1}{2}} d W (τ) \end{matrix}

where W(τ) is a standard n-dimensional Brownian motion, Σ is a symmetric positive definite matrix (in this paper we use Σ=I), ∇log [σ(m_t)] is the drift term of the Brownian particle m_t and σ(m_t) is a stationary posterior distribution.

3.4.2 Metropolis-adjusted Langevin algorithm (MALA)

In the practice, a standard approach is to discretize the Eq. (13) using the Euler-Maruyama discretization (Stuart et al., 2004) and we obtained the Unadjusted Langevin algorithm (ULA) given by

\begin{matrix} (25) & \begin{aligned} m^{*} = m^{(t)} + τ Σ \nabla \log [p (m^{(t)})] + \sqrt{2 τ} Σ^{\frac{- 1}{2}} ϵ_{t}, \\ ϵ_{t} \sim N (0, I_{n \times n}) \end{aligned} \end{matrix}

where τ is the step-length for each iteration. ULA is simple in its implementation, yet it introduces a bias, then we need to introduce the acceptance-rejection step through the MH algorithm. By introducing MH algorithm into ULA, we will obtain the Metropolis-Adjusted Langevin algorithm (MALA), (Izzatullah et al., 2020, 2021). The procedure consists of constructing a Markov chain at each step t, given m^(t), a new observation is generated from a proposal density q(m^∗). The candidate is then accepted with probability α_MALA given by,

\begin{matrix} (26) & α_{MALA} = min \{1, \frac{L (m^{*})}{L (m^{(t)})} \cdot \frac{q (m^{(t)} | m^{*})}{q (m^{*} | m^{(t)})}\}, \end{matrix}

In summary, MALA algorithm is obtained as follows:

Step 1. Choose an initial solution m⁽⁰⁾=m_prior and the discretization step-length τ.
Step 2. Draw $ϵ_{t} \sim N (0, I_{n \times n})$ and simulate a new sample from the Langevin diffusion (Eq. 25).
Step 3. Compute the accept-reject probability α_MALA using Eq. (26).
Step 4. Draw u from a uniform distribution $u \sim Uniform (0, 1)$ , if α_MALA<u set $m^{(t + 1)} = m^{*}$ ; otherwise $m^{(t + 1)} = m^{(t)}$
Step 5. Then, repeat this process until the convergence.

3.4.3 MALA with locally Lipschitz adaptive step size (Lip-MALA)

In the MALA algorithm, it is required to calibrate the step-size τ, because τ must decrease with dimension, n. then τ can be turned such that the MCMC achieve better mixing performance. An extension of ULA and similar in spirit with Stochastic Gradient Langevin Dynamics algorithm proposed by Welling and Teh (2011) by suppressing the MH acceptance steps. See in (Izzatullah et al., 2021) proposed ULA variant, Lip-MALA, in which the step-length τ is adapted according to the Lipschitz condition,

\begin{matrix} (27) & τ_{t} = \frac{1}{2} \frac{| | m^{(t)} - m^{(t - 1)} | |_{2}}{| | \nabla \log [p (m^{(t)})] - \nabla \log [p (m^{(t - 1)})] | |_{2}} \end{matrix}

The general steps for Lip-MALA MCMC with locally Lipschitz adaptive step size are:

Step 1. Choose an initial solution m⁽⁰⁾=m_prior, the discretization step-length τ, $β_{0} = + \infty$ and $L_{c} = N_{m}^{- 1 / 3}$
Step 2. Draw $ϵ_{t} \sim N (0, I_{n \times n})$ and simulate a new sample from the Langevin diffusion (Eq. 25).
Step 3. Compute the accept-reject probability α_MALA using Eq. (26).
Step 4. Draw u from a uniform distribution $u \sim Uniform (0, 1)$ , if α_MALA<u set $m^{(t + 1)} = m^{*}$ , then update,
$\begin{matrix} (28) & τ_{t} = min \{\begin{array}{l} \sqrt{1 + β_{t - 1}} τ_{t - 1}, \\ L_{c} \frac{| | m^{(t)} - m^{(t - 1)} | |_{2}}{| | \nabla \log [p (m^{(t)})] - \nabla \log [p (m^{(t - 1)})] | |_{2}} \end{array}\} \\ (29) & β_{t} = \frac{τ_{t}}{τ_{t - 1}} \end{matrix}$
Step 5. Else reject $m^{(t + 1)} = m^{(t)}$ Then, repeat this process until the convergence.

The resulting Lip-MALA algorithm follows the same steps as MALA but dynamically adjusts ϵ, improving stability and sampling efficiency in high dimensions.

3.5 Algorithm Comparison and Tuning Considerations

Table 1 summarizes the key characteristics, advantages, and limitations of the four algorithms, supported by theoretical and empirical studies from the literature.

Table 1Comparative analysis of MCMC algorithms for seismic inversion.

Download Print Version | Download XLSX

3.5.1 Practical Implementation Notes

For our seismic inversion experiments, all algorithms were implemented with the following considerations:

Gradient Computation: The gradient $\nabla \log p (m) = - \nabla S (m)$ was computed analytically, ensuring computational efficiency.
Step Size Tuning: Each algorithm's step size parameter was tuned during a preliminary burn-in phase to achieve target acceptance rates.
Convergence Diagnostics: We employed diagnostics using Effective Sample Size (ESS) and multivariate ESS (Vats et al., 2019)
Computational Resources: All experiments were conducted on a workstation with Intel Core i9-10900K, 64GB RAM, without GPU acceleration. Execution times were measured as wall-clock time for fair comparison.

The following sections present application of these algorithms to synthetic and real seismic data, evaluating their performance in terms of accuracy, uncertainty quantification, and computational efficiency.

4 Results

4.1 Synthetic test

We test our algorithms using noise-free synthetic seismic traces that were obtained from real data of V_p, V_s and ρ for which synthetic seismic traces were generated from the Eq. (28) for the angles θ₁=9°, θ₂=18.5° and θ₃=27.5° and these synthetic seismic traces will be our observed data. We ran the sampling algorithms described in Sect. 3, producing a large chain of realizations, starting from a prior model configuration corresponding to a low frequency model of V_p, V_s and ρ.

Figure 3 shows the objective function variation curves for the different sampling algorithms. Each iteration involves randomly perturbing the velocities and density of a subset of layers and recalculation of seismic traces. The vertical axis represents the objective function calculated using Eq. (5). The horizontal axis shows the number of steps in the Markov chain, each associated with an accepted or rejected perturbation of the velocity and density configuration. The first stage of the chain, associated with the initial configuration and large residues, is called the burn-in or warm-up stage. After subtracting the residuals, the model realizations of velocities and densities satisfactorily explain the seismic data within the data errors. This is called the sampling phase. Realizations produced during the sampling phase are treated as samples from the probability density.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f03

Figure 3Progress with iterations in the MH (blue line), HMC (red line), MALA (green line) and Lip-MALA (black line) sampling algorithms for synthetic test.

Download

The model settings were modified during the sampling phase, but remain within the probability function, as shown in Fig. 4. Figure 5 shows all realizations taken (gray area) in the chain sampling phase for the different algorithms tested in this work, all adjusting the observed seismic data and within the uncertainties of the data. These realizations indicate the features and variability of the velocities and density. Table 2 shows the statistical parameters of mean and standard deviation (SD) which we will compare then with the data obtained from the inference in the different algorithms used.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f04

Figure 4True data (black line), prior model (blue line), and accepted realizations of the model (gray) for the synthetic test where (a) MH, (b) HMC, (c) MALA, and (d) Lip-MALA.

Download

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f05

Figure 5Observed seismic data (black line) and seismic traces obtained from the accepted model realizations (gray) for the synthetic test where (a) MH, (b) HMC, (c) MALA, and (d) Lip-MALA.

Download

Table 2Mean and Standard deviation of elastic parameters used in the synthetic test.

Download Print Version | Download XLSX

Our chain sampling phase yielded 10 000 realizations. From these realizations, we calculated the expected values and marginal probabilities of P-wave and S-wave velocities and density as a function of two-way reflection time. These calculations were based on averaging the model performances over the sampling phase. Figure 6 presents the marginal cumulative probability distributions for P and S wave velocities and density, as estimated by the inversion, along with the actual P and S wave velocities and density of the synthetic test. The figure demonstrates the successful prediction of the actual values for all tested algorithms, accurately identifying the main stratification characterized by high and low velocities and the corresponding high and low density.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f06

Figure 6Marginal cumulative probability distributions (color map), true data (black line) and seismic inversion model result (red line) for the synthetic test where (a) MH, (b) HMC, (c) MALA, and (d) Lip-MALA.

Download

Table 3 summarizes the performance of the different algorithms tested in predicting P-wave and S-wave velocities and density. The mean, Standard Deviation (SD), correlation, and Root Mean Squared Error (RMSE) are presented for each parameter.

Table 3Statistical parameters for the results obtained for algorithms tested for the synthetic test.

Download Print Version | Download XLSX

The mean and standard deviation values indicate that the predicted values are closely aligned with the true values. Regarding correlation, MH exhibits the lowest correlation for velocity prediction, while HMC achieves the highest. For density prediction, MH and HMC show correlations below 0.29, while MALA and Lip-MALA achieve correlations above 0.60.

In terms of RMSE, MH demonstrates the highest error for velocity prediction, while HMC achieves the lowest. For density prediction, MH and HMC exhibit errors above 75.75, while MALA and Lip-MALA maintain errors below 51.04.

Table 4 presents various performance parameters, including acceptance rate and total execution time. Lip-MALA exhibits the highest acceptance rate, while HMC exhibits the lowest. Conversely, MH boasts the lowest total execution time, while HMC demonstrates the highest.

Table 4Other parameters for synthetic test.

Download Print Version | Download XLSX

Finally, the convergence of the samples was analyzed a posteriori of the unknown parameters (seismic data parameters) m obtained from the different algorithms used. The multivariate effective sample size (mESS) statistic was used. The mESS is a measure that determines the size of an independent and identically distributed sample with the same covariance structure as the sample obtained from an MCMC method for the multivariate case- If we want to know if the chain converges by we can calculate minimum effective sample size (minESS) so that if mESS > minESS we say that the chain converges, if the reader is recommended to review Vats et al. (2019) to delve deeper into the convergence test used in this work. Table 5 shows the summary of mESS and minESS obtained for each method.

Table 5Convergence test for synthetic data.

Download Print Version | Download XLSX

4.2 Application to real data

To demonstrate the effectiveness of the algorithms, we applied them to a real dataset of an oil field in eastern Venezuela. The site is located in a formation dominated by clastic rocks, a type of sedimentary rock characterized by alternating layers of sand and shale. The fluids in the pore spaces of these rocks are brine water and oil, without gas. As a preliminary step, we upscaled the P-wave and S-wave velocities obtained from well log data to the corresponding seismic scale using a bandpass filter. This process ensures that the velocity data is consistent with the frequency range of seismic waves. Table 6 presents the descriptive statistics, including mean and standard deviation (SD), for the real data. These values will serve as a baseline for comparison with the results obtained from the inference procedures employed by the various algorithms under consideration.

Table 6Mean and Standard deviation of elastic parameters used for real data.

Download Print Version | Download XLSX

The seismic traces were obtained from partial stacks for the angles θ₁=19°, θ₂=24°, and θ₃=29°. Utilizing V_p, V_s and ρ logs in seismic scale and wavelets were extracted from the partial stacked seismic data using the frequency content of the data, the synthetic trace was generated using Eq. (13). The synthetic trace obtained was correlated with observed traces for seismic well tie (see Fig. 7) obtaining a correlation value of 0.55. The sampling algorithms described in Sect. 3 were implemented, generating a large chain of realizations starting from a prior model configuration corresponding to a low-frequency model of V_p, V_s and ρ.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f07

Figure 7Seismic well tie for real data used.

Download

As depicted in Fig. 8, the objective function variation curves for each sampling algorithm are presented. During each iteration, a subset of layers undergoes a random perturbation of their velocities and density, followed by a recalculation of the seismic trace. The objective function, calculated using Eq. (5), is represented on the vertical axis, while the horizontal axis represents the number of steps in the Markov chain. Each step corresponds to an accepted or rejected perturbation of the velocities and density configuration.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f08

Figure 8Progress with iterations in the MH (blue line), HMC (red line), MALA (green line) and Lip-MALA (black line) sampling algorithms for real data.

Download

The model settings were adjusted during the sampling phase, ensuring they remained within the probability function (Fig. 9). Figure 10 illustrates all realizations sampled (gray area) in the chain sampling phase for the various algorithms tested in this study, all of which align with the observed seismic data and fall within the data's uncertainty bounds. These realizations highlight the characteristics and variability of the velocities and density.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f09

Figure 9True data (black line), prior model (blue line), and accepted realizations of the model (gray) for real data where (a) MH, (b) HMC, (c) MALA, and (d) Lip-MALA.

Download

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f10

Figure 10Observed seismic data (black line) and seismic traces obtained from the accepted model realizations (gray) for real data where (a) MH, (b) HMC, (c) MALA, and (d) Lip-MALA.

Download

Employing a chain sampling scheme, we generated 9000 realizations from which we extracted the expected values and marginal probabilities of P-wave and S-wave velocities and density, all as functions of two-way reflection time. These calculations were derived by averaging the model performances across the sampling phase. Figure 11 depicts the marginal cumulative probability distributions for P and S wave velocities and density, as inferred from the inversion process, alongside the actual P and S wave velocities and density of the synthetic test.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f11

Figure 11Marginal cumulative probability distributions (color map), true data (black line) and seismic inversion model result (red line) for real data where (a) MH, (b) HMC, (c) MALA, and (d) Lip-MALA.

Download

Table 7 summarizes the performance of the tested algorithms in predicting P-wave and S-wave velocities and density. The mean, Standard Deviation (SD), correlation, and Root Mean Squared Error (RMSE) are presented for each parameter. The predicted values closely align with the true values as evidenced by the mean and standard deviation values. MH exhibits the lowest correlation for velocity prediction, while Lip-MALA achieves the highest. For density prediction, MH and HMC show correlations below 0.28, while MALA and Lip-MALA achieve correlations above 0.48. MH demonstrates the highest error for velocity prediction, while Lip-MALA achieves the lowest. For density prediction, MH and HMC exhibit errors above 151.41, while MALA and Lip-MALA maintain errors below 122.22.

Table 7Statistical parameters for the results obtained for algorithms tested for real data.

Download Print Version | Download XLSX

Table 8 presents various performance parameters, including acceptance rate and total execution time. Lip-MALA exhibits the highest acceptance rate, while HMC exhibits the lowest. Conversely, MH boasts the lowest total execution time, while HMC demonstrates the highest.

Table 8Other parameters for real data.

Download Print Version | Download XLSX

And a final step, as in the synthetic data, was to test the convergence of the chains, this study employed a posteriori analysis to assess the convergence of samples obtained for the unknown seismic data parameters (denoted by m) using various algorithms. The multivariate effective sample size (mESS) statistic served as the convergence metric. The mESS quantifies the equivalent size of an independent and identically distributed (iid) sample possessing the same covariance structure as the sample generated by a Markov Chain Monte Carlo (MCMC) method in the multivariate case.

To formally determine chain convergence, a minimum effective sample size (minESS) threshold can be established. If the mESS value surpasses the minESS threshold, convergence is achieved. For a more in-depth exploration of the convergence test employed in this work, readers are referred to Vats et al. (2019). Table 9 summarizes the mESS and minESS values obtained for each method.

Table 9Convergence test for real data.

Download Print Version | Download XLSX

Table 10Posterior standard deviations (2D real data).

Download Print Version | Download XLSX

4.3 Two-dimensional test with real data

In line with the main objective of this study, which is to comparatively evaluate the performance of different MCMC algorithms applied to prestack seismic inversion in geological contexts of varying complexity, this section includes an additional test using real data in a two-dimensional setting. This extension allows us to validate the scalability and robustness of the methods beyond idealized one-dimensional cases, providing evidence of their practical applicability in environments with greater structural and lithological heterogeneity, typical of exploration in real reservoirs.

The study area corresponds to a sector of the Eastern Basin of Venezuela, characterized by clastic lithology with alternating sandstones and shales. The seismic data used was acquired using conventional reflection techniques and subsequently reprocessed to generate prestack gathers organized by incidence angle. Three partial stacks were selected, corresponding to central angles of approximately 19, 24, and 29°, and used in the inverse process.

The two-dimensional inversion grid was designed with a vertical resolution of 351 samples, corresponding to the two-way time axis from 850 to 1550 ms, and a horizontal extent of 136 cells from 0 to 340 m. Three fundamental elastic parameters were considered: V_p, V_s and ρ. The prior model was built by integrating the structural interpretation of horizons and faults obtained from seismic data, together with low-frequency interpolation of the elastic properties V_p, V_s and ρ. derived from the well logs available in the study area. This approach allowed the establishment of a consistent geological model that served as an initial reference for the Bayesian inversion process.

The four MCMC sampling algorithms analyzed were applied in 1D: MH, HMC, MALA, and Lip-MALA. The resulting realizations were used to calculate the posterior marginal distributions for each parameter, as well as their summary statistics (mean and standard deviation, Table 10).

Quantitative analysis of the Figs. 12 to 15 reveals important differences between the algorithms. The HMC method presents the lowest standard deviations for all three parameters, indicating greater accuracy and lower uncertainty in the estimation, although at the cost of higher computational costs. MALA and Lip-MALA also offer robust results, with Lip-MALA demonstrating a better compromise between accuracy and computational efficiency, especially in density estimation. In contrast, the MH algorithm shows the greatest uncertainties, confirming its limitations in contexts of higher dimensionality and complexity.

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f12

Figure 12Inverted models of V_p, V_s and ρ obtained using the MH algorithm.

Download

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f13

Figure 13Inverted models of V_p, V_s and ρ obtained using the HMC algorithm.

Download

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f14

Figure 14Inverted models of V_p, V_s and ρ obtained using the MALA algorithm.

Download

https://npg.copernicus.org/articles/33/173/2026/npg-33-173-2026-f15

Figure 15Inverted models of V_p, V_s and ρ obtained using the Lip-MALA algorithm.

Download

From an applied perspective, these results are relevant for choosing the most appropriate method in exploratory contexts. In situations where high accuracy is required and sufficient computational resources are available, HMC may be the preferred option. On the other hand, in scenarios where computational time is a constraint, methods such as Lip-MALA offer an efficient and stable alternative. Density (ρ) remains the most challenging parameter to accurately recover, especially for MH and HMC, which is consistent with the results obtained in the one-dimensional case.

Finally, the incorporation of this two-dimensional test with real data demonstrates the practical applicability of the evaluated algorithms, not only under controlled conditions but also in situations representative of real-world geology. Furthermore, it reinforces the validity of the study's overall conclusions by confirming that gradient-based algorithms are more appropriate for complex inverse scenarios, and that algorithm selection must consider both the level of uncertainty and computational cost.

5 Discussion

This study presents a comparative analysis of Markov Chain Monte Carlo (MCMC) methods for estimating elastic properties from seismic amplitudes. We demonstrate the application of these methods in a field case, employing the following assumptions: (1) a one-dimensional reservoir model represented by stacked seismic traces, (2) seismic data simulation using the small reflectivity approximation, and (3) the Aki-Richards equation for weak contrast to establish the relationship between seismic data and elastic parameters. Notably, the proposed general formulation transcends these assumptions, allowing for the integration of more sophisticated seismic simulation techniques and comprehensive petrophysical models within a similar framework. Why does Lip-MALA yield smaller errors in the theoretical (1D) test while HMC performs better in the two-dimensional real case? In our experiments, two factors explain this: (i) the evaluation metric and (ii) the dimensionality/structure of the posterior.

i.
Metric. In 1D (synthetic and real), we compared primarily RMSE along the trace; Lip-MALA, by enforcing a Lipschitz-based step control, stabilizes local moves and reduces pointwise error (RMSE) in V_p and V_s. In 2D, we mainly reported posterior SD per parameter; HMC, via long Hamiltonian trajectories, explores connected valleys of the posterior more efficiently and achieves lower global uncertainty (SD), even if RMSE 2D is not emphasized in the main text.
ii.
Dimensionality/heterogeneity. In real 2D, lateral couplings and heterogeneity induce a rough/multimodal posterior; HMC traverses basins more effectively than local Lip-MALA steps, reducing posterior variance.

The four methods studied demonstrate acceptable performance, but in-depth analysis reveals notable differences:
- –
  Velocity estimation: In both the synthetic and real-world scenarios, methods that incorporate gradient calculations (HMC, MALA, and Lip-MALA) outperform MH in estimating velocities.
- –
  Density estimation: Density estimation proves to be the most challenging parameter, with MH and HMC exhibiting unsatisfactory results. However, MALA and Lip-MALA showcase more promising performance.
- –
  Execution time: A significant difference emerges in execution time between methods. MH and MALA exhibit shorter execution times compared to HMC and Lip-MALA, which are considerably more time-consuming.

A natural progression of this research would be to invert prestack seismic data to extract additional elastic parameters and reservoir properties, revealing a more comprehensive subsurface understanding. Similarly, incorporating well log conditioning into the model holds promise, as it could enhance vertical resolution near wells and guarantee that the model aligns with well data at drilling locations.

From an applied perspective, the results obtained in this study are relevant for decision-making in real-world exploration settings. For example, in frontier areas with poor well control, the MH algorithm could be used as a rapid evaluation tool due to its low computational cost, albeit with accuracy limitations. In contrast, methods such as HMC or Lip-MALA would be more suitable for mature fields where higher fidelity in estimating elastic properties is required, despite their greater computational demand. The choice of algorithm should be guided not only by statistical metrics but also by the specific requirements of the geophysical project, the geological setting, and the time and resource constraints available.

The results obtained in this work show consistency with previous research. For example, Gebraad et al. (2020) highlights the effectiveness of the HMC algorithm in full-waveform elastic inversion problems, particularly due to its ability to efficiently explore the posterior space. However, unlike their FWI-oriented approach, our AVO inversion results indicate that Lip-MALA achieves a better balance between accuracy and computational cost, particularly in density estimation, which is crucial in clastic media with gradual transitions. Similarly, Izzatullah et al. (2021) demonstrated that Langevin dynamics-based methods, such as MALA and its adaptive variants, are more efficient in high-dimensional spaces, which is reflected in our study by better definition of lithological boundaries in inverted images. These parallels confirm that Langevin-derived methods are viable and robust options in real-world seismic scenarios where efficiency and stability are practical priorities.

Although the HMC algorithm presents significantly longer runtimes, these may be acceptable within a seismic interpretation workflow that includes validation and multidisciplinary analysis phases. In contexts where the inversion must be performed in near-real time, such as during well drilling (geosteering), methods such as MH or MALA may be more appropriate despite their lower resolution. Therefore, computation time should not be evaluated in isolation, but rather based on operational priorities, available computational resources, and the criticality of the information to be estimated at each stage of the geophysical project.

6 Conclusions

This study compares various pre-stack inversion methods under an MCMC framework for the estimation of elastic parameters. We invert pre-stacked seismic data to infer velocities (V_p and V_s) and density (ρ), which are linked to the seismic data via the Aki-Richards equation. All methods employed effectively handle the inherent uncertainties associated with seismic and elastic data.

The proposed algorithms allow estimating several important aspects of the posterior distribution, such as the means and standard deviations of the posterior parameters. We rigorously validated the algorithms by measuring the quality of the MCMC sample through correlations, plotting the objective function, seismic traces and estimating the RMSE.

The four methods evaluated in this study exhibit acceptable performance overall, but a closer examination reveals notable differences in their specific capabilities. Velocity estimation: In both the simulated and real-world scenarios, methods that leverage gradient calculations (HMC, MALA, and Lip-MALA) demonstrate superior performance in estimating velocities compared to MH. Density estimation: Density estimation poses the most significant challenge, with MH and HMC exhibiting unsatisfactory results. However, MALA and Lip-MALA demonstrate more promising performance in this area. Execution time: A clear distinction emerges in execution time between the methods. MH and MALA exhibit significantly shorter execution times compared to HMC and Lip-MALA, which are considerably more time-consuming.

Furthermore, the results of the two-dimensional test with real data showed that in situations where high accuracy is required and sufficient computational resources are available, HMC may be the preferred option. On the other hand, in scenarios where computational time is a constraint, methods such as Lip-MALA offer an efficient and stable alternative. This validation in a context closer to real geology strengthens the study's conclusions. The choice of algorithm must consider not only statistical metrics but also the geophysical context, resource availability, and project purpose. In summary we have:

Balance runtime vs accuracy: HMC yields lower SD at higher cost; Lip-MALA provides strong local accuracy efficiently.
Choose metrics deliberately: report both RMSE (fit) and SD (uncertainty) to avoid metric-induced contradictions.
Use HMC for 2D (and higher) problems to reduce global posterior SD and traverse multi-basin landscapes.
Use MALA/Lip-MALA to stabilize density estimation and reduce pointwise errors.
Use Lip-MALA when local accuracy (lower RMSE) in VP/VS is prioritized on 1D settings.

Code and data availability

The codes and data for this project are not available due to restrictions imposed by the data provider and the university that funded the project.

Author contributions

RPR and SI designed the study, performed the research, analyzed data, and wrote the paper. GB and RM contributed to refining the ideas, proof the results, carrying out additional analyses, and finalizing this paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

The authors used AI tools to improve the language, readability, and structure of the manuscript. The authors reviewed and edited the output and take full responsibility for the content of the article.

Financial support

The authors would like to thank Yachay Tech University for funding this study through the Research Project MATH24-08.

Review statement

This paper was edited by Richard Gloaguen and reviewed by Chunjie Zhang and five anonymous referees.

References

Aki, K. and Richards, P.: Quantitative Seismology, University Science Books, ISBN 978-0935702965, 2002.

Beskos, A., Pillai, N. S., Roberts, G. O., Sanz-Serna, J.-M., and Stuart, A. M.: Optimal tuning of the hybrid Monte Carlo algorithm, Bernoulli, 19, 1501–1534, https://doi.org/10.3150/12-BEJ414, 2013.

Bosch, M.: The optimization approach to lithological tomography: Combining seismic data and petrophysics for porosity prediction, Geophysics, 69, 1272–1282, https://doi.org/10.1190/1.1801944, 2004.

Bosch, M., Cara, L., Rodrigues, J., Navarro, A., and Díaz, M.: A Monte Carlo approach to the joint estimation of reservoir and elastic parameters from seismic amplitudes, Geophysics, 72, O29–O39, https://doi.org/10.1190/1.2783766, 2007.

Buland, A. and Omre, H.: Bayesian linearized AVO inversion, Geophysics, 68, 185–198, https://doi.org/10.1190/1.1543206, 2003.

Chib, S. and Greenberg, E.: Understanding the Metropolis-Hastings algorithm, Am. Stat., 49, 327–335, https://doi.org/10.2307/2684568, 1995.

Clochard, V., Delépine, N., Labat, K., and Ricarte, P.: Post-stack versus pre-stack stratigraphic inversion for monitoring purposes: a case study for the saline aquifer of the Sleipner field, SEG Technical Program Expanded Abstracts 2009, Society of Exploration Geophysicists, 2417–2421, https://doi.org/10.1190/1.3255345, 2009.

de Lima, P. D. S., Corso, G., Ferreira, M. S., and de Araújo, J. M.: Acoustic full waveform inversion with Hamiltonian Monte Carlo method, Physica A, 617, 128618, https://doi.org/10.1016/j.physa.2023.128618, 2023.

Duane, S., Kennedy, A. D., Pendleton, B. J., and Roweth, D.: Hybrid Monte Carlo, Phys. Lett. B, 195, 216–222, https://doi.org/10.1016/0370-2693(87)91197-X, 1987.

Fichtner, A. and Zunino, A.: Hamiltonian nullspace shuttles, Geophys. Res. Lett., 46, 644–651, https://doi.org/10.1029/2018GL080931, 2019.

Gebraad, L., Boehm, C., and Fichtner, A.: Bayesian elastic full-waveform inversion using Hamiltonian Monte Carlo, J. Geophys. Res.-Sol. Ea., 125, e2019JB018428, https://doi.org/10.1029/2019JB018428, 2020.

Goodway, B.: AVO and Lamé constants for rock parameterization and fluid detection, CSEG Recorder, 26, 39–60, 2001.

Hampson, D. P., Russell, B. H., and Bankhead, B.: Simultaneous inversion of prestack seismic data, SEG Technical Program Expanded Abstracts 2005, Society of Exploration Geophysicists, 1633–1637, https://doi.org/10.1190/1.2148008, 2005.

Hastings, W. K.: Monte Carlo sampling methods using Markov chains and their applications, Biometrika, 57, 97–109, https://doi.org/10.1093/biomet/57.1.97, 1970.

Helland-Hansen, D., Magnus, I., Edvardsen, A., and Hansen, E.: Seismic inversion for reservoir characterization and well planning in the Snorre field, Leading Edge, 16, 269–274, https://doi.org/10.1190/1.1437616, 1997.

Infante, S., Sanchez, L., and Hernández, A.: Stochastic models to estimate population dynamics, Statistics, Optimization & Information Computing, 7, 311–328, https://doi.org/10.19139/soic.v7i2.538, 2019.

Izzatullah, M., van Leeuwen, T., and Peter, D.: Accelerated Langevin Dynamics for Bayesian seismic inversion, J. Geophys. Res.-Sol. Ea., 125, e2019JB018428, https://doi.org/10.1029/2019JB018428, 2020.

Izzatullah, M., van Leeuwen, T., and Peter, D.: Bayesian seismic inversion: a fast sampling Langevin dynamics Markov chain Monte Carlo method, Geophys. J. Int., 227, 1523–1553, https://doi.org/10.1093/gji/ggab287, 2021.

Landa, E. and Treitel, S.: Seismic inversion: What it is, and what it is not, Leading Edge, 35, 277–279, https://doi.org/10.1190/tle35030277.1, 2016.

Landau, L. D. and Lifshitz, E. M.: Mechanics, Elsevier Science, ISBN 978-0750628969, 1976.

Lemons, D. S. and Gythiel, A.: Paul Langevin's 1908 paper “On the Theory of Brownian Motion” [“Sur la théorie du mouvement brownien,” C. R. Acad. Sci. (Paris) 146, 530–533 (1908)], Am. J. Phys., 65, 1079–1081, https://doi.org/10.1119/1.18725, 1997.

Ma, X.: Simultaneous inversion of prestack seismic data for rock properties using simulated annealing, Geophysics, 67, 1877–1885, https://doi.org/10.1190/1.1527087, 2002.

Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and Teller, E.: Equation of state calculations by fast computing machines, J. Chem. Phys., 21, 1087–1092, https://doi.org/10.1063/1.1699114, 1953.

Neal, R. M.: MCMC using Hamiltonian dynamics, in: Handbook of Markov Chain Monte Carlo, edited by: Brooks, S., Gelman, A., Jones, G. L., and Meng, X.-L., Chapman and Hall/CRC, Boca Raton, 113–162, https://doi.org/10.1201/b10905-6, 2012.

Nemeth, C. and Fearnhead, P.: Stochastic Gradient Markov Chain Monte Carlo, J. Am. Stat. Assoc., 116, 433–450, https://doi.org/10.1080/01621459.2020.1847120, 2021.

Nemeth, C., Sherlock, C., and Fearnhead, P.: Particle Metropolis-adjusted Langevin algorithms, Biometrika, 103, 701–717, 2016.

Niu, L., Geng, J., Wu, X., Zhao, L., and Zhang, H.: Data-driven method for an improved linearised AVO inversion, J. Geophys. Eng., 18, 1–22, https://doi.org/10.1093/jge/gxaa065, 2020.

Robert, C. P. and Casella, G.: Monte Carlo Statistical Methods, 2nd edn., Springer, New York, https://doi.org/10.1007/978-1-4757-4145-2, 2004.

Roberts, G. and Stramer, O.: Langevin diffusions and Metropolis-Hastings algorithms, Methodol. Comput. Appl., 4, 337–357, https://doi.org/10.1023/A:1023562417138, 2002.

Roberts, G. O. and Rosenthal, J. S.: Optimal scaling of discrete approximations to Langevin diffusions, J. R. Stat. Soc. B, 60, 255–268, https://doi.org/10.1111/1467-9868.00123, 1998.

Roberts, G. O. and Tweedie, R. L.: Exponential convergence of Langevin distributions and their discrete approximations, Bernoulli, 2, 341–363, https://doi.org/10.2307/3318418, 1996.

Roberts, G. O., Gelman, A., and Gilks, W. R.: Weak convergence and optimal scaling of random walk Metropolis algorithms, Ann. Appl. Probab., 7, 110–120, https://doi.org/10.1214/aoap/1034625254, 1997.

Russell, B., Hampson, D., and Bankhead, B.: An Inversion Primer, CSEG Recorder, 31, 96–103, 2006.

Stuart, A. M., Voss, J., and Wilberg, P.: Conditional path sampling of SDEs and the Langevin MCMC method, Commun. Math. Sci., 2, 685–697, https://doi.org/10.4310/CMS.2004.v2.n4.a7, 2004.

Sánchez, L., Infante, S., Griffin, V., and Rey, D.: Spatio-temporal dynamic model and parallelized ensemble Kalman filter for precipitation data, Braz. J. Probab. Stat., 30, https://doi.org/10.1214/15-BJPS297, 2016.

Tarantola, A.: Inverse Problem Theory and Methods for Model Parameter Estimation, Society for Industrial and Applied Mathematics, https://doi.org/10.1137/1.9780898717921, 2005.

Vats, D., Flegal, J. M., and Jones, G. L.: Multivariate output analysis for Markov chain Monte Carlo, Biometrika, 106, 321–337, https://doi.org/10.1093/biomet/asz002, 2019.

Welling, M. and Teh, Y. W.: Bayesian learning via stochastic gradient Langevin dynamics, in: Proceedings of the 28th International Conference on Machine Learning (ICML 2011), Bellevue, WA, USA, 681–688, 2011.

Wu, H., Chen, Y., Li, S., and Peng, Z.: Acoustic impedance inversion using Gaussian Metropolis-Hastings sampling with data driving, Energies, 12, 2744, https://doi.org/10.3390/en12142744, 2019.

Articles

Short summary

We explored four methods to improve how underground rock properties are estimated from seismic data. By comparing these methods on both simulated and real-world oilfield data, we found that techniques using gradient information give better accuracy but require more computing time. Our results help guide the choice of method depending on whether speed or precision is more important in subsurface exploration.