Uncertainty assessment of aquifer hydraulic parameters from pumping test data

Bashandy, Azza M.; Bekhit, Hesham M.; Radwan, Hany G.

doi:10.1007/s13201-024-02134-1

Original Article
Open access
Published: 21 March 2024

Volume 14, article number 84, (2024)

Applied Water Science

Abstract

Pumping test data are noisy, so repeating a pumping test will not reproduce the same drawdown values. As a consequence, different pumping tests lead to different estimates of the aquifer parameters. Pumping test data are usually analyzed using traditional methods (AquiferTest and AQTESOLV software), which depend on a trial-and-error technique. With these methods, non-unique values of the hydraulic parameters are selected, and these usually carry a high level of uncertainty. Uncertainty must be taken into account when determining aquifer parameters, especially when groundwater models are used to support decision makers. The main goal of this study is to build a comprehensive tool for quantifying the uncertainty associated with hydraulic parameter estimation under different pumping test conditions for fully penetrating wells in confined and semi-confined aquifers. To achieve this objective, a FORTRAN code was developed to apply the Markov Chain Monte Carlo (MCMC) algorithm using different likelihood functions (exponential, inverse, and log). The tool can be used to detect, with a high degree of confidence, the most probable range of aquifer parameters consistent with the pumping test data. The tool was successfully applied to several hypothetical cases to demonstrate the uncertainty in the quantification of aquifer parameters and to compare the findings with the results of the standard methods. The concept was also verified numerically (using the MODFLOW program), with satisfactory results, on a hypothetical case with known aquifer parameters. Finally, the tool was applied to actual pumping test data with good results.

Introduction

Accurate and reliable predictions of groundwater flow are crucial to sustainable groundwater management as human and climate pressures on groundwater resources rise (Rojas et al. 2008). Where detailed and accurate knowledge of the aquifer is lacking, rational and reliable calculation of hydrogeological parameters not only increases the precision of these studies but also decreases the time they take, further improving the efficiency of the work. In recent decades, significant attempts have been made to develop techniques for determining optimal groundwater parameters and for quantifying how uncertainty in the estimated parameters affects quantitative model predictions. The result was a number of inverse methods for groundwater models (Carrera et al. 2005). Different aquifer systems have different values of hydrogeological parameters such as transmissivity (T), which is a function of hydraulic conductivity (K), and storativity (S). These hydraulic parameters are essential for hydrogeological and environmental studies such as groundwater resource assessments and aquifer vulnerability assessments. A model may have several parameters to specify, often without sufficient knowledge, which leads to parametric uncertainty (Singh and Hespanha 2010). Stochastic uncertainty is mainly attributed to the difficulty of identifying the spatial distribution of aquifer properties. How to obtain appropriate parameters for a given modeling study has long been debated in the hydrologic community (Marshall et al. 2004).

Aquifer parameters are generally estimated from pumping test data. Analyzing a pumping test involves obtaining the parameters by trial and error, by a graphical approach (the traditional methods), or by least-squares estimation. In these approaches, the hydraulic parameters are usually considered fixed and unique. In reality, field pumping test data are noisy, meaning that no two pumping tests produce the same drawdown curve except under ideal conditions. Typically, in the analysis of a constant-rate pumping test, the data are matched to one of several standard curves according to the aquifer type. During this process, a single best match point is chosen and the hydraulic parameters of the aquifer are calculated from it. Once the best match point is obtained and the associated parameters are determined, these parameters are used in modeling without questioning or quantifying their uncertainty. For example, if this procedure is applied to actual pumping test data (for a confined aquifer, say), different choices of the best match point give different results for the same data. Figure 1 shows two such choices of match point used to determine transmissivity (T) and storativity (S); the two cases give different values of T and S.

Consequently, if one conducts several pumping tests on a well under the same conditions, different estimates of the aquifer parameters may be obtained. Using single values for the aquifer hydraulic parameters may lead to erroneous and unreliable flow modeling, and in turn unreliable transport predictions, because of the uncertainty in this estimation. To get the most out of such models for decision support, information should also be included on the uncertainties of each decision option that could influence the choice of management strategy (Borsuk et al. 2004).

In the field of groundwater modeling, Markov chain Monte Carlo (MCMC) has been used to sample the posterior distribution of uncertain aquifer permeability for a geostatistical model. An inverse model was built with the MCMC method that incorporated both direct log-hydraulic conductivity measurements and head measurements (steady state or transient) (Lu et al. 2004). Hassan et al. (2009) evaluated parameter uncertainty for a density-driven groundwater flow model using a Bayesian statistical framework and an MCMC sampling approach to address the inverse problem (Hassan et al. 2009).

Fu and Gómez (2009) described and tested a new algorithm, based on a blocking Markov chain Monte Carlo scheme, that generates stochastic log-conductivity fields conditioned on log-conductivity and piezometric head data (Fu and Gómez 2009). The approach was applied to a synthetic 2D confined aquifer with an isotropic lognormal conductivity field under a constant natural-gradient flow condition.

Jardani et al. (2012) studied the transmissivity field of a heterogeneous, semi-confined alluvial aquifer connected to a tidally influenced estuary (Jardani et al. 2012). Ward and Fox (2012) used Bayesian inference to quantify the uncertainty of aquifer parameters from pumping test data in three examples: the first for synthetic data derived from the Theis (1935) model, the second for synthetic and field data analyzed with the Boulton delayed yield model, and the last for synthetic data from the Hunt-Scott multilayer model (Ward and Fox 2012; Hunt and Scott 2007).

Bardsley and Fox (2012) investigated linear inverse problems with nonnegativity constraints and presented an MCMC approach for such problems, whose output (samples) can be used both to estimate the unknowns and to quantify the uncertainty in those estimates (Bardsley and Fox 2012). Shi et al. (2014) assessed the parameter uncertainty of groundwater simulations, with a specific focus on uranium reactive transport modeling, using the Markov chain Monte Carlo method to handle model nonlinearity and imprecise parameters (Shi et al. 2014). Zheng and Han (2016) assessed the applicability of MCMC to the uncertainty of watershed-scale water quality (WWQ) modeling (Zheng and Han 2016).

One of the main advantages of adopting the MCMC method is that it provides an estimate of the uncertainty associated with the aquifer parameters. This is especially important given the wide-ranging effects of climate change, which makes the hydrological cycle significantly more uncertain. Aquifers will be subject to variations in soil behavior, groundwater levels, and recharge rates due to changes in temperature, precipitation patterns, and streamflow (Kidmose et al. 2013, Swin et al. 2022, Osman et al. 2022). Extreme weather conditions, such as prolonged droughts, can also affect aquifer responses and add uncertainty to parameter estimation. Therefore, providing aquifer parameter ranges, the associated uncertainties, and probability distributions is of added value to the hydrological community, especially when considering climate change.

In a multi-aquifer groundwater system in China, Ha et al. (2020) investigated the applicability of various methods for calculating hydraulic parameters. A hybrid algorithm, GALMA (a genetic algorithm (GA) paired with the Levenberg–Marquardt (LM) algorithm), is used with the Neuman and Witherspoon model and ratio approach to prevent errors in the graphic-analytical process and to improve the efficiency and accuracy (Ha et al. 2020). Zhang et al. (2020) assessed the effects of the shorter pumping test time on the predicted hydraulic conductivity. The Monte Carlo approach was first used to create a heterogeneous hydraulic conductivity K field. Then, to carry out numerical pumping tests, a MODFLOW groundwater flow model was built (Zhang et al. 2020).

Singh and Tripura (2022) used pumping test analysis to estimate the hydraulic parameters in order to evaluate the formation of an aquifer system on hilly terrain using 16 bore wells (Singh and Tripura 2022). To obtain the aquifer hydraulic parameters, Li et al. (2023) created a novel dimensionless-form analytical solution for variable-rate pumping tests using piecewise-constant approximations of the varying pumping rates (Li et al. 2023). For unconfined alluvial aquifers in Iran, Dashti et al. (2023) used machine learning models to assess transmissivity from pumping test data. Pumping tests and hydrogeological data for 96 pumping wells were gathered, with 70% of the data used for training and 30% for testing (Dashti et al. 2023).

Finally, the main objective of this paper is to develop a tool to quantify the uncertainty in parameter estimation for confined and leaky aquifers (fully penetrating wells) obtained from traditional pumping test analysis using MCMC based on Bayesian inference technique.

Materials and methods

Theoretical background

The following sections discuss the main principles of the methods and techniques used in this paper.

Bayesian statistical inference

Bayesian statistical inference is an alternative to the traditional statistical assessment process. Bayesian procedures combine prior knowledge of the model parameters with the observed data through the output of the model. The result is a probability distribution over the parameter space that summarizes the uncertainty in the parameters; this is known as the posterior distribution. The choice of an informative prior should be based on existing knowledge of the parameters (Marshall et al. 2004), and optimal parameter values from previous studies can be useful for this purpose. The posterior distribution of a parameter set, ${\text{P}}\left( {{\Theta }/{\text{D}}} \right)$, may be found by applying Bayes' theorem as follows (Marshall et al. 2004):

$${\text{P}}\left( {\Theta /D} \right) = \frac{{P\left( \Theta \right)P\left( {D/\Theta } \right)}}{P\left( D \right)}$$

(1)

where ${\text{P}}\left( {\Theta /D} \right)$ is the posterior distribution, $P\left( {D/\Theta } \right)$ is the likelihood function (the probability of the data for a given set of model parameters), $P\left( D \right)$ is the probability of the observed data $\left( D \right)$, and ${\text{P}}\left( \Theta \right)$ is the prior distribution.

It should be noted that if the available data are scarce, the posterior distribution takes a form similar to the prior. However, when large data sets are available, the posterior distribution is shaped more by the data than by the prior distribution. Likewise, if the prior distribution is chosen to reflect vague prior knowledge, the new data in the sample will dominate ${\text{ P}}\left( {{\Theta }/{\text{D}}} \right)$. The posterior thus contains all the information available from prior knowledge as well as from the data gathered.
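
As a brief worked note (added for illustration, with $\Theta_{\min}$ and $\Theta_{\max}$ as hypothetical symbols for prior bounds): because $P\left( D \right)$ does not depend on $\Theta$, Eq. 1 is often used in proportional form. Under a uniform prior, of the kind assumed for the hydraulic parameters later in this paper, the posterior is then proportional to the likelihood inside the prior bounds:

$$P\left( {\Theta /D} \right) \propto P\left( \Theta \right)P\left( {D/\Theta } \right),\qquad P\left( \Theta \right) = {\text{const}}{.}\;{\text{on}}\;\left[ {\Theta_{\min } ,\Theta_{\max } } \right]\; \Rightarrow \;P\left( {\Theta /D} \right) \propto P\left( {D/\Theta } \right)$$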

Markov Chain Monte Carlo (MCMC)

MCMC is a technique for random sampling in multi-dimensional parameter spaces in which the value of the next sample depends only on the value at the current sampling point. In MCMC, the posterior distribution of the model parameters is characterized by drawing many thousands of samples and computing their empirical mean, standard deviation, and percentiles. This is similar to simple Monte Carlo simulation, except that simple Monte Carlo draws its samples from the prior distribution, which is not updated by the data (Marshall et al. 2004).

MCMC methods use either a random walk chain or an independence chain. An independence chain proposes candidate states at every step from the same probability distribution, regardless of the current state. Although many MCMC sampling algorithms exist, the Metropolis–Hastings algorithm and the Adaptive Metropolis algorithm are considered here. The Metropolis–Hastings algorithm generates a Markov chain of samples, each of which represents a perturbed step in the parameter space. The proposed parameter samples are generated from a suitable arbitrary probability distribution called the proposal density or jump distribution. The Adaptive Metropolis (AM) algorithm is a possible refinement that uses the history of the process to "tune" the proposal distribution appropriately, so that it continually adapts to the target distribution.

A. Metropolis–Hastings algorithm

The use of the Metropolis–Hastings algorithm to draw samples from the posterior distribution can be explained simply (Marshall et al. 2004):

(1) Start the simulation at iteration $i = 0$ with a random initial parameter vector, $\Theta = \Theta^{0}$.
(2) Generate a proposed value $\Theta^{*}$ for $\Theta$ from a proposal density that depends on the current value $\Theta^{i}$ of $\Theta$.
(3) Compute an acceptance probability $\alpha$ (depending on $\Theta^{i} ,\Theta^{*}$, the proposal density, and the model) that determines whether the proposed value $\Theta^{*}$ is accepted or not.
(4) Accept $\Theta^{i + 1} = \Theta^{*}$ with probability $\alpha$; otherwise set $\Theta^{i + 1} = \Theta^{i}$.
(5) Increment $i$ and repeat steps 2 through 4.
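
The five steps above can be sketched in a few lines of Python. This is a minimal illustration rather than the FORTRAN code developed in this study: it assumes a single scalar parameter, a Gaussian random-walk proposal, and a toy standard-normal target, and the names `metropolis_hastings`, `log_post`, and `step` are hypothetical. Working with log densities is a design choice that avoids numerical underflow.

```python
import math
import random


def metropolis_hastings(log_post, theta0, step, n_iter, seed=0):
    """Random-walk Metropolis sampler for a scalar parameter.

    log_post: function returning the log of the unnormalized posterior.
    step: standard deviation of the Gaussian proposal (an assumed choice).
    """
    rng = random.Random(seed)
    theta = theta0                      # step 1: starting value
    lp = log_post(theta)
    chain = [theta]
    for _ in range(n_iter):             # step 5: iterate
        cand = rng.gauss(theta, step)   # step 2: propose from q(. | theta)
        lp_cand = log_post(cand)
        # step 3: acceptance probability; the Gaussian proposal is
        # symmetric, so the proposal terms cancel in the ratio
        diff = lp_cand - lp
        alpha = 1.0 if diff >= 0 else math.exp(diff)
        if rng.random() < alpha:        # step 4: accept with probability alpha
            theta, lp = cand, lp_cand
        chain.append(theta)             # a rejected step repeats the current value
    return chain


# Toy target (hypothetical): standard normal, log density up to a constant
chain = metropolis_hastings(lambda x: -0.5 * x * x, theta0=3.0, step=1.0, n_iter=20000)
kept = chain[2000:]                     # discard an assumed burn-in period
mean = sum(kept) / len(kept)
var = sum((x - mean) ** 2 for x in kept) / len(kept)
print(round(mean, 2), round(var, 2))
```

For this toy target the sample mean and variance should settle near 0 and 1, which is the sense in which sample moments approximate the moments of the target distribution.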

This algorithm avoids the difficult step of evaluating moments and other summaries of the posterior distribution analytically, since reliable estimates of these moments can be obtained once a large parameter sample has been generated. For a given starting value and proposal density, the sample moments converge to the moments of the distribution. At that point, the parameter sequence ${\Theta }^{{\text{i}}} { }$ has converged in distribution to a stationary state. MCMC implementations of the Metropolis–Hastings algorithm use either single-site or block updating. In single-site updating, candidate values are generated for each element in turn. A candidate value $\theta_{i}^{\prime }$ is proposed from a univariate density $\left( { q_{i} \left( {\Theta ,\theta_{i}^{\prime } } \right) } \right)$ that depends on the present value $\Theta$. The transition from the present state $\Theta = \left( {\theta_{1} , \ldots ,\theta_{i} , \ldots ,\theta_{N} } \right)$ to the proposed state $\Theta^{\prime } = \left( {\theta_{1} , \ldots ,\theta_{i}^{\prime } , \ldots ,\theta_{N} } \right)$ is accepted with probability:

$$\alpha = \min \left( {1,\frac{{p\left( {\Theta^{\prime } /D} \right)q_{i} \left( {\Theta^{\prime } ,\theta_{i} } \right)}}{{p\left( {\Theta /D} \right)q_{i} \left( {\Theta ,\theta_{i}^{\prime } } \right)}}} \right)$$

(2)
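
As a short worked note (an addition for illustration, writing $\Theta^{\prime }$ for the proposed state): when the proposal density is symmetric, so that $q_{i} \left( {\Theta^{\prime } ,\theta_{i} } \right) = q_{i} \left( {\Theta ,\theta_{i}^{\prime } } \right)$, the proposal terms in Eq. 2 cancel and the acceptance probability reduces to a ratio of posterior densities:

$$\alpha = \min \left( {1,\frac{{p\left( {\Theta^{\prime } /D} \right)}}{{p\left( {\Theta /D} \right)}}} \right)$$

A proposed state with a higher posterior density than the current one is therefore always accepted, while a less probable one is accepted only some of the time.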

B. Adaptive metropolis algorithm

The adaptation significantly affects the size and spatial orientation of the proposal distribution. In addition, the algorithm is easy to implement and apply in practice. The Adaptive Metropolis (AM) algorithm is based on the classic random walk Metropolis algorithm (Marshall et al. 2004) and its modification. One significant benefit of the AM algorithm is that it uses the accumulated information from the start of the simulation. Because adaptation starts quickly, the search becomes more effective at an early stage of the simulation, which decreases the number of function evaluations needed.
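
The idea of tuning the proposal from the chain history can be illustrated with a scalar sketch. It is not the implementation used in this study. It assumes the conventional AM form in which, after an initial period `t0`, the proposal variance is the sample variance of all previous states scaled by $2.4^{2}/d$ (here $d = 1$) plus a small regularization `eps`; these settings, the function name, and the toy Gaussian target are all assumptions.

```python
import math
import random


def adaptive_metropolis(log_post, theta0, n_iter, t0=100, init_sd=1.0, eps=1e-6, seed=0):
    """Scalar Adaptive Metropolis: the proposal variance tracks the chain history.

    t0 (>= 1) is the number of initial iterations that use the fixed init_sd.
    """
    rng = random.Random(seed)
    sd_scale = 2.4 ** 2                 # s_d = 2.4^2 / d with d = 1 parameter
    theta, lp = theta0, log_post(theta0)
    chain = [theta]
    n, mean, m2 = 1, theta, 0.0         # Welford running mean and sum of squares
    for t in range(1, n_iter + 1):
        if t <= t0:
            prop_sd = init_sd           # fixed proposal during the initial period
        else:
            hist_var = m2 / (n - 1)     # sample variance of all states so far
            prop_sd = math.sqrt(sd_scale * (hist_var + eps))
        cand = rng.gauss(theta, prop_sd)
        lp_cand = log_post(cand)
        diff = lp_cand - lp
        if diff >= 0 or rng.random() < math.exp(diff):
            theta, lp = cand, lp_cand
        chain.append(theta)
        # update the running statistics with the new state (accepted or repeated)
        n += 1
        delta = theta - mean
        mean += delta / n
        m2 += delta * (theta - mean)
    return chain


# Toy target (hypothetical): Gaussian with mean 5 and standard deviation 0.2
target = lambda x: -0.5 * ((x - 5.0) / 0.2) ** 2
chain = adaptive_metropolis(target, theta0=0.0, n_iter=20000)
kept = chain[10000:]
am_mean = sum(kept) / len(kept)
am_sd = math.sqrt(sum((x - am_mean) ** 2 for x in kept) / len(kept))
print(round(am_mean, 2), round(am_sd, 2))
```

The initial proposal width (1.0) is poorly matched to the target width (0.2), and the adaptation gradually shrinks the proposal toward a scale suited to the target without manual tuning.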

Research methodology

The procedures followed to achieve the research goals are summarized in the flow chart presented in Fig. 2. The main program combines the solution of a forward problem with that of an inverse (backward) problem. The forward problem entails calculating the drawdown at a specific location at different times from known values of transmissivity and storativity (T and S) and hydraulic resistance (C), using the Theis equation for confined aquifers and the Hantush and Jacob equation for leaky aquifers (Batu 1998).
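
For the confined case, the forward problem can be sketched as follows. The Theis solution is $s = \frac{Q}{4\pi T}W\left( u \right)$ with $u = \frac{{r^{2} S}}{4Tt}$, where $W\left( u \right)$ is the well function. The sketch evaluates $W\left( u \right)$ by its power series, which is an assumed simplification suitable for the small to moderate $u$ typical of pumping tests (roughly $u < 5$). The function names and the T and S values in the example are hypothetical; the leaky (Hantush and Jacob) case is not covered.

```python
import math

EULER_GAMMA = 0.5772156649015329


def well_function(u, n_terms=60):
    """Theis well function W(u) evaluated by its power series.

    W(u) = -gamma - ln(u) + sum_{n>=1} (-1)^(n+1) u^n / (n * n!)
    """
    total = -EULER_GAMMA - math.log(u)
    term = 1.0                       # holds u^n / n!
    for n in range(1, n_terms + 1):
        term *= u / n
        total += (-1) ** (n + 1) * term / n
    return total


def theis_drawdown(T, S, Q, r, t):
    """Drawdown s = Q / (4 pi T) * W(u), with u = r^2 S / (4 T t).

    Units must be consistent, e.g. T in m^2/min, Q in m^3/min, r in m, t in min.
    """
    u = r * r * S / (4.0 * T * t)
    return Q / (4.0 * math.pi * T) * well_function(u)


# Hypothetical T and S; Q and r are the values quoted for the confined case
s10 = theis_drawdown(T=1.0, S=1e-4, Q=0.835, r=251.3, t=10.0)
s100 = theis_drawdown(T=1.0, S=1e-4, Q=0.835, r=251.3, t=100.0)
print(s10, s100)
```

Drawdown at the observation well grows with time, so the value at 100 minutes is larger than at 10 minutes.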

Building MCMC code (final program)

The steps of the main MCMC program for solving the inverse problem are shown in Fig. 3. The main MCMC code is written in FORTRAN. The first step of the program is to identify the hydraulic parameters, two (T and S) for confined aquifers and three (T, S, and C) for leaky aquifers, and to prepare the real data (actual drawdown) from the completed pumping test.

The prior distribution of each hydraulic parameter is unknown, so it is assumed to be uniform. At iteration $i$, random values $T_{i} , S_{i}$ are drawn from the uniform prior distribution and used in the forward problem to produce the calculated drawdown at the location of the observation well and at the same times as the pumping test. The difference (error) between the calculated and actual drawdown is then determined for each time. Next, the three likelihood functions (inverse, log, and exponential) are computed. The same computations are then repeated for iteration $i + 1$ with another randomly chosen value of each hydraulic parameter, $T_{i + 1} , S_{i + 1}$. To determine whether the newly chosen values are accepted or not, the values of each likelihood function and the corresponding probabilities are used to calculate $\alpha$, which is compared with a random value ($u$). These steps are repeated for subsequent iterations until the maximum number of iterations is reached. The process yields a posterior distribution made up of all accepted values of T and S.
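
The loop described above can be sketched for the confined case as follows. This is an illustrative reconstruction under stated assumptions, not the FORTRAN program: the exact forms of the inverse, log, and exponential likelihood functions are not given here, so the sketch uses one assumed exponential form, $\exp \left( { - {\text{SSE}}/2\sigma^{2} } \right)$, where SSE is the sum of squared drawdown errors and $\sigma$ is a hypothetical error scale. Candidates are drawn independently from the uniform prior, the data are synthetic, and all function names are hypothetical.

```python
import math
import random

GAMMA = 0.5772156649015329


def theis(T, S, Q, r, t, n_terms=40):
    """Theis drawdown via the series form of the well function (moderate u assumed)."""
    u = r * r * S / (4.0 * T * t)
    w, term = -GAMMA - math.log(u), 1.0
    for n in range(1, n_terms + 1):
        term *= u / n
        w += (-1) ** (n + 1) * term / n
    return Q / (4.0 * math.pi * T) * w


def log_likelihood(T, S, Q, r, times, observed, sigma):
    """Log of the assumed exponential likelihood exp(-SSE / (2 sigma^2))."""
    sse = sum((theis(T, S, Q, r, t) - s) ** 2 for t, s in zip(times, observed))
    return -sse / (2.0 * sigma ** 2)


def mcmc_confined(times, observed, Q, r, t_bounds, s_bounds, sigma, n_iter, seed=1):
    rng = random.Random(seed)
    T, S = rng.uniform(*t_bounds), rng.uniform(*s_bounds)   # first draw from the prior
    ll = log_likelihood(T, S, Q, r, times, observed, sigma)
    chain = []
    for _ in range(n_iter):
        T_new, S_new = rng.uniform(*t_bounds), rng.uniform(*s_bounds)
        ll_new = log_likelihood(T_new, S_new, Q, r, times, observed, sigma)
        # the uniform prior and uniform proposal cancel, so alpha is a likelihood
        # ratio; logs are used so that very small likelihoods do not underflow
        alpha = math.exp(min(0.0, ll_new - ll))
        if rng.random() < alpha:                            # compare alpha with u
            T, S, ll = T_new, S_new, ll_new
        chain.append((T, S))
    return chain


# Synthetic check with hypothetical "true" values T = 1.0 m^2/min and S = 1e-4
Q, r = 0.835, 251.3
times = [2, 5, 10, 20, 30, 50, 70, 100]
observed = [theis(1.0, 1e-4, Q, r, t) for t in times]
chain = mcmc_confined(times, observed, Q, r, (0.5, 2.0), (5e-5, 2e-4), 0.01, 20000)
kept_T = sorted(T for T, _ in chain[2000:])
median_T = kept_T[len(kept_T) // 2]
print(median_T)
```

Because every candidate is drawn from the whole prior range, most candidates are rejected when the data are informative; the retained states still concentrate around parameter values that reproduce the observed drawdown.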

Verification of the program analytically

The verification process (see Fig. 4), which is based on the available pumping test data, proceeds in four main steps. The first step uses the MCMC code, in which three likelihood functions (L.H.) are applied: inverse, log, and exponential. For each likelihood function, the accepted values of the hydraulic parameters (the posterior distribution) are collected, and two distributions, lognormal and normal, are tested to represent the accepted values. For each assumed distribution and each likelihood function, three measures of central tendency (mean, median, and mode) are tested.

For each measure (the mean, for example), the root mean square error (RMSE) between the accepted values and the tested distribution is calculated for each hydraulic parameter (${\text{RMSE}}_{T}$, ${\text{RMSE}}_{S}$, ${\text{RMSE}}_{C}$). Dimensionless factors are then calculated, $F_{1} = {\text{RMSE}}_{T} /\mu_{T}$, $F_{2} = {\text{RMSE}}_{S} /\mu_{S }$, and $F_{3} = {\text{RMSE}}_{C} /\mu_{C}$, together with their average: $F_{{{\text{av}}{.}}} = \left( {F_{1} + F_{2} } \right)/2$ for confined aquifers and $F_{{{\text{av}}{.}}} = \left( {F_{1} + F_{2} + F_{3} } \right)/3$ for leaky aquifers (see Fig. 4). The best distribution (lognormal or normal) for each likelihood function is the one with the minimum value of $F_{{{\text{av}}{.}}}$.
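
The arithmetic of this selection step can be sketched as below. The numbers are hypothetical and are not results from this study, and $\mu$ is taken here simply as a representative central value of each parameter, which is an assumption. The helper name `average_factor` is also hypothetical.

```python
def average_factor(rmse, mu):
    """Return the factors F_i = RMSE_i / mu_i and their average F_av."""
    factors = [e / m for e, m in zip(rmse, mu)]
    return factors, sum(factors) / len(factors)


# Hypothetical confined-aquifer values for (T, S) under a normal distribution
factors, f_av = average_factor(rmse=[0.05, 2.0e-6], mu=[1.0, 1.0e-4])

# Hypothetical F_av values for the two candidate distributions
candidates = {"normal": f_av, "lognormal": 0.021}
best = min(candidates, key=candidates.get)   # smallest F_av is selected
print(factors, f_av, best)
```

Dividing each RMSE by the parameter's own scale makes T and S comparable even though they differ by several orders of magnitude.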

The second step is to select the best likelihood function, with its associated best distribution (lognormal or normal) and best representative central measure, to represent the case study graphically using the standard (reference) curves. Then, the 90% confidence interval for each hydraulic parameter is determined. The third step is to use the traditional methods (AquiferTest and AQTESOLV software) to obtain a single value each for T, S, and C. The final step is to compare the single values of the aquifer hydraulic parameters obtained with the traditional methods against the 90% confidence limits of the parameters resulting from the MCMC code.
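
The final comparison can be sketched as follows. The sketch assumes the 90% limits are taken as the empirical 5th and 95th percentiles of the accepted samples, which is a simplification since the study derives them from a fitted normal or lognormal distribution. The helper names and sample values are hypothetical.

```python
def percentile(sorted_vals, p):
    """Linear-interpolation percentile (p in 0..100) of an already sorted list."""
    k = (len(sorted_vals) - 1) * p / 100.0
    lo = int(k)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (k - lo)


def interval_90(samples):
    """Empirical 5th and 95th percentiles of the accepted samples."""
    vals = sorted(samples)
    return percentile(vals, 5), percentile(vals, 95)


def inside(value, interval):
    """True if a single traditional estimate falls within the interval."""
    return interval[0] <= value <= interval[1]


# Hypothetical accepted samples: the integers 0 to 100
limits = interval_90(range(101))
print(limits, inside(50, limits), inside(99, limits))
```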

Proof of the concept numerically

By developing 3D numerical groundwater models (using the MODFLOW program), a hypothetical case is generated with known hydraulic parameter values for confined and semi-confined aquifer layers in order to demonstrate the effectiveness of the developed MCMC tool. To validate the concept quantitatively, the aquifer parameters produced by the developed MCMC tool were compared with the known values and with the values produced by the traditional methods.

Application of the program to a real case

The proposed MCMC code is applied to data from one actual pumping test in El-Wahat El-Baharya, Egypt, in order to test the applicability of the code.

Results and discussion

Verification of the program analytically

Three hypothetical cases for a confined aquifer and another three for a semi-confined aquifer were used to verify the MCMC code (Bashandy et al. 2022). Because of the limit on manuscript length, only one case for each aquifer type is presented here; for details of the other two cases for each aquifer type, refer to Bashandy et al. (2022).

Verification process: hypothetical case for confined aquifer

A. Description of the hypothetical case

The hypothetical case comprises 22 drawdown records from a pumping test with a well discharge rate of 0.835 m³/minute, as shown in Table 1 (Fetter 2014). The confined aquifer is 14.6 m thick, the pumping well is fully penetrating, and the observation well is located about 251.3 m from the pumping well.

Table 1 Pumping test data for the confined aquifer–hypothetical case
