Evaluating the Groundwater Quality Variations Using Factor Analysis Combined with Information Entropy Theory

Document Type : Research Paper

Authors

1 Assist. Prof., Hydrogeology, Kharazmi University, Faculty of Geosciences, Tehran-Iran

2 Ph.D. Student, Hydrogeology, Kharazmi University, Faculty of Geosciences, Tehran-Iran

Abstract

Introduction
The chemical characteristic of groundwater as a major source of water supply for human life is an important factor in determining the usage of it for agriculture, industry and drinking. Useful techniques and different methods of studying the chemical composition of groundwater such as correlation coefficient, descriptive statistics, factor analysis and cluster analysis provide an understanding of the main factors that contribute to groundwater salinity and also sources of contamination. Shannon proposed entropy as a measure of uncertainly, a theory recently applied in various fields. Although the ability of the entropy theory has quantified to evaluate uncertainty for hydrological variables and parameters in models of water resources systems, studies have not fully explored its application for describing and evaluating large-scale characteristics of groundwater quality. The three objectives of this study included: (1) applying cluster analysis and GIS technique to recognition of spatial classification of groundwater resources; (2) using factor analysis to interpret major factors that affect groundwater quality in Lenjanat, and (3) investigating the stability and spatial variation of influential factors.
Materials and Methods
In this study, 155 groundwater samples were collected from Lenjanat plain, Isfahan province. the analyzed concentrations of 10 ions including (Ca), (Na), (K), (Mg), (HCO3), (Cl), (F), (NO3), (SO4) and (EC), and 11 metallic species including (As), (Ba), (Cd), (Cr), (Cu), (Fe), (Mn), (Ni), (Pb), (Sb) and (Se) has been used. Finally, 14 parameters with a moderate to high correlation coefficient were chosen to assess the groundwater quality in this plain. In this study, different methods including geostatistics, information entropy theory, and   multivariate statistical methods were employed to assess groundwater quality. In order to extract the most important factors in groundwater quality change, the factor analysis method was used. Factor analysis is a multivariate statistical technique that reduces the major variables to fewer factors which can be used to develop the best interpretable model. This study employed hierarchical agglomerative cluster analysis on standardized data using Ward’s method with squared Euclidean distance. Information entropy theory using in this study was another important part of the research process. Shannon introduced the entropy concept into information theory by suggesting entropy as a measure of information or uncertainty.
Discussion and results
Multivariate analysis and clustering of the data and parameters performed using 14 selected parameters which had a high to moderate correlation. In the first step, normalized values of 14 parameters of 155 samples were used for clustering and factor analysis. Factor analysis of quality parameters revealed that 70.67% of groundwater quality changes in Lenjanat plain are controlled by three factors that expression of each factor is described in the following. Table 1 presents the rotated common factors for the percentage of variance and the total cumulative percentage of variance.
Table 1: The Varimax rotated common factors for loadings, the percentage of variance and the total cumulative percentage of variance





Parameter


Component




1


2


3




Ca


-0.05


0.41


0.73




Na


0.04


0.79


0.35




Mg


0.06


-0.04


0.76




F


0.27


0.78


-0.15




NO3


0.21


0.08


0.69




SO4


0.14


0.81


0.29




EC


-0.07


0.56


0.71




As


-0.92


-0.14


0-.11




Cr


0.47


0.02


-0.07




Cd


0.95


0.06


0.10




Cu


0.95


0.08


0.097




Mn


-0.96


-0.08


-0.10




Ni


0.48


0.16


-0.02




Pb


0.88


-0.006


0.12




Eigenvalue


5.48


3.14


1.27




Total variance (%)


39.19


22.42


9.07




Cumulative variance (%)


39.19


61.61


70.67





Factor 1: This factor indicates high relation with five elements including As, Cd, Cu, Mn and Pb accounts of 39.19% of the total variance in groundwater quality parameters. The existence of these parameters in groundwater is the result of human activities and pollutions generated from industrial areas in this plain.
Factor 2: The impact on the groundwater quality change by this factor is equal to 22.42%. The main source of sodium and sulfate in this plain is the presence of crystal and inter-beds of gypsum and salt deposits in the region. In addition, the use of chemical fertilizers and industrial & residential sewage discharge are another causes of sulfate and fluoride concentration.
Factor 3: This factor controls 9.07% of groundwater quality variance in Lenjanat plain. According to the geological structure of region, source of Ca and Mg in the groundwater is generally natural and these elements could have been derived from erosion of limestone, dolomite and their minerals. Existence of NO3 in factor 4 and as an effective agent in destroying the quality of water show the infiltration of wastewater and the nitrate fertilizers used in farmland to subsurface.
The information entropy for each selected groundwater monitoring well is calculated and the groundwater monitoring wells are ranked according to their calculated information entropy values. In the next step, the ranking values of groundwater monitoring wells were summed up for each parameter classified by each common factor. Table 2 shows some of the results. Finally, the magnitude of the sum of ranks was used to determine the stability of groundwater quality. The smaller values indicate more unstable groundwater qualities. Except for As, Cd and Mn which have the lowest entropy and the maximum weight, all parameters show similar entropy and entropy weight. Accordingly, all parameters except three mentioned parameters have been constant changes and many of these changes can be attributed to the geological formations.

On the other hand, high entropy weight than other parameters indicate the higher influence. So, the heavy metals concentrations in groundwater are the most effective parameters in quality change.
Table 2: Entropy and entropy weight values of parameters





 


Ca


Na


Mg


F


NO3


SO4


EC


As


Cr


Cd


Cu


Mn


Ni


Pb




Entropy Value


7.21


7.19


7.26


7.21


7.24


7.23


7.22


6.92


7.24


6.46


7.21


6.87


7.20


7.22




Entropy Weight


11.06


11.06


11.06


11.06


11.06


11.06


11.06


11.08


11.06


11.12


11.06


11.09


11.06


11.06





 

 
The entropy value and weight of each factor that are presented in Table 3 show that the highest entropy weight are associated with factor 1, and afterwards, factors 3 and 2. According to the results, the greatest impact on groundwater quality of Lenjanat plain is related to parameters of the factor 1 and factor 3 and finally, factor 2. The correlation between entropy value of each parameter and corresponding factor is presented In Table 4.
Table 3: The entropy values and entropy weight for different factors





Sample


Information Entropy Value


Entropy Weight




F1


F2


F3


F1


F2


F3




1


0.23459


0.167692


0.208551


0.358671


0.213175


0.285365




2


0.232756


0.168242


0.212976


0.358844


0.213154


0.28506




3


0.23459


0.157132


0.192494


0.357811


0.213457


0.285887




.


.


.


.


.


.


.




154


0.226236


0.098825


0.183919


0.355484


0.21604


0.284169




155


0.222986


0.134493


0.193117


0.357315


0.214337


0.284751




Sum


34.707


21.643


28.946


55.426


33.193


44.251





Table 4: Correlation between entropy value of each parameter with the corresponding factor score





Pb


Mn


Cu


Cd


As


Parameter




0.860


0.964


0.881


0.858


0.924


Factor 1 scores & Entropy value




-


-


SO4


F


Na


Parameter




-


-


-0.811


-0.775


-0.781


Factor 2 scores & Entropy value




-


EC


NO3


Mg


Ca


Parameter




-


-0.724


-0.723


-0.751


-0.731


Factor 3 scores & Entropy value





 
 
Conclusions
According to the minimum and maximum salinity in the plain, it can be expressed that the most important factor in groundwater quality variance is located in the plain. Correlation between various parameters of quality shows that most influence on electrical conductivity is due to calcium, sodium and sulfate. The same Changes and fluctuations in weight of factor 1 (39.2% of the variation in factor load) with the entropy value of heavy metals indicates the importance of this factor in determining the concentration of heavy metals. This factor has a manmade origin and is not connected with the natural environment and geological formations. Strong negative correlation between the factor score and entropy value of heavy metals shows that the origin and changes in parameters of factor 2 (with 22.4% change in factor load) and 3 (with 9.1% changes in factor load) is due to natural factors and human activities has the lowest impact.

Keywords