Cation exchange capacity (CEC) is an important soil property because it affects the assimilation of nutrients and buffers against soil acidification. Thus, knowledge of CEC is considered key to developing agricultural and environmental models for land management planning. However, in developing countries such as Sudan, there is a lack of soil CEC data due to the absence of research projects and funding to develop this information. Therefore, this research was conducted to predict CEC for large areas using specific soil physical characteristics, including soil texture and saturation percentage (SP), for which there is potentially available data. To achieve this goal, the properties of 430 soil samples (301 for training and 129 for validation) were obtained from the soil database of the Soil Survey Administration, Ministry of Agriculture, Sudan, which had different soil depth intervals (0–0.3 m, 0.3–0.6 m, 0.6–0.9 m, 0.9–1.5 m, and >1.5 m) from Entisols in the Northern State of Sudan. The data were stratified into homogeneous groups based on the textural classes of the main soil order. Then, regression models were performed and evaluated using the coefficient of determination (R2), standard error of the estimate (SEE), and root mean square error (RMSE). The results indicated that in individual Entisols and textural classes, the combined soil covariates silt, clay, and SP were the best properties to predict CEC values (R2 ranged from 0.86 to 0.99). The regression models did not provide statistically significant results for the silty clay loam textural class (R2 ranged from 0.01 and 0.35). The findings of this modeling study could be applied to other Entisols worldwide with divergent textural classes, which could be used to verify the suggested CEC pedotransfer functions and/or improve them. This would help farmers correctly design soil management plans and prevent acidification issues if combined with other soil properties data.
Introduction
Soil cation exchange capacity (CEC) is widely considered an important soil property and is used as an input variable in many agricultural and environmental models (da Silva et al., 2018; Keshavarzi et al., 2017; Liddicoat et al., 2018; Shiri et al., 2017; Sulieman et al., 2018). The CEC is commonly used as an indicator of soil quality (Golchin & Asgari, 2008; Swanepoel et al., 2015; Valle & Carrasco, 2018; Xu et al., 2006), nutrient retention (Mukhopadhyay et al., 2019), and the capacity to protect groundwater from contaminants, particularly cation contamination (Khaledian et al., 2017). It buffers fluctuations in soil pH and, therefore, nutrient availability (Hazelton & Murphy, 2016).
The determination of CEC in the laboratory is a painstaking and costly process because it involves saturation shaking and centrifugation of the soil suspension nine times before sodium can be accurately determined (Aladejana et al., 2018; Dohrmann, 2006). Thus, CEC modeling and prediction is a vital research topic, particularly in non-developed countries where nature-based solutions and agricultural management plans must be implemented due to rapid soil degradation, but information on CEC is lacking, and funds to obtain such information are extremely limited (Aladejana et al., 2018; Bajocco et al., 2018; Grinblat et al., 2015; Kouba et al., 2018; Tesfahunegn, 2019).
Several methods have been proposed to predict and quantify CEC from known soil properties. Many of these rely on statistical models, such as general linear models (Mishra et al., 2019; Sulieman et al., 2018), linear regression models (Seybold et al., 2005), a combination of principal components analysis with soil linear and regression models (Fox & Metla, 2005), fractal parameters using artificial neural networks (ANNs; Bayat et al., 2014), application of genetic expression programming (GEP) and multivariate adaptive regression splines (MARS) (Emamgolizadeh et al., 2015), a combination of adaptive neuro-fuzzy inference system (ANFIS) with data acquisition through remote sensing (Keshavarzi et al., 2017), and a mixture of ant colony organization algorithm and adaptive network-based fuzzy systems with multiple linear regression (Shekofteh et al., 2017). Soil data that have been used in these models include sand, silt, clay, organic carbon, pH, calcium carbonate equivalent (Khaledian et al., 2017), hyperspectral visible near-infrared (Vis–NIR) spectroscopy (Gogé et al., 2014; Gomez et al., 2012; Lu et al., 2013; Pirie et al., 2005; Rehman et al., 2019; Rossel & Webster, 2012; Ulusoy et al., 2016), apparent electrical conductivity as determined with the Veris-3100 (Koganti et al., 2017), and a combination of nuclear magnetic resonance (NMR), X-ray diffraction (XRD), and nitrogen adsorption–desorption isotherm analysis on clay minerals (Cheng & Heidari, 2018). Soil CEC has also been successfully quantified using Sentinel-2A (S2A) multispectral satellite images (Vaudour et al., 2019).
Among the various aforementioned statistical approaches, multiple regression (MR) methods have been widely applied to estimate soil CEC using different soil parameters, and their findings were highly accurate. For example, Kashi et al. (2014) employed MR to predict CEC using bulk density, soil texture, electrical conductivity, lime percentage, and sodium adsorption ratio (SAR) in soils of the Ghoshe Region in Semnan Province, Iran, and found that the performance of the model was very good (R2 = 0.77; mean absolute error [MAE] = 1.85; root mean square error [RMSE] = 1.92 cmol + kg−1). Likewise, Ghorbani et al. (2015) showed similar results when soil pH, soil organic carbon (SOC), and soil texture were used to predict CEC in soils of Golestan Province, Iran (R2 = 0.78; MAE = 1.64; RMSE = 1.83 cmol + kg−1). Seyedmohammadi et al. (2016) found that CEC was strongly correlated with clay and SOC contents (R2 = 0.77) in the soils of Guilan province, northern Iran. Moreover, Olorunfemi et al. (2016) concluded that an MR model based on pH, clay content and SOC was the best one for predicting CEC in the forest soils of Nigeria (R2 = 0.71; MAE = 1.16; RMSE = 1.39 cmol + kg−1).
Several authors confirmed that soil texture can be a key property to model and predict CEC. Clay and decomposed organic matter (humus) are considered among the most important components that contribute to soil CEC variations, while silt has proven useful but to a lesser extent. Bayat et al. (2014), Khaledian et al. (2017), and Sulieman et al. (2018) mentioned that CEC was significantly positively influenced by clay content and negatively affected by sand content. Likewise, the saturation percentage (SP) is mainly determined by soil physical properties (Dane & Topp, 2002). Consequently, SP may also be used as a quantitative predictor of soil texture (Stiven & Khan, 1966), water-storage capacity, and CEC (Rodrigo-Comino et al., 2019). For instance, Aali et al. (2009) successfully used MR, ANNs, and ANFIS techniques with clay, silt, and organic carbon contents as independent variables to predict SP in the Boukan region in northwestern Iran.
In African countries, intensive agricultural activities (e.g., excessive fertilization, improper pesticide usage, and heavy machinery usage) are being expanded with new soils and lands brought into production annually (Thomas, 2008). In addition, due to political and armed conflicts, this expanded agricultural sector is becoming more relevant (Enaruvbe & Atafo, 2019; Enaruvbe et al., 2019). These lands are usually quite large in size, which means large numbers of soil samples will have to be collected to carry out traditional soil surveys and land evaluation (Mekonnen et al., 2016, 2017). One example of this expanding agricultural dynamic is in Northern Sudan (NS), which is in a desert agro-ecological zone (about 30% of the total country) characterized by scarce precipitation and consequently a lack of natural vegetation. Irrigated agricultural practices are commonly concentrated along the Nile River, and the region depends on dates palm and field crops, such as sorghum, maize, millet, wheat, barley, and pulses (Food and Agriculture Organization [FAO] and World Food Programme [WFP], 2011). The main soil orders in NS are Aridisols (along sand dunes) and Entisols (along the Nile River terraces), according to soil taxonomy (Soil Survey Staff, 2014a).
The importance of CEC in assessing soil quality and its potential for various land management systems under different soil types and climate conditions is broadly accepted (Aparicio & Costa, 2007; Biswas et al., 2017; Chaves et al., 2017; Marinari et al., 2006; Masto et al., 2009; Mukhopadhyay et al., 2019; Pulido et al., 2017; Wan et al., 2018; Zuber et al., 2017). Knowledge of CEC is becoming increasingly important in dry areas because they suffer from many problems such as a lack of sufficient water to support plant growth and land degradation. Given the lack of funding to conduct traditional laboratory CEC analyses, it is necessary to develop models to predict CEC in the soils of regions such as NS. Therefore, this study’s goal was to (1) investigate linear (LR) and nonlinear (NLR) statistical models to predict CEC using clay, silt, and SP as covariates, (2) verify the validity of the soil CEC model(s), and (3) identify the most suitable properties for CEC prediction among the selected covariates.
Materials and Methods
Study area and soil sampling
The study area is located at 27° 30’ 17.28” to 27° 37’ 44.04” E longitude and 19° 21’ 43.2” to 19° 53’ 19.32” N latitude, which covers about 1,500 km2 in the Northern State (NS) of Sudan. According to Soil Taxonomy (Soil Survey Staff, 2014a), the study area has aridic and hyperthermic soil moisture and temperature regimes, respectively. Soils of NS are mostly Entisols and Aridisols (Soil Survey Staff, 2014a). These soils have been classified down to the subgroup level (Ministry of Agriculture, 1997). The soils of the area can be divided into two major groups based on their origin: (1) sedentary (residual) soils, which occupy partially elevated sites that seem to have been affected by chemical weathering during a former period was wetter than the present climate. They are medium-textured soils of grayish and dark grayish color and (2) transported soils which resulted from weathering products transported into the area either by wind (aeolian) or water (fluvial processes). Therefore, these involve two types of soils: (1) soils formed by wind deposits which are sandy and characterized by low fertility, low water holding capacity, and high infiltration rates. These soils are classified as Psamments at the suborder level. (2) Soils formed by water deposits from the Nile River during exceptionally high floods or by khors and wadies of a former wetter climate. These soils are classified as Fluvents. The geographical locations of the selected representative profiles are shown in Figure 1. Natural vegetation in the NS consists mainly of Acacia spp., mostly Acacia ehrenbergiana, along with others, such as Leptadenia pyrotechnica, Balanites aegyptiaca (L), and Calotropis Procera. A detailed soil survey was performed by the Soil Survey Administration (Ministry of Agriculture, 1997) to describe the morphological properties of the representative profiles including depth, color, structure, texture, gravels, consistence, the occurrence of nodules, and special features using the standard guidelines for soil profile description (Schoeneberger et al., 2012). The representative profiles were selected to cover all identified landform units in the area and the location of the soil profiles was recorded using a handheld global positioning system (GPS; Garmin Montana 680t). All profiles were excavated to the C horizon. At each profile site, soil samples were collected from five soil depth intervals with approximately 2–3 kg of soil material collected from each horizon.
Data source and extraction
Data from Part II (classification and correlation section) of the soil series of Sudan, provided by the Soil Survey Administration (Ministry of Agriculture, 1997), were used for this study. This database represents soil series from NS, River Nile State, and Khartoum State. The soil samples were collected from five standard depths: 0–0.3, 0.3–0.6, 0.6–0.9, 0.9–1.5, and >1.5 m. The data set contains complete soil properties including bulk density, SP, clay%, sand%, silt%, soil pH, ECe, exchangeable cations, CaCO3, SAR, CEC, total organic carbon (TOC), total nitrogen, total phosphorus, and available macro- and micro-nutrients for 36 soil series of the Entisols, characterized by 430 soil samples (approximately 12 samples for each soil series).
Methods of analysis
The SP was calculated by the weight difference method, and particle size fractions were determined by the pipette method (Gee & Bauder, 2002). The CEC was determined with 1 M NH4OAc at pH 7 (Soil Survey Staff, 2014b), Na+ and K+ concentrations were determined using a flame photometer (Jenway PFP7), and concentrations of Ca2+ and Mg2+ were determined by the titration method.
Statistical analysis
First, a bivariate linear correlation (Pearson) analysis was performed between CEC and the soil covariates at the order level (Entisols), and then split further into sub-orders based on the different textural classes of the main order. Based on the correlation results, only the variables that revealed significant differences at p ⩽ 0.05 were used in the regression equations to predict CEC. Following regression analysis, only the variables that yielded ⩾5% of the R2 value in the regression models were used in the final regression models. Multiple linear and nonlinear regressions were used to establish the relationship between a dependent variable with the chosen independent variables.
where y is the dependent variable; a1, a2,. . ., an are the regression coefficients; x1, x2,. . ., xn are the independent variables.
All statistical analyses were performed using SPSS software, version 22 (IBM Corporation, 2012).
Testing the models’ performance
The performance of all regression models was assessed using three common statistical standards: (1) coefficient of determination (R2), (2) standard error of the estimate (SEE), and (3) RMSE. The SEE and RMSE were calculated from the differences between the predicted CEC values and measured values to determine the precision and bias of the prediction (Verfaillie et al., 2006). The three criteria above were calculated according to the following equations:
where n is the number of points at ith site, obs is the determined CEC value, and pred is the predicted CEC value from the regression models.
Results and Discussion
Regression of CEC for all soils in the Entisols
A summary of the descriptive statistics for the soil properties in the Entisols is shown in Table 1. There were high variations in CEC as indicated by the coefficient of variation (CV = 51.1%, Wilding, 1985). This was likely due to variation in the soil properties that control CEC (da Silva et al., 2018; Koganti et al., 2017), and clay had a similar CV (47.4%). Some of these soils were Fluvents formed along the river, and Fluvents often exhibit a wide textural range in their parent materials (Ahearn et al., 2005; Anderson, 1988; Cerdà, 1999; Challa et al., 2008; Martínez-Hernández et al., 2017). The correlation results for the Entisols (Table 2) showed highly positive CEC relationships with all different covariates (R2 ranged from 0.71 to 0.98, significant at 0.01 level).
Table 1.
Summary for Descriptive Statistics of Measured Soil Properties for Entisols and Different Textural Classes.

Table 2.
Correlations (Pearson) for CEC with Studied Soil Covariates for All Entisols and the Different Textural Classes.

Table 3 shows the regression models, coefficient of determination (R2), SEE, and RMSE for the CEC regressions using the covariates silt, clay, and SP; silt and clay; clay and SP; silt + clay; silt and SP; clay; and silt for the Entisols. All the equations were highly significant (p > 0.001). However, the R2 value was highest and RMSE lowest for the regressions that used silt, clay, and SP to predict CEC. This indicates that CEC is affected by more than one factor. This result agrees with Asadu (1990), who reported that CEC could be predicted from soil organic matter (SOM) and clay content when grouped by taxonomic order (Inceptisols, Alfisols, Ultisols, and Oxisols), with this grouped order tending to reduce the variability in the soils. Drake and Motto (1982) noticed that the CEC prediction was improved when they grouped the soils based on their classification or district. This also agrees with Zeraatpisheh and Khormali (2012), who reported that about 96% of CEC variations were predicted by clay content, SOM content, and pH. Moreover, Khaledian et al. (2017) found that regression equations combining clay, silt, sand, pH, and organic carbon could explain 51%–93% of variations in CEC.
Table 3.
Regression Equations for CEC with Different Soil Properties in the Entisols and Different Textural Classes.

Using clay and silt to model CEC also gave high R2 and low RMSE values, more or less equal to the regression that included all covariates (Table 3). This is different than the results of Krogh et al. (2000), who reported that approximately 90% of soil CEC variations could be explained by silt, clay, OC, and pH. The pH values of the soils in NS do not change much because of the high buffering capacity in these soils. Also, these soils are inherently low in SOM, rarely exceeding 0.5% (Sulieman et al., 2018). Khaledian et al. (2017) found that SOM was not a significant factor determining CEC in soils with low SOM contents.
The regression model using just silt to determine CEC was also significant (R2 = 0.44), which agreed with Morrás (1995), who reported that the silt fraction of two loess soils in the southern region of the Chaco Basin, Argentina, was responsible for between 1/6 and 1/3 of the total soil CEC. This also agreed with Turpault et al. (1996), who reported that clay could have coated the coarser particles and therefore transformed silt and sand into negatively charged particles. This could have been the reason that these fractions influenced CEC. The combination silt + clay also gave a significant regression with high R2 and relatively low RMSE (Table 3).
Regression of CEC on the different textural classes in individual Entisols
Results of the general statistics for CEC and other soil variables of the different textural classes for Entisols are presented in Table 1. Medium and low variations in CEC values were indicated by the medium and low CV in all textured classes (CV = 10.31%–24.07%) except for sandy textured soils which showed high CEC variation (CV = 38.30%). Medium and low variation in CEC is likely due to medium and low variations in the soil properties used to predict CEC. Furthermore, CEC in sandy, loamy sand, and sandy loam was more variable (CV = 38.30%, 24.07%, and 21.44%, respectively) than in the sandy clay loam, clay, and silty clay loam (CV = 13.59%, 10.85%, and 10.31%, respectively) soils. The correlation results for the sandy clay loam, sandy loam, and loamy sand textural classes (Table 2) mostly revealed highly positive CEC relationships for all covariates with R2 values that ranged from 0.77 to 0.98 (significant at 0.01 level). However, the CEC was not correlated with silt (R2 = 0.13) or silt plus clay (R2 = 0.48) in the sandy clay loam and loamy sand textural classes, respectively.
The coefficients of determination (R2) of CEC regression on (silt, clay, and SP), (silt and clay), (clay and SP), (silt + clay), (silt and SP), (clay), and (silt) for the six different soil textural classes are given in Table 3. In general, for all soil textures, R2 values were highest for the MRs for sandy clay loam, sandy loam, loamy sand, clay, and sand, respectively. This agreed with Ersahin et al. (2006), who reported that particle size distribution could estimate CEC and grouped textures from sandy loam to clay. The lowest R2 value was obtained for silt clay loam (R2 = 0.35) and was not significant (p > 0.05). The regression equations were highly significant (p ⩽ 0.001) to significant (p ⩽ 0.01) in all divisions except the regression using silt and SP for loamy sand, which was significant at (p ⩽ 0.05), and sandy and silty clay loam textural classes, which were not significant (p ⩾ 0.05).
Values for R2 using clay content for all regressions of the clay and loamy sand textures were higher than for silty clay loam, sandy loam, and sand. However, sandy texture, as expected, had the lowest R2 values. This could be associated with the influence of clay on CEC as compared to sand.
This agreed with Liddicoat et al. (2018), who established a pedotransfer function (PTF) to predict CEC in alluvial soils across multiple geochronological settings using clay content and SOM as soil covariates. Their results showed there was a direct relationship between these variables and soil CEC. Martel et al. (1978) also reported that fine clay and total clay content in lowland soils in Quebec were more highly related to CEC than to surface area and/or SOM content.
R2 for the regression on silt were higher for sandy loam, clay and loamy sand compared to the R2 values for clay, sand, and silty clay loam soil textures (Table 3). This could have been due to the high silt content, which gave it a high contribution to CEC compared to the other textural classes, which had lower values of R2 due to a large amount of sand. This agreed with Curtin and Smillie (1981), who reported that silt significantly contributes to total CEC when its content is large relative to clay and sand content. This also coincides with McAleese and McConaghy (1957), Martini (1970), and Alxiades et al. (1973), who stated that silt and even sand could make a significant contribution to soil CEC. Lower values of R2 were obtained for the regression equations of all multiple- and single-factor regressions (Table 3). However, this disagreed with Ersahin et al. (2006), who reported that particle size distribution could estimate CEC when grouped by textural class from sandy loam to clay. This finding also disagreed with Rashidi and Seilsepour (2008), who suggested using silt and clay to predict CEC. The equations may have failed to predict CEC due to the small sample size for silty clay loam.
Clay and silt gave high R2 values that were more or less equal to the MRs for sandy clay loam, sandy loam, loamy sand, and clay textural classes in all Entisols. This coincided with Rashidi and Seilsepour (2008), who reported that silt and clay contents could significantly contribute to CEC prediction. Soil pH values in Sudan do not vary much because of the high buffering capacity in these soils (Sulieman et al., 2016, 2018; Sulieman & Ibrahim, 2013). In some textural classes (loamy sand, clay, sandy, and silty clay loam), the R2 values using the silt plus clay regression were very low. The analysis of variance (ANOVA) test indicated that there was no significant difference between mean values of measured and predicted CEC. However, for the clay, sandy, and silty clay loam regressions, the equations were not suitable due to a low number of observations (N = 7). The PTF models that gave the best prediction of CEC for the studied soils are given in Table 4.
Table 4.
Multiple Regression Equations Suggested for Use to Predict CEC in the Entisols of Northern Sudan.

Performance of the models
Results of the prediction error indices (R2, Adjusted R2, and SEE) obtained from the validation of PTF-CEC models are shown using 129 samples for Entisols without textural classes partitions samples for sandy clay loam samples for sandy loam (Figure 2(a) to (g)), and 11 samples for loamy sand (Figure 3(a) to (g)) from the validation data set. For the Entisols (Figure 4), the best values were obtained when using a combination of silt, clay, and SP as covariates, Figure 4(a); R2 = 0.96, R2 adj. = 0.96, SEE = 2.18, and RMSE = 2.21 cmol + kg−1, while silt alone (Figure 4(e)) gave the worst indices (R2 = 0.44, R2 adj. = 0.44, SEE = 5.4, and RMSE = 7.96 cmol + kg−1). For the different textural classes (Figures 2, 3, and 5), the sandy clay loam using the covariates silt, clay, and SP (Figure 5(a)) had the best PTF fit with R2 = 0.9617, R2 adj. = 0.96, SEE = 0.61, and RMSE = 0.62 cmol + kg−1. However, the sandy loam textural class with variable clay (Figure 2(c)) showed the worst indices (R2 = 0.15, R2 adj. = 0.12, SEE = 1.04, and RMSE = 3.79 cmol + kg−1).
Figure 2.
Predicted versus measured CEC for the sandy loam textural class of Entisols using (a) silt, clay, and SP, (b) clay and SP, (c) clay, (d) silt and clay, (e) silt, (f) silt + clay, and (g) silt and SP as covariates.

Figure 3.
Predicted versus measured CEC for the loamy sand textural class of Entisols using (a) silt, clay, and SP, (b) clay and SP, (c) clay, (d) silt and clay, (e) silt, (f) silt + clay, and (g) silt and SP as covariates.

Conclusion
In this study, PTF models were established to predict soil CEC in the Entisols of NS, including their different common textural classes, to support future land management planning for cultivated and grazing areas in NS. The findings revealed that there are no significant differences between CEC measured and predicted using most of the studied equations. MRs gave the highest values of R2 for all the Entisol textural variations; therefore, these equations are recommended for CEC prediction in the study area (Table 4). The results showed that silt and clay are very important factors for predicting CEC in these soils. Prediction of CEC using soil properties should be attempted for other soils in NS, particularly for the Vertisols of the central clay plain, mainly because these soils are the most extensive soils of agricultural significance in the country. Establishing practical PTF for predicting CEC would be efficient when time and cost are limiting. Consequently, the findings of this study will provide baseline for policymakers and farmers to estimate CEC accurately and adequately in time and helps to prescribe fertilizers. However, developing applicable PTF for a wide range of environmental and geographical regions, PTFs need to be verified in several similar and diverse regions.
Acknowledgements
The authors gratefully thank the Soil Survey Division at Agricultural Research Corporation (ARC), Wad Medani, Sudan, Ministry of Agriculture and Forestry, Sudan, and Soil and Environment Sciences Department, Faculty of Agriculture, University of Khartoum, Sudan for providing us with the soil database and for their continuous assistance throughout this research. M.Z.’s postdoctoral program at Henan University, China, has been supported by the National Key Research and Development Program of China (grant nos. 2017YFA0604302 and 2018YFA0606500).
Author Contributions
Conceptualization: M.S., I.I., M.Z., G.M., M.A., and E.C.B.; validation models test: M.S., M.Z., and G.M.; formal analysis: M.A.; investigation: I.I.; resources: M.A.; data curation: M.A. and I.I.; writing—original draft preparation: M.S., I.I., M.Z., G.M., and E.C.B., writing—review and editing: M.S., M.Z., G.M., and E.C.B., visualization: M.A., and M.S.; supervision: I.I.; project administration: I.I. All authors have read and agreed to the published version of the manuscript.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.