ENVIRONMENTAL MONITORING AND ASSESSMENT, vol.195, no.9, 2023 (SCI-Expanded)
Today, data mining has become a relevant topic in digital soil mapping. In this current study, prediction of some soil properties and their spatial distribution were examined by machine learning algorithms (Support Vector Machine, Artificial Neural Network) using reflectance values of Triplesat satellite image bands in Vezirkopru district of Samsun province. The band data obtained from different wavelengths revealed positive correlations between the electrical conductivity and calcium carbonate equivalent contents of the soils. The support vector machine algorithm was the most successful to estimate the textural fractions, organic matter, electrical conductivity, and calcium carbonate equivalent contents of the soils using the bands obtained from satellite images. The mean absolute error for estimating sand, silt, and clay contents by support vector machine was 4.05%, 3.05%, and 3.66%, respectively. Texture classes were determined with an accuracy of 82% with support vector machine and 60% with artificial neural network. In all estimations, the highest percentage of error was for calcium carbonate equivalent content with very low estimation reliability. The mean absolute percentage of error values for this property are 101.13% and 51.61% for artificial neural network and support vector machine, respectively. Also, in both algorithms, the most successfully estimated soil property was clay fraction of soils. It was also investigated the spatial distribution of actual and estimated values using various interpolation methods (Kriging, inverse distance weighting-, radial basis function). Considering the spatial distributions, it was determined that the most successful method was kriging for sand, silt, and clay contents and inverse distance weighting for electrical conductivity, calcium carbonate equivalent, and organic matter contents. According to our findings, it is concluded that successful estimations and spatial distributions can be made by the support vector machine algorithm using band data from different wavelengths.