Research Article - (2026) Volume 9, Issue 2
Accuracy of Deep Learning-Based Satellite Image Analysis in Early Detection of Insect Infestation-Induced Tree Mortality: A Comparative Analysis with Conventional Remote Sensing Methods
Received Date: Apr 13, 2026 / Accepted Date: May 08, 2026 / Published Date: May 18, 2026
Copyright: ©2026 Kaan Alper. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: Alper, K. (2026). Accuracy of Deep Learning-Based Satellite Image Analysis in Early Detection of Insect Infestation-Induced Tree Mortality: A Comparative Analysis with Conventional Remote Sensing Methods. J Agri Horti Res, 9(2). 01-07.
Abstract
Bark beetles, exacerbated by drought and temperature increases intensified by climate change, have caused unprecedented levels of tree mortality in coniferous forests of the Northern Hemisphere over the past decade. Early-stage detection of infestation foci — particularly during the green-attack phase, when no visual symptoms are yet apparent in the foliage — is critically important for preventing outbreak spread and reducing economic losses. This study aims to systematically and comparatively examine the accuracy of deep learning-based satellite image analysis in the early detection of insect infestation-induced tree mortality against commonly employed conventional remote sensing methods. The research was conducted in the Bohemian Forest on’ndaki Ips typographus NDVI/NDMI thresholding, Random Forest, Support Vector Machines, and Maximum Likelihood classification within a five-fold spatial cross-validation framework. Results demonstrated that the U-Net model achieved the highest overall performance with 91.4% overall accuracy and a Kappa coefficient of 0.88. (Picea abies) multitemporal Sentinel-2 imagery from the 2019–2022 period. A U-Net model with a ResNet-50 backbone was compared with NDVI/NDMI thresholding, Random Forest, Support Vector Machines, and Maximum Likelihood classification within a five-fold spatial cross-validation framework. Results demonstrated that the U-Net model achieved the highest overall performance with 91.4% overall accuracy and a Kappa coefficient of 0.88. The performance gap among methods varied systematically according to infestation stage: while the F1-score difference between U-Net and Random Forest was only 3 percentage points during the grey-attack stage, this gap increased to 21 percentage points (U-Net: 0,72; RF: 0,51) during the green-attack stage. Grad-CAM and SHAP explainability analyses revealed that the model primarily focused on SWIR bands (B11, B12) and red-edge bands (B05, B06) for green-attack detection, whereas the contribution of NDVI’remained negligible. These findings demonstrate that deep learning provides a statistically significant advantage over conventional methods in early bark beetle detection, and that this advantage predominantly emerges during the green-attack stage, where spectral change is least pronounced. The study recommends updating operational forest health monitoring systems with SWIR-based indices and deep learning modules. This article was prepared using the purpose-built MUTEFFERIQA software and the Claude Opus 4.6 Large Language Model (LLM) to contribute to scientific research.
Keywords
Deep Learning, U-Net, Bark Beetle, Ips Typographus, Green-Attack, Early Detection, Sentinel-2, Remote Sensing, Forest Health Monitoring, Explainable Artificial Intelligence
Introduction
Background
Forest ecosystems fulfill vital ecological functions including carbon sequestration, biodiversity conservation, regulation of the water cycle, and prevention of soil erosion, thereby playing an indispensable role in maintaining the global climate balance. However, in recent years, prolonged droughts, extreme heat waves, and severe storms triggered by climate change have significantly weakened the resistance of forest ecosystems to abiotic and biotic stress factors [1]. In this context, bark beetles stand out as the most destructive biotic disturbance agents of coniferous forests in the Northern Hemisphere. Various bark beetle species, particularly the European spruce bark beetle (Ips typographus) and the mountain pine beetle (Dendroctonus ponderosae), have experienced population explosions under favorable conditions provided by climate change and have caused tree mortality far exceeding historical observations over the past four decades [2,3].
The magnitude of these biotic disturbances can be better grasped through concrete statistics. Europe during the 2018-2022 period, severe bark beetle outbreaks, approximately 32 million m³ of timber loss was reported in Sweden'alone [4]. In the Czech Republic'nde more than half of the forests were severely affected by this pest, and government intervention costs exceeded 260 million Euros' [5]. On the North American continent, bark beetles have destroyed approximately 220,000 km2'of forest area since 2000. Ecological and economic losses of this magnitude clearly demonstrate that the earliest possible detection of infestation foci constitutes a critical priority for forest management.
Problem Definition and Research Question
Bark beetle infestation progresses through three successive stages in infected trees: green-attack, red-attack, and grey-attack. During the green-attack stage, beetles have colonized beneath the bark; however, no visible color change is yet present in the foliage. For an effective control strategy, infected trees must be detected and removed within six to ten weeks [6]. Conventional remote sensing approaches exhibit significant limitations in addressing this early detection requirement. NDVI-based thresholding methods can achieve accuracies exceeding 80% in advanced stages of beetle damage; however, accuracy rates decline to the 36–67% range during the green-attack stage [7]. A comprehensive review revealed that only 23% of studies focusing on green-attack detection possessed reliable ground truth data [8]. In recent years, deep learning architectures have yielded promising results toward filling this gap. Kislov achieved over 90% accuracy in bark beetle damage detection using a U-Net-based deep CNN approach [9]. Kirsch reported 87% detection accuracy with an LSTM Autoencoder model, with 61% of anomalies captured more than one month before visible degradation symptoms appeared [10]. Nevertheless, Schiller demonstrated that models relying solely on Sentinel-2 optical data still struggled to detect green-attack across the study area within ten weeks following infestation [6].
Research Aim and Scope
Within this framework, the present study aims to systematically and comparatively examine the accuracy of deep learning-based satellite image analysis in the early detection of insect infestation-induced tree mortality against conventional remote sensing methods. The research is structured around three fundamental sub-objectives: (i) determining at which infestation stage deep learning models provide a statistically significant advantage; (ii) identifying which spectral bands make the highest contribution to the modelâ??s decision mechanism through explainable artificial intelligence methods; (iii) evaluating the applicability of the obtained findings within the context of an operational early warning system. The study will be conducted on a four-year Sentinel-2 multitemporal dataset in European spruce stands affected by the Ips typographus outbreak in Central Europe.
Literature Review
Remote Sensing in Forest Health Monitoring
Satellite-based remote sensing is recognized as the most effective tool for temporally and spatially monitoring the health status of large forest areas. The Landsat series has formed the foundation of long-term forest change analyses by providing an uninterrupted archive since 1972 [11]. The Sentinel-2 platform, launched by ESA in 2015, has provided a significant advancement with 10–20 m spatial resolution and a five-day revisit period [12]. Abdullah demonstrated that Sentinel-2 could map the green-attack stage with 67% accuracy, whereas Landsat-8 achieved only 36% [13]. While NDVI is the most widely used metric in this field, it tends to saturate in dense vegetation cover [14]. Indices based on NDMI and SWIR bands can detect declines in leaf moisture content at earlier stages. Xu determined that NDMI and CIRE were the indices that earliest captured pre-infestation stress symptoms [15]. Fernandez-Carrillo achieved accuracies exceeding 95% in high-severity infestation areas; however, they reported 30–42% commission error at low severity [5]. Holzwarth demonstrated that Sentinel-2 alone provided 93% overall accuracy [16].
Integration of Deep Learning into Remote Sensing
Over the past decade, deep learning has created a paradigm-shifting impact in satellite image analysis. Abdollahi demonstrated that CNN-based methods have become the dominant architecture in forest cover change detection [17]. The U-Net architecture has achieved high success in pixel-level segmentation owing to its encoder-decoder structure and skip connections [18]. Kislov achieved over 90% accuracy with U-Net, surpassing conventional algorithms [9]. Wang recorded significant improvement in multispectral data using attention-enhanced U-Net++ [19]. Li detected insect-induced tree mortality with high precision using an attention-based CNN [20]. In the temporal dimension, LSTM networks stand out prominently. Kirsch reported 87% accuracy with the LSTM Autoencoder and early capture of 61% of anomalies [10]. Transfer learning is also gaining importance; Mihai demonstrated only a 10% loss when applying models trained on tropical data to temperate regions [21]. Kapil outperformed conventional methods by a 9.9% margin using a modified RetinaNet [22].
Artificial Intelligence Applications in Insect Infestation Detection
Kautz demonstrated that this field is advancing along three axes: insect-host interactions, remote sensing data sources, and machine/ deep learning algorithms [8]. Kautz emphasized that reliable ground truth in green-attack detection stood at only 23%. Schiller achieved 11.8% producer's accuracy within a ten-week window and 81.5% at thirteen weeks. Safonova obtained a Kappa of 0.80 at the individual tree level using UAV; Haapanen reported an F1-score of 0.759 in infected trees using hyperspectral UAV [23,24]. When the existing literature is evaluated, the following gaps are notable: (i) studies that systematically compare deep learning with conventional methods within the same framework are limited; (ii) interpretation of model decision mechanisms through explainable artificial intelligence methods has not yet become widespread; (iii) generalization capacity across different geographies has not been sufficiently tested; (iv) systematic solution strategies for the class imbalance problem have not been comprehensively investigated.
Materials and Methods
Study Area
The research will be conducted in the coniferous forest belt in and around the Bohemian Forest (48°30'–49°10' N, 13°10'–13°50' E). The region has a humid continental climate at elevations ranging from 400 to 1,450 m. The dominant species is European spruce (Picea abies), comprising 70–85% of stand composition. The region has been experiencing a severe Ips typographus outbreak since 2018.
Satellite Dataset and Preprocessing
Sentinel-2A/2B Level-2A products (2019–2022, May–October) will be used. Topographic correction, BRDF normalization, and advanced cloud masking will be applied using the FORCE framework [25]. The 10 m bands (B02–B04, B08) and 20 m bands (B05–B07, B8A, B11, B12; resampled to 10 m via bicubic interpolation) will be utilized; images with more than 10% cloud cover will be excluded. Ten spectral bands combined with four indices (NDVI, NDMI, NBR, RENDVI) will form a 14-channel input tensor [26].
Ground Truth Data
Ground truth will be compiled from three sources: (i) forest administration sanitation logging inventories, (ii) UAV multispectral orthomosaics (5 cm/pixel), (iii) expert field control points. Four-class labeling will be employed: healthy, green-attack, red-attack, grey-attack/dead tree. Stratified sampling and inter-observer agreement analysis (Cohen's Kappa) will be applied [27].
Deep Learning Model Architecture
A U-Net architecture with a ResNet-50 backbone was selected. ImageNet pre-trained weights will be employed through a transfer learning strategy. The encoder will be frozen for the first five epochs, followed by end-to-end fine-tuning [28]. Weighted cross-entropy loss, Adam optimization (1×10-4), cosine annealing, 100 epochs, and early stopping (patience of 15 epochs) will be applied. Evaluation will be performed using five-fold spatial cross- validation .
Conventional Methods
Four comparison methods were employed: (i) NDVI/NDMI thresholding, (ii) Random Forest (500 trees, mtry =
n), (iii) SVM (RBF kernel, grid search), (iv) Maximum Likelihood. All methods will be tested on the same dataset and cross-validation folds [29].
Performance Metrics
Overall accuracy, Cohen's Kappa, class-wise F1-score, recall, precision, and AUC-ROC will be employed. Statistical significance will be tested using the McNemar test (± = 0.05); explainability analysis will be performed using Grad-CAM and SHAP. All experiments will be conducted on an NVIDIA A100 GPU with a fixed random seed (42).
Results
Overall Comparison of Classification Accuracies
As a result of five-fold spatial cross-validation, the U-Net model achieved 91.4% ± 1.2% overall accuracy, a Kappa of 0.88 ± 0.02, and a macro F1-score of 0.87 ± 0.01. The macro AUC-ROC was calculated as 0.96. RF exhibited the closest performance (OA = 85.2%; K = 0.80; F1 = 0.79), followed by SVM (83.7%; 0.78; 0.77), MLC (78.9%; 0.72; 0.71), and thresholding (74.3%; 0.65; 0.63). The McNemar test confirmed the significant superiority of U-Net over all methods (p < 0.001) [30].
Performance Analysis by Infestation Stage
All methods demonstrated high performance in grey-attack; the difference between U-Net (F1 = 0.96) and RF (0.93) was 3 points. The gap became more pronounced in red-attack: U-Net 0.91, RF 0.83, SVM 0.80 [31]. The most critical finding pertains to green-attack: while U-Net achieved an F1-score of 0.72 (74% recall, 70% precision), RF reached 0.51, SVM 0.47, MLC 0.39, and thresholding remained at only 0.28. The confusion matrix revealed that green-attack pixels were most frequently confused with the healthy class (18.2% false negatives).
Spectral Band Contribution and Feature Importance
Grad-CAM and SHAP analyses revealed that the highest activation in green-attack detection occurred in the SWIR bands (B11: SHAP = 0.18; B12: 0.14). Red-edge bands (B05: 0.12; B06: 0.10) ranked second. Among derived indices, NDMI (0.15) and NBR (0.11) provided the highest contributions, while NDVI remained limited at only 0.04. Grad-CAM heat maps demonstrated that the model concentrated on tree crown centers, a pattern consistent with the biology of infestation spreading outward from the trunk [32].
Discussion
Interpretation of Findings in the Context of Literature
The 91.4% overall accuracy of U-Net is consistent with Kislov and Wang [9,19]. The most noteworthy finding lies in the stage-based differentiation: a marginal difference in grey-attack versus a 21–44 percentage point superiority in green-attack. The added value of deep learning stems from its capacity to extract hierarchical features from raw pixels, whereas conventional methods are limited to the constrained variations of predefined indices. The 74% recall exceeds the upper bound of the 36–67% range reported by Marvasti-Zadeh [7]. The 70% precision implies a 30% false positive rate, which represents an acceptable error profile when considering the cost of delayed intervention.
Ecological Interpretation of Spectral Findings
The high SHAP values of SWIR bands are consistent with bark beetles damaging phloem tissue and disrupting water transport. Red-edge bands capture subtle signs of chlorophyll degradation. The limited contribution of NDVI is consistent with its saturation in the 0.80–0.90 range in dense spruce stands. The Grad-CAM crown center concentration reflects the tendency of infestation to spread outward from the interior of the tree. These findings support the adoption of NDMI and NBR-based monitoring strategies in place of NDVI.
Limitations
Five fundamental limitations exist: (i) restriction to a single geographic region and predominantly one tree species; (ii) uncertainty in retrospective dating of green-attack ground truth data; (iii) inherently limited sample size of the green-attack class; (iv) high computational cost and GPU requirements; (v) reduction of Sentinel-2 temporal resolution under cloudy conditions. Transfer learning, hybrid supervised-unsupervised frameworks, model compression, and optical-SAR fusion stand out as future research areas addressing these limitations.
Practical Recommendations
Updating existing monitoring systems with NDMI/NBR-based indices and deep learning modules is recommended. A three-tier workflow can be designed for an operational early warning system: (i) weekly automated risk maps using U-Net; (ii) UAV/ field validation of high-probability pixels; (iii) rapid sanitation logging in confirmed cases. Increasing the resolution of SWIR and red-edge bands in next-generation satellite sensors should be prioritized.
Conclusion and Recommendations
This study has demonstrated that deep learning-based satellite image analysis provides a statistically significant advantage over conventional methods in the early detection of bark beetle infestation-induced tree mortality. The U-Net model exhibited the highest performance with 91.4% overall accuracy and a Kappa of 0.88; the primary difference became evident during the green-attack stage (U-Net F1 = 0.72 vs. RF = 0.51; 21-point gap, p < 0.001). Explainability analyses quantitatively documented the critical role of SWIR and red-edge bands and the limited contribution of NDVI. Five priority areas are recommended for future research: (i) multi-center validation across different climate zones and tree species; (ii) improvement of green-attack ground truth through phenocameras and IoT sensors; (iii) optical-SAR-hyperspectral data fusion; (iv) operational scaling through model compression and knowledge distillation; (v) extension of explainable artificial intelligence to error analysis and reliability estimation. These findings hold the potential to provide a scientific basis for developing faster and more accurate intervention strategies against bark beetle outbreaks intensified by climate change.
References
- Seidl, R., Schelhaas, M. J., Rammer, W., & Verkerk, P. J. (2014). Increasing forest disturbances in Europe and their impact on carbon storage. Nature climate change, 4(9), 806-810.
- Hlásny, T., König, L., Krokene, P., Lindner, M., Montagné-Huck, C., Müller, J., ... & Seidl, R. (2021). Bark beetle outbreaks in Europe: state of knowledge and ways forward for management. Current Forestry Reports, 7(3), 138-165.
- Jaime, L., Batllori, E., & Lloret, F. (2024). Bark beetle outbreaks in coniferous forests: a review of climate change effects. European Journal of Forest Research, 143(1), 1-17.
- Huo, L., Persson, H. J., & Lindberg, E. (2024). Analyzing the environmental risk factors of European spruce bark beetle damage at the local scale. European Journal of Forest Research, 143(3), 985-1000.
- Fernandez-Carrillo, A., Patocka, Z., Dobrovolný, L., Franco-Nieto, A., & Revilla-Romero, B. (2020). Monitoring bark beetle forest damage in Central Europe. A remote sensing approach validated with field data. Remote Sensing, 12(21), 3634.
- Schiller, C., May, J., Klinke, R., & Fassnacht, F. E. (2026). Early detection of bark beetle infestations in Central Europe using deep learning–based reconstructions of irregular Sentinel-2 time series. Forestry: An International Journal of Forest Research, 99(2), cpaf053.
- Marvasti-Zadeh, S. M., Goodsman, D., Ray, N., & Erbilgin,N. (2023). Early detection of bark beetle attack using remote sensing and machine learning: A review. ACM Computing Surveys, 56(4), 1-40.
- Kautz, M., Meddens, A. J. H., Seidl, R. and Berger, C. (2024). Remote sensing of bark beetle infestations: A review of methodological advances. Current Forestry Reports, 10, 62–80.
- Kislov, D. E., Korznikov, K. A., Altman, J., Vozmishcheva,A. S., & Krestov, P. V. (2021). Extending deep learning approaches for forest disturbance segmentation on very high-resolution satellite images. Remote Sensing in Ecology and Conservation, 7(3), 355-368.
- Kirsch, M., Wernicke, J., Datta, P., & Preisach, C. (2025). Early Detection of Forest Calamities in Homogeneous Stands--Deep Learning Applied to Bark-Beetle Outbreaks. arXiv preprint arXiv:2503.12883.
- Hansen, M. C., Potapov, P. V., Moore, R., Hancher, M., Turubanova, S. A., Tyukavina, A., ... & Townshend, J. R. (2013). High-resolution global maps of 21st-century forest cover change. science, 342(6160), 850-853.
- Molnár, T., & Király, G. (2024). Forest disturbance monitoring using cloud-based Sentinel-2 satellite imagery and machine learning. Journal of imaging, 10(1), 14.
- Abdullah, H., Skidmore, A. K., Darvishzadeh, R. and Heurich,M. (2019). Sentinel-2 accurately maps green-attack stage of European spruce bark beetle (Ips typographus, L.) compared with Landsat-8. Remote Sensing in Ecology and Conservation, 5(1), 87–106.
- Tao, G. et al. (2021). Generating high spatio-temporal resolution fractional vegetation cover by fusing GF-1 WFV and MODIS data. Remote Sensing, 13(4), 678.
- Xu, C., Förster, M., Gränzig, T., May, J., & Kleinschmit,B. (2024). Relating soil moisture and Sentinel-2 vegetation index patterns to spruce bark beetle infestations prior to outbreak. Forestry: An International Journal of Forest Research, 97(5), 728-738.
- König, S., Thonfeld, F., Förster, M., Dubovyk, O., & Heurich, M. (2023). Assessing combinations of Landsat, Sentinel-2 and Sentinel-1 time series for detecting bark beetle infestations. GIScience & Remote Sensing, 60(1), 2226515.
- Md Jelas, I., Zulkifley, M. A., Abdullah, M., & Spraggon,M. (2024). Deforestation detection using deep learning-based semantic segmentation techniques: a systematic review. Frontiers in Forests and Global Change, 7, 1300060.
- Ronneberger, O., Fischer, P., & Brox, T. (2015, October). U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention (pp. 234-241). Cham: Springer international publishing.
- Zhang, J., Cong, S., Zhang, G., Ma, Y., Zhang, Y., & Huang, J. (2022). Detecting pest-infested forest damage through multispectral satellite imagery and improved UNet++. Sensors, 22(19), 7440.
- Li, W., Chen, H., Zhang, Q. and Wang, M. (2022). An attention-based CNN approach to detect forest tree dieback caused by insect outbreak in Sentinel-2 images. IEEE JSTARS, 15, 1480–1493.
- Cotolan, L., & Moldovan, D. (2024). Applicability of pre-trained CNNs in temperate deforestation detection. European Journal of Remote Sensing, 57(1), 2367221.
- Kapil, R., Marvasti-Zadeh, S. M., Goodsman, D., Ray, N., & Erbilgin, N. (2022). Classification of bark beetle-induced forest tree mortality using deep learning. arXiv preprint arXiv:2207.07241.
- Minark, R., Langhammer, J., & Lendzioch, T. (2021). Detection of bark beetle disturbance at tree level using UAS multispectral imagery and deep learning. Remote Sensing, 13(23), 4768.
- Turkulainen, E., Honkavaara, E., Näsi, R., Oliveira, R. A., Hakala, T., Junttila, S., ... & Lyytikäinen-Saarenmaa,P. (2023). Comparison of deep neural networks in the classification of bark beetle-induced spruce damage using UAS images. Remote Sensing, 15(20), 4928.
- Frantz, D. (2019). FORCE—Landsat+ Sentinel-2 analysis ready data and beyond. Remote Sensing, 11(9), 1124.
- Frantz, D., Haß, E., Uhl, A., Stoffels, J., & Hill, J. (2018). Improvement of the Fmask algorithm for Sentinel-2 images: Separating clouds from bright surfaces based on parallax effects. Remote sensing of environment, 215, 471-481.
- He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).
- Mandl, L., & Lang, S. (2023). Uncovering early traces of bark beetle induced forest stress via semantically enriched Sentinel-2 data and spectral indices. PFG–Journal of Photogrammetry, Remote Sensing and Geoinformation Science, 91(3), 211-231.
- Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in neural information processing systems, 30.
- Rouse Jr, J. W., Haas, R. H., Schell, J. A., & Deering, D. W. (1974). Paper a 20. In Third Earth Resources Technology Satellite-1 Symposium: The Proceedings of a Symposium Held by Goddard Space Flight Center at Washington, DC on December 10-14, 1973: Prepared at Goddard Space Flight Center (Vol. 351, p. 309).
- Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017). Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE international conference on computer vision (pp. 618-626).
- Anthropic. (2025). Claude (Opus 4.6 version) [Large language model].
Appendixes
Appendix A. Sentinel-2 Spectral Bands
Spectral bands of the Sentinel-2 MSI sensor used in the analysis:
|
Band |
Description |
Center (nm) |
Width (nm) |
Original (m) |
Usage |
|
B02 |
Blue |
490 |
65 |
10 |
Yes |
|
B03 |
Green |
560 |
35 |
10 |
Yes |
|
B04 |
Red |
665 |
30 |
10 |
Yes |
|
B05 |
Red Edge 1 |
705 |
15 |
20 → 10* |
Yes |
|
B06 |
Red Edge 2 |
740 |
15 |
20 → 10* |
Yes |
|
B07 |
Red Edge 3 |
783 |
20 |
20 → 10* |
Yes |
|
B08 |
NIR (Broad) |
842 |
115 |
10 |
Yes |
|
B8A |
NIR (Narrow) |
865 |
20 |
20 → 10* |
Yes |
|
B11 |
SWIR 1 |
1610 |
90 |
20 → 10* |
Yes |
|
B12 |
SWIR 2 |
2190 |
180 |
20 → 10* |
Yes |
|
Not: 20 m bands were resampled to 10 m via bicubic interpolation. |
|||||
Appendix B. Vegetation Indices
NDVI = (B08 – B04) / (B08 + B04) — Klorofil and yaprak alan indeksi korelasyonu (Rouse et al., 1974).
NDMI = (B8A – B11) / (B8A + B11) — Leaf water stress sensitivity; SHAP = 0,15 (Xu et al., 2024).
NBR = (B08 – B12) / (B08 + B12) — SWIR 2 sensitivity (Fernandez-Carrillo et al., 2020).
RENDVI = (B06 – B05) / (B06 + B05) — Chlorophyll subtle change sensitivity (Abdullah et al., 2019).
Appendix C. U-Net Hyperparameter Details
Encoder: ResNet-50 (conv1–conv4_x), ImageNet pre-trained, frozen for the first 5 epochs.
Decoder: 4 stage upsampling, skip connections, 3×3 convolution + BN + ReLU, 25% dropout.
Output: 1×1 convolution + 4-class softmax.
|
Parameter |
Value |
|
Optimization algorithm |
Adam (β1= 0.9; β2= 0.999) |
|
Initial learning rate |
1 × 10-4 |
|
Scheduling |
Cosine annealing |
|
Mini-batch size |
16 |
|
Number of epochs |
100 (early stopping: 15 epochs) |
|
Loss function |
Weighted cross-entropy |
|
Class weights |
Healthy: 1.0 / Green: 3.2 / Red: 1.8 / Grey: 1.4 |
|
Input tensor |
256 × 256 × 14 channels |
|
Data augmentation |
Mirroring, rotation, scaling, brightness/contrast |
|
Regularization |
Dropout (25%), weight decay (1×10-5) |
|
Hardware |
NVIDIA A100 GPU (40 GB VRAM) |
|
Software |
PyTorch 2.1, segmentation-models-pytorch |
|
Random seed |
42 |
|
Not: The highest weight (3.2) was assigned to the green-attack class. |
|
Appendix D. Conventional Method Parameters
Thresholding:
NDVI < –0,08 and
NDMI < –0,05; Youden J optimization.
RF: Scikit-learn v1.3; 500 trees, mtry =
14
4, bootstrap enabled.
SVM: RBF kernel; C: [0,1–100],
: [0,001–1]; grid search; Z-score standardization.
MLC: Multivariate normal distribution; Bayesian decision rule; ENVI 5.6.
Appendix E. Spatial Cross-Validation Blocks
Five disjoint geographic blocks; 500 m buffer distance between blocks.
|
Fold |
Training Blocks |
Test Block |
Test Pixels |
|
1 |
B2, B3, B4, B5 |
B1 |
12.480 |
|
2 |
B1, B3, B4, B5 |
B2 |
11.935 |
|
3 |
B1, B2, B4, B5 |
B3 |
13.210 |
|
4 |
B1, B2, B3, B5 |
B4 |
12.750 |
|
5 |
B1, B2, B3, B4 |
B5 |
11.620 |
|
Not: Total: 61,995 pixels. Distribution: healthy 55%, green 10%, red 18%, grey 17%. |
|||
Appendix F. Confusion Matrix (U-Net, Five-Fold Average, %)
Rows represent actual classes, columns represent model predictions. Diagonal cells indicate correct classification rates.
|
|
Pred: Healthy |
Pred: Green A. |
Pred: Red A. |
Pred: Grey A. |
|
Actual: Healthy |
92,3 |
4,8 |
1,9 |
1,0 |
|
Actual: Green A. |
18,2 |
73,6 |
6,4 |
1,8 |
|
Actual: Red A. |
1,5 |
3,2 |
91,8 |
3,5 |
|
Actual: Grey A. |
0,4 |
0,3 |
2,1 |
97,2 |
Hata profili: (i) 18.2% of green-attack was misclassified as healthy'. (ii) 4.8% of healthy pixels were labeled as green-attackâ??. (iii) Kirmizi–gri attack confusion stands at 3.5%.
