πŸ“… March 12, 2026 | INTENSIVE LECTURE SERIES

Advanced Remote Sensing &
Machine Learning for Earth Observation

From raw digital numbers to actionable geospatial intelligence. A satellite signal is merely a spectral proxy for physical reality. Today we master the duality: the foundational physics of radiometric correction and the sophisticated algorithmic logic of Machine Learning, using data from NASA AVIRIS and EO-1 Hyperion sensors.

πŸ‘¨β€πŸ«Dr. Sounny
⏱️10:00 - 17:00
πŸ›°οΈAVIRIS + Hyperion

Workflow Integration Schedule

10:00 - 12:00

Foundations of Signal Integrity

Radiometric/Geometric Correction & Spectral Enhancement. Calibrating DNs into physical units and optimizing the 0-255 brightness range.

12:00 - 14:00

Break & Transition

Long Lunch & Data Ingestion for Afternoon Sessions. Prepare datasets for Change Detection & ML analysis.

14:00 - 15:00

Thematic Synthesis

Supervised/Unsupervised Classification and the Binary Hierarchical Classifier (BHC) Framework.

15:00 - 16:00

Temporal Dynamics

Change Detection, Phenology, and Ensemble Machine Learning for multi-temporal analysis.

16:00 - 17:00

Applied Excellence & Visualization

Random Forest Theory & Professional Cartographic Output. Publication-quality map composition.

πŸ’‘

The Data Science Lifecycle: The progression from morning "Processing & Classification" to afternoon "Change Detection & AI" mirrors the actual professional lifecycle of geospatial data. We do not apply an ensemble classifier like Random Forest to raw data; we apply it to refined, spectrally consistent products. Refining the raw signal is the non-negotiable first step.

Signal to Synthesis: The Spatial Data Pipeline overview showing data ingestion, classification algorithm, feature extraction, and geo-visualization workflow
OVERVIEW Signal to Synthesis: The unified workflow for extracting, categorizing, and communicating high-dimensional Earth observation data.
Section 2

The Standard Processing Pipeline

Raw satellite imagery is rarely suitable for immediate quantitative analysis. Atmospheric scattering and geometric distortions introduce errors that can invalidate downstream models. We must strategically remove these artifacts to ensure data integrity, particularly when dealing with floating-point data such as Sea Surface Temperature (SST) in the SEcoast.dat dataset or high-resolution Landsat TM scenes of Boulder, Colorado.

Three-phase geospatial workflow: Enhance, Classify, Communicate
PHASES The modern geospatial workflow requires continuous translation between machine logic and human perception across three phases.

Morning Session Workflows

πŸ”§ Radiometric Correction

Calibrating raw DNs into physical units (reflectance or degrees Celsius). Removes atmospheric haze and sensor noise.

High-tech diagram of atmospheric correction
AI CONCEPT Signal Purification: Removing atmospheric scattering from the land-reflectance signal.

πŸ“ Geometric Correction

Aligning the image to a map coordinate system (UTM) so every pixel has a real-world location and scale.

🎨 Image Enhancement

Rescaling data to optimize the 0-255 brightness range of digital displays for human interpretation.

🌿 Spectral Unmixing

A single pixel (e.g., 250m MODIS) is a mix of signatures. Just as Purple is a product of red and blue light, a pixel may contain grass, soil, and water. Unmixing "unpacks" the signal to find the proportional reality beneath.

graph LR
    A["Raw Data (DN)"] --> B["Radiometric Calibration"]
    B --> C["Atmospheric Correction"]
    C --> D["Surface Reflectance"]
    D --> E["Feature Engineering"]
    E --> F["ML Classifier"]
    F --> G["Thematic Map"]
                            

Strategic Application of Contrast Stretches

Contrast stretching rescales input data (often floating-point or 16-bit) into the 8-bit (0-255) integer range required for visualization. Each method has a unique strategic application:

Stretch Type Mechanism Strategic Application
Linear (1/2/5%) Clips a specified % of histogram tails and stretches the remainder Standard for general visualization; avoids outliers like clouds
Equalization Scales the input distribution to use the full 0-255 range equally Best for highlighting subtle variations in high-density data regions
Gaussian Fits data to a normal distribution based on standard deviation Enhances features in the middle of the spectral distribution
Square Root Takes the square root of the input histogram before linear stretch Compresses highs to reveal details in darker shadow regions
Logarithmic Non-linear enhancement of low-range brightness values Essential for dark features or low-reflectance targets
Optimized Linear Dynamic range adjustment optimized for 16-bit integer data Maximizes information in midtones, highlights, and shadows
πŸ”¬

"So What?" Layer: Statistical Feature Isolation. The distinction between "Stretch on View Extent" and "Full Extent" is a critical laboratory skill. In the SEcoast.dat image, a significant area of negative values near the bottom can skew global statistics, washing out terrestrial features. By utilizing "Stretch on View Extent," we recalculate statistics based only on visible pixels, effectively isolating land features from the high-variance ocean background.

Contrast stretch algorithms comparison table with visual profiles showing Linear, Equalization, Gaussian, and Logarithmic stretches
VISUAL Contrast stretching forces invisible data distributions into visible, dynamic ranges. Each algorithm produces a distinct visual profile suited to different analytical goals.
The Digital Prism: Color slices converting floating-point SST values into discrete visual categories with histogram analysis
SST The Digital Prism: Color slices convert abstract floating-point Sea Surface Temperature values into sharply delineated visual categories. Note the floating-point overlap rule for shared boundary values.
Section 3

Classification Frameworks

The transition from spectral signatures to thematic knowledge requires a choice between Unsupervised (ISODATA/K-means) and Supervised classification. This choice is governed by the availability of "ground truth" (high-quality labeled training data) and the required degree of human intervention.

Challenges in Hyperspectral Classification

Statistical classification of hyperspectral data is complex due to high dimensionality and limited ground truth.

  • Dimensionality & Sample Size: High-dimensional data requires massive training sets. Small training sets cause standard parameter estimates to become unstable.
  • The Hughes Phenomenon: As dimensionality increases, if the sample size is low, the model overfits to noise rather than finding the true probability distribution.
3D scatter plot showing the Curse of Dimensionality where n is much less than p
DIMENSIONALITY The Curse of Dimensionality: When sensor resolution outpaces your ground truth samples, standard algorithms collapse under the mathematical weight of the noise.

πŸ€– Unsupervised Classification

Algorithms cluster pixels based on inherent spectral similarities without prior "ground truth". The "Big Two" are:

  • K-Means: Fast and simple. You specify a fixed number of clusters, and it assigns pixels based on simple spectral distance.
  • ISODATA: Iterative Self-Organizing Data Analysis Technique. A "smarter" algorithm where you specify a range of clusters; it dynamically splits clusters with high variance and merges clusters that are statistically similar.

Advanced Strategy (Pseudo-Labeling): Sometimes, unsupervised outputs are noisy. You can use K-Means clusters as "labels" to train a Random Forest, which "cleans up" the classification by finding generalized patterns.

🎯 Supervised Classification

Involves training an algorithm with expert-defined ground truth (ROIs). The Art of Training: Proximity matters. You cannot cluster all training points in one spot; you must spread samples across the spectral and spatial extent of the scene to capture variance. The reliability of the output is a direct function of the cleanliness and representativeness of your training data.

🧠 Human Agency vs. The Technological Ceiling

While supervised models learn from us, humans have a unique history of breaking "fixed" environmental rules:

  • Beyond Malthusian Limits: Historically, theorists like Thomas Malthus attempted to calculate physical "ceilings" for population based on land. However, through the Green Revolution and intensive engineering (irrigation, genetic engineering), humans consistently overcome these environmental boundaries.
  • The Policy Duality: In SpaceApps, we recognize that while RS can map a physical limit, human policy and ingenuity are what determine if that limit becomes a collapse or a transition.
πŸ‘Ά

The Cognitive Metaphor: Classification is the most natural human approach. Just as a baby sees the world and groups objects into "goes in mouth" or "doesn't go in mouth" long before learning the word "food," unsupervised algorithms find patterns first; we provide the labels (the words) later.

The Binary Hierarchical Classifier (BHC)

As detailed in the Ham et al. (2005) research, the BHC utilizes a "divide-and-conquer" logic to manage large output spaces in hyperspectral environments:

AI conceptual visualization of the Binary Hierarchical Classifier splitting a complex dataset into meta-classes
AI CONCEPT BHC isolates logic by continuously dividing complex problems into two simple, distinct meta-classes.
graph TD
    A["All Classes"] --> B{"BHC Root Split"}
    B -->|"Meta-Class A"| C["Water + Wetlands"]
    B -->|"Meta-Class B"| D["Land Covers"]
    C --> E{"Sub-Split"}
    E --> F["Open Water"]
    E --> G["Marsh / Swamp"]
    D --> H{"Sub-Split"}
    H --> I["Forest Types"]
    H --> J["Urban / Bare Soil"]
                            
  • Recursive Decomposition: Breaks a multi-class problem into a tree of simpler two-class (meta-class) problems.
  • Class Affinity: The root classifier partitions classes based on natural spectral similarity, allowing easier discriminations (e.g., Water vs. Land) at the top of the hierarchy.
  • Simulated Annealing: Employs the Generalized Framework for Associative Modular Learning Systems (GAMLS) to determine optimal splits.
Decision logic comparison: CART partitioning input space vs BHC decomposing output space
LOGIC CART (left) splits the input space using binary tests on single features, causing size explosions with many classes. BHC (right) decomposes the output space into natural hierarchies based on Maximum Fisher Distance.
⚠️

"So What?" Layer: The Curse of Dimensionality. In hyperspectral environments like the Kennedy Space Center (NASA AVIRIS) or the Okavango Delta (Hyperion), when the number of spectral bands is high and training samples are small, traditional classifiers become unstable. Frameworks like BHC reduce this redundancy while maintaining discriminative power.

Section 4

Change Detection & Temporal Dynamics

Temporal analysis allows us to distinguish between human-driven conversion and natural phenology. This is particularly salient in the distal portion of the Okavango Delta, where seasonal flooding (hydroperiods) dictates vegetation response.

Critical Distinction: Phenology vs. Conversion

πŸ”„ Phenological Cycles (Seasonal)

Predictable variations in spectral response, such as "Hippo Grass" (Class 2) which is seasonally inundated. These represent fluctuating states of the same land cover type.

πŸ”€ Permanent Conversion

A fundamental state shift, such as conversion of "Acacia Woodlands" into settlements. Represents an irreversible change.

πŸ›οΈ

Determinism vs. Governance: Jared Diamond's "Collapse" narrative argues that environmental shifts determine civilizational failure (e.g., Syria's drought). However, a critical distinction in Land Change Science and the study of Coupled Human-Natural Systems is required: while satellite imagery may confirm a drought, drawing a line to causation requires careful interpretation.

πŸ“Š RS Science & Policy: Multi-temporal Remote Sensing (e.g., NASA MODIS/Landsat) acts as the Common Operating Picture for policy-makers. By establishing an objective biophysical baseline, RS forces a debate on whether a crisis is a result of natural "triggers" or systemic "governance failure," moving policy from reactive relief to proactive institutional reform.

AI split comparison rendering showing a seasonal loop vs permanent urbanization change
AI CONCEPT Understanding the difference between a natural Phenological Cycle (left) and an Anthropogenic Permanent Conversion (right).
graph LR
    A["Time 1 Image"] --> C["Image Differencing"]
    B["Time 2 Image"] --> C
    C --> D{"Threshold Analysis"}
    D -->|"Within range"| E["No Change"]
    D -->|"Exceeds range"| F["Change Detected"]
    F --> G{"Temporal Pattern"}
    G -->|"Cyclic"| H["Phenology"]
    G -->|"Permanent"| I["Land Cover Conversion"]
                            

Algebraic vs. Machine Learning Strategies

Monitoring environmental shifts requires robust statistical models over raw pixel subtraction.

  • Algebraic Methods (Image Differencing): Employs a corner method to establish empirical change thresholds between multi-temporal images. Fast, but prone to error in complex scenes.
  • Machine Learning (Post-Classification): Utilizes a decision tree induction algorithm and a post-classification separability matrix. Experimental evaluation on Landsat data proves this ML approach delivers superior accuracy for tracking human impacts and land cover permanent changes.
🌍

Okavango Case Study: The Okavango Delta's annual flood pulse creates a dynamic landscape. Using multi-temporal EVI composites (derived from EO-1 Hyperion), we can distinguish between vegetation zones that respond to seasonal inundation versus those that have permanently shifted due to hydrological or anthropogenic changes.

Map of the Okavango Delta showing permanent, seasonal, and occasional swamps
Satellite imagery classification and inundation frequency map of the Okavango Delta

Source: ScienceDirect - Investigating spatial and temporal dynamics of the Okavango Delta

Section 5

Advanced AI & ML: The Random Forest Framework

To understand a Random Forest, we must first understand its building block: CART (Classification and Regression Trees). A CART is a non-parametric decision tree that handles complex, non-linear satellite data by asking a series of "Yes/No" questions (e.g., "Is the Short-Wave Infrared value very low?"). Unlike older models, CART does not care if your data is "normally distributed."

However, a single CART is prone to Overfittingβ€”it can memorize the noise in specific imagery rather than learning general landscape patterns. The Random Forest (RF) solves this by acting as a "committee of experts." By combining multiple decision trees, RF achieves stability that single-tree models cannot match.

The RF Mechanism

The framework enhances classification through several key mechanisms:

  • Ensemble Voting: Hundreds of individual trees independently classify a pixel; the final output is the category selected by the majority block.
  • Robustness to Noise (Bagging): Bootstrap Aggregating. Every tree is trained on a random subset of the data, ensuring no single anomaly dictates the outcome.
  • Feature Randomness: At every node split, the algorithm randomly selects a subset of features (bands) rather than evaluating all of them. This ensures the trees are "de-correlated" and don't all make the identical mistakes.
  • Handling Imperfect Data: Readily manages missing values via surrogate splits and excels with small or imbalanced ground-truth datasets.
  • Feature Importance Assessment: Naturally calculates "variable importance," allowing analysts to rank which specific spectral bands are most critical for class separation.
RF-BHC Architecture Diagram showing Bagging, Subspace Injection, Tree Construction and the Forest Voting
ARCHITECTURE The RF-BHC Architecture: Combining the natural class affinity of the BHC with the randomized ensemble logic of the forest.
Inside the Node Split showing adaptive feature selection via Fisher Discriminant on an m subset
THE SPLIT Inside the Node: An adaptive, random selection of m features prevents memorization of the training data.
graph TD
    S["Full Training Dataset"] --> B1["Bootstrap Sample 1"]
    S --> B2["Bootstrap Sample 2"]
    S --> Bn["Bootstrap Sample N"]
    B1 --> T1["🌲 Tree 1 - Random Subset of Bands"]
    B2 --> T2["🌲 Tree 2 - Random Subset of Bands"]
    Bn --> Tn["🌲 Tree N - Random Subset of Bands"]
    T1 --> V{"πŸ—³οΈ Majority Vote"}
    T2 --> V
    Tn --> V
    V --> F["Final Classification Map"]
                            

Comparative Analysis: RF-BHC vs. RF-CART

Based on research conducted at the University of Texas Center for Space Research:

Method Diversity (Entropy) Accuracy (Small Samples) CPU Time
RF-BHC Lower (Class Affinity) Higher; exploits spectral groupings 1h 4m 4s (Simulated Annealing)
RF-CART Higher (Diverse Trees) Lower; depends on training statistics 8m 42s (Standard CART)
Performance charts showing RF-BHC maintaining high overall accuracy under low sampling rates compared to RF-CART
PERFORMANCE Performance under pressure: RF-BHC maintains unyielding stability at critically low sample sizes, outperforming standard RF-CART configurations.
🧠

"So What?" Layer: Stability and Generalization. The "Majority Vote" is the secret to RF's success. By averaging many "weak" learners, the ensemble becomes robust against overfitting and outliers. This is especially effective for spatially disjoint areas where class-conditional feature distributions vary.

Visual representation of Random Forest Majority Voting
AI CONCEPT Ensemble Intelligence: Merging hundreds of diverse decision paths into a stable, high-accuracy classification map.

Advanced Hybrids: RF + Convolutional Neural Networks (CNN)

Traditional classifiers like Random Forest often treat a satellite image like a "bag of pixels", ignoring surrounding neighbors. CNNs (Convolutional Neural Networks) work like the human eye. They scan the image using "Convolution" to detect low-level features (edges), mid-level features (shapes), and high-level features (objects like a pivot irrigation field or ship). They are the gold standard for object detection and semantic segmentation.

☁️

CNN vs. CCN Clarity: In Geography, do not confuse CNN (a deep learning machine logic) with CCN (Cloud Condensation Nuclei). CCN are physical aerosols (dust, sea salt) crucial for atmospheric correction and climate modeling that provide the seed surface for water vapor condensation, leading to the Twomey Effect.

Modern spatial data science bridges RF simplicity and CNN spatial awareness using deep learning hybrids:

  • Phase 1 (Pixel-Wise RF): RF handles noisy high-dimensional data, extracting spectral features to generate 2D probability maps identifying likely targets (e.g., distinguishing oil from water). Predictions may be spatially fragmented.
  • Phase 2 (CNN Spatial Refinement): Instead of feeding massive raw 3D data into a CNN, the CNN takes the RF's 2D probability maps as inputs. Using an Encoder-Decoder architecture, the encoder max-pools to find broad context, and the decoder restores spatial resolution.
  • The Result: The hybrid bridges pixel-level probabilistic logic with intelligent spatial feature learning, yielding perfectly contextualized boundaries.
AI schematic of a Random Forest feeding into a CNN neural network
AI CONCEPT The Encoder-Decoder hybrid network pipeline refining RF probability maps.
Section 5.5

Ecological Niche Mapping & SDM

One of the most powerful applications of Remote Sensing is **Species Distribution Modeling (SDM)**. In these models, we aren't necessarily "seeing" the animals or individual plants. Instead, we map their presence by deriving their environmental characteristics entirely from the sensor data.

🐾 Environment as Proxy

We don't know the animal's internal characteristics; we use RS to verify if the Ecological Niche (habitat) is present.

πŸ“‚ Policy Value: RS-derived species probability maps allow governments to delineate Protected Area boundaries and manage invasive species corridors without requiring expensive, localized ground surveys across entire continents.

πŸ‘€ Humans: Niche Creators

Humans use technology to modify the environment (irrigation/HVAC).

πŸ—ΊοΈ RS Science: By mapping Nighttime Lights or Urban Heat Islands with thermal sensors, RS provides the scientific evidence of "human niche creation," allowing urban planners to adjust cooling policies for extreme heat resilience.

Technical Integration: MaxEnt & Machine Learning

Ecological Niche Modeling often uses algorithms like MaxEnt or Random Forest to correlate ground-based occurrence points with continuous Remote Sensing datasets (Predictor Layers). For deeper reading, consult Turner et al. (2003) on Remote Sensing for Modeling Species Distributions.

Section 6

Global Research Network & Applied Excellence

Remote Sensing is a collaborative endeavor integrating climate dynamics, social science, and GeoAI to solve sustainability challenges. Here are key individuals shaping the field:

Faculty Spotlight

ISU Research Highlights

Norbert Muzila
Norbert Muzila
Space Systems Engineer | ISU MSS Alumnus

Norbert is a data science professional developing intelligent space-enabled solutions. In his ISU research, he integrated systems engineering, AI, and GeoAI sensing to advance environmental resilience. He conducted a seminal 25-year EVI time series analysis (2000–2025) of the Okavango Megafan using Google Earth Engine, proving the flood-driven resilience of wetland cores versus climate-sensitive dryland margins.

Explore Megafans Research
Emanuele Gamba
Emanuele Gamba
ISU Researcher & Alumnus

Specialized in automated feature extraction within complex imagery environments. His graduate work and research proposals tackle spatial analysis challenges, mapping rapid environmental changes with AI.

Section 7

Professional Cartographic Communication

A sophisticated model is undermined by poor communication. Publishable maps must translate complex data into accessible knowledge for decision-makers.

Comparison between raw pixel array layout and publishable cartography layout
CONTEXT To be understood, raw data must be enclosed in an architecture of human context. A perfect classification is useless if the composition fails to communicate it.

The Seven Essential Map Elements

πŸ“
Title

Data, location, date

πŸ”²
Neatline

Clean map frame

🧭
North Arrow

Orientation reference

πŸ—οΈ
Legend

All symbology keys

πŸ—ΊοΈ
Inset Map

Geographic context

πŸ“Š
Projection & Source

Author, date, sensor

πŸ“
Scale Bar

Always in Kilometers

Anatomy of a Publishable Spatial Document mapping the 7 map elements
LAYOUT The Anatomy of a Spatial Document: Strategic placement of Title, Neatline, North Arrow, Legend, Inset Map, Source, and Scale Bar.

Best Practices: 3D Surface Visualization

To model topographic complexity (e.g., the Boulder, CO dataset), we combine spectral imagery with a Digital Elevation Model (DEM).

3D DEM modeling layering 30m Digital Elevation Model under Landsat TM composite with Vertical Exaggeration
3D MODELING By correlating single-band elevation arrays with multi-band reflectance images, we break the flat-map paradigm.
ParameterRecommended ValueRationale
DEM ResolutionFull or 64Maximum topographic detail
Vertical ExaggerationFactor of 3.0Multiplies elevation values by a defined factor to make topographic relief distinct and obvious.
Elevation ThresholdMin Plot Value: 1500mFocus on relevant terrain features

Checklist for Publishable Maps

  • ☐ Ensure all units are in Kilometers.
  • ☐ Apply Graticules (Lat/Long grids) for professional reference.
  • ☐ Verify legible text annotations for major features (e.g., "Gulf Stream").
  • ☐ Metadata Citation: Clearly cite data sources as "NASA AVIRIS" or "EO-1 Hyperion."
  • ☐ Export final compositions as high-resolution TIFF or JPEG for publication.
βš–οΈ

Regional Decisions: The Cartographer's Dilemma

Scenario: You are mapping permanent land cover conversion in the Okavango for policymakers managing water rights. Your standard linear stretch makes seasonal swamps look identical to permanent urbanization due to overlapping spectral responses.

Decision: Do you present a raw classification table that is accurate but unreadable, or do you apply an aggressive Gaussian stretch and 3D terrain exaggeration that makes the map beautiful but potentially exaggerates the scale of water loss? How do you balance statistical integrity with compelling political communication?

Role-Playing Exercise

Regional Decisions: The Delta Governance Dispute

πŸ‘€

Scenario: It is 2026. The Okavango Delta is experiencing its most severe hydrological drought in 30 years. Your classification maps (derived from the afternoon ML workflows) show a 40% reduction in seasonal "Hippo Grass" inundation, but a stable permanent wetland core.

Position A: The Agricultural Minister

"Our people need food security. If the water is 'stable' in the core, we should divert just 5% of the upstream flow to irrigate new corn fields to prevent famine."

Position B: The Conservation Lead

"The 'hippo grass' loss shows the system is at a tipping point. Diverting water now will collapse the seasonal pulse, permanently converting the delta into a dust bowl."

Analytical Task for Students

Using the Random Forest Variable Importance metrics and the Change Detection separability matrix we built today, you must provide a 1-page "Decision Memo" to the presidency.

  • Does your data prove permanent conversion or merely seasonal phenology?
  • Is the biophysical stable core resilient enough to survive a 5% diversion?
  • What is the 'Social Cost' of a misclassification in this scenario?

πŸŽ“ Knowledge Check

Test your understanding of today's lecture material.

1. Why is it critical to perform radiometric correction BEFORE applying a Random Forest classifier?

Because Random Forest cannot process raw digital numbers.
Because observed spectral variations must represent land-cover reality, not sensor noise or atmospheric artifacts.
Because radiometric correction reduces the number of bands, solving the curse of dimensionality.

2. What is the key advantage of the BHC framework over traditional multi-class classifiers for hyperspectral data?

It uses more training samples than traditional methods.
It processes data faster using parallel computing.
It decomposes a complex multi-class problem into simpler two-class problems using natural spectral affinity.

3. How does Random Forest achieve stability against overfitting?

By averaging predictions from many diverse, de-correlated decision trees through majority voting.
By using a single, very deep decision tree with pruning.
By reducing the training data to only the most relevant samples.

4. In change detection, what distinguishes a phenological cycle from permanent land cover conversion?

Phenological changes only occur in tropical regions.
Phenological cycles are predictable, seasonal variations in spectral response; conversion is a permanent, irreversible state shift.
Conversion is always human-caused while phenology is always natural.
Review

Summary of Big Ideas

Synthesis flow showing Input, Classification and Output mapping
SYNTHESIS True spatial data science does not end with a high accuracy metric. It ends when complex systems are translated into universally understood visual truths.
  • βœ“ Signal Integrity is Non-Negotiable: Raw data must be corrected radiometrically and geometrically before any quantitative analysis. Garbage in, garbage out.
  • βœ“ Classification is a Spectrum: From simple K-means to sophisticated BHC hierarchies, the choice depends on data complexity and available ground truth.
  • βœ“ Temporal Context Matters: Distinguishing phenology from conversion prevents misclassification in dynamic environments like the Okavango Delta.
  • βœ“ Ensemble Methods Win: Random Forest's "majority vote" provides stability that single-tree classifiers cannot, especially with limited training data.
  • βœ“ Communication is Science: A publishable map with all seven essential elements is the final, critical step in the scientific workflow.