← Back to Blog

Mapping Soil Carbon from Space: Machine Learning Meets Conservation Agriculture

Accurate soil organic carbon (SOC) monitoring is essential for sustainable agriculture and climate change mitigation, yet traditional laboratory methods remain expensive, time-consuming, and spatially limited. Our recent study published in Smart Agricultural Technology demonstrates how freely available satellite data combined with advanced machine learning can transform SOC assessment—achieving 91% prediction accuracy across two ecologically contrasting sites.

The Challenge: Monitoring Carbon Across Diverse Landscapes

Traditional SOC measurement involves labor-intensive soil sampling and chemical testing that captures only point-based snapshots. For farmers adopting conservation agriculture (CA) practices like no-tillage and mulching, proving carbon sequestration benefits requires scalable, high-resolution monitoring—particularly for accessing emerging carbon markets.

We developed a dual-sensor framework integrating:

  • Sentinel-1 synthetic aperture radar (SAR) for soil moisture and structure
  • Sentinel-2 multispectral imagery (MSI) for vegetation indices
  • XGBoost machine learning with active learning optimization

The framework was tested across contrasting environments:

Niigata, Japan (temperate, sandy soils)

  • 37°51'N, 138°56'E
  • 1,800 mm annual rainfall
  • Sandy loam (80% sand, 15% silt, 5% clay)
  • Low organic matter (2%)

Zio, Togo (tropical, clayey soils)

  • 6°42'N, 1°15'E
  • 1,200–1,500 mm annual rainfall
  • Clay-rich soil with high organic carbon (5%)
  • Tropical agricultural calendar (March–September)

Key Findings: XGBoost Outperforms Traditional Models

Our machine learning comparison revealed clear performance hierarchies:

XGBoost (eXtreme Gradient Boosting)

  • Cross-validation R² = 0.88
  • Test R² = 0.91
  • RMSE = 0.17 t C ha⁻¹
  • Superior handling of nonlinear relationships and heterogeneous data

Random Forest

  • Test R² = 0.87
  • RMSE = 0.27 t C ha⁻¹
  • Competitive performance with higher interpretability

Support Vector Machine

  • Test R² = 0.80
  • RMSE = 0.34 t C ha⁻¹
  • Challenges with large, multivariate datasets

The XGBoost model achieved exceptional accuracy while maintaining generalizability across both temperate and tropical conditions—a critical requirement for operational carbon monitoring systems.

Vegetation Indices as SOC Predictors

Spectral vegetation indices showed strong correlations with SOC content, confirming the dominant role of plant cover in shaping carbon storage:

  • SAVI (Soil-Adjusted Vegetation Index): r = 0.75
  • EVI (Enhanced Vegetation Index): r = 0.72
  • NDVI (Normalized Difference Vegetation Index): r = 0.71
  • NDWI (Normalized Difference Water Index): r = 0.65

These correlations enabled accurate SOC prediction based on vegetation health and biomass inputs—key indicators farmers can influence through management decisions.

Spatial Variability Reveals Management Opportunities

High-resolution SOC maps revealed significant spatial patterns:

Niigata Site

  • SOC range: 1.2 to 3.8 t C ha⁻¹
  • Higher values in densely vegetated zones
  • Sandy soil challenges requiring continuous organic inputs

Zio Site

  • SOC range: 0.9 to 3.2 t C ha⁻¹
  • Clay-rich soils support better carbon stabilization
  • Year-round biomass input under tropical conditions

These maps enable farmers to identify carbon-deficient zones and prioritize where organic amendments or conservation practices will yield the greatest benefits.

Conservation Agriculture Impact: Climate and Soil Texture Matter

Our results confirm that both climatic regime and soil texture jointly control SOC stocks under conservation agriculture:

Sequestration Rates

  • Tropical clayey soils (Togo): 1.83 t C ha⁻¹ yr⁻¹
  • Temperate sandy soils (Niigata): 1.04 t C ha⁻¹ yr⁻¹

Fine-textured clay soils retain organic matter through micro-aggregate protection and reduced decomposition rates. Sandy soils depleted quickly without residue inputs but responded positively to no-tillage combined with mulching.

The key insight: vegetation cover and residue management, rather than climate alone, are the primary levers for building SOC under conservation agriculture.

Methodological Innovation: Dual-Sensor Fusion

While XGBoost is established, its integration with Sentinel-1 SAR + Sentinel-2 MSI feature stacks across ecologically divergent sites is rare. Our workflow delivers several advantages:

Data Economy

  • Free Sentinel-1/2 imagery eliminates expensive UAV flights
  • Requires only 10–20 ground samples per new area
  • Standard CPU processing (no GPU costs)

Active Learning Optimization

  • Binary land-use classification (93.45% accuracy) guides sampling
  • Targeted bare-soil point selection improves SOC estimation
  • Five-fold cross-validation ensures robustness

Geographic Transferability

  • Combined-site model maintains high accuracy across both regions
  • Sensor fusion captures complementary information (SAR structure + MSI spectral data)
  • Addresses limitations of single-sensor workflows

Practical Applications for Climate-Smart Agriculture

1. Carbon Market Verification

Fine-resolution SOC maps enable:

  • Documentation of carbon-credit baselines and additionality
  • Quantification of sequestration gains from CA adoption
  • Monitoring, reporting, and verification (MRV) for carbon markets

2. Site-Specific Management

  • Locate sub-field carbon deficits
  • Prioritize organic amendments and cover crops
  • Optimize residue allocation across spatial gradients

3. Extension Services

  • Evidence-based recommendations for conservation practice adoption
  • Visual communication of carbon benefits to farmers
  • Tracking multi-year SOC trajectories at farm scale

4. Regional Carbon Inventories

  • Scalable framework for national soil carbon accounting
  • Low-cost alternative to traditional laboratory-based surveys
  • Integration with climate mitigation policies

Seasonal Dynamics: Temperate vs. Tropical

Our temporal analysis of five spectral indices revealed distinct patterns:

Niigata (Temperate)

  • Stronger seasonal amplitude (NDVI range ≈ 0.45)
  • Sharp spring rise and post-summer decline
  • Well-defined growing cycle
  • Surface moisture dips during dry periods

Togo (Tropical)

  • Stable high values (NDVI range ≈ 0.35)
  • Continuous greenness year-round
  • Prolonged rainy season maintains vegetation
  • Consistently high water availability

These patterns reflect how climate and management interact to control carbon inputs—critical for adapting SOC strategies to local conditions.

Limitations and Future Directions

Multi-Year Validation

Our framework requires 5–10 year confirmation under multi-year climate variability to verify long-term SOC accumulation rates.

Sampling Depth

The 0–15 cm topsoil focus captures management-driven changes but misses deeper carbon pools. Future surveys should extend to 30–60 cm for whole-profile dynamics.

Temporal Resolution

Cloud interference limits Sentinel-2 revisit frequency. Fusing Landsat, commercial CubeSats, or UAV hyperspectral data could close temporal gaps.

Model Evolution

Hybrid architectures embedding CNN-derived spatial features into gradient-boosted ensembles may lift accuracy 5–10% while retaining interpretability.

Comparative Performance: Literature Context

Our dual-sensor XGBoost pipeline delivers three clear advantages over existing approaches:

Accuracy Across Land-Use Mosaics Most optical-only studies report solid but site-limited performance (R² ≈ 0.82–0.86). Our framework sustained Test R² = 0.91 simultaneously in temperate Japan and tropical Togo, demonstrating stronger ecological robustness.

Data Economy and Cost Studies using UAV multispectral + LiDAR plateau at R² ≈ 0.74 with high logistical costs. Our framework relies solely on free Sentinel imagery and minimal ground samples, cutting financial barriers while exceeding UAV-based accuracies.

Model Simplicity Many studies feed dozens of ancillary layers (soil texture, lithology, socioeconomic data) into models and reach R² ≈ 0.73–0.75. Our active-learning feature selection showed that compact SAR backscatter + key vegetation indices + topographic metrics suffice to exceed 0.9 R², balancing accuracy with transferability.

The Bottom Line for Practitioners

Adopt conservation agriculture for measurable carbon benefits—no-tillage + mulching increased SOC by 1.04–1.83 t C ha⁻¹ yr⁻¹

Use free satellite data for cost-effective monitoring—Sentinel-1/2 imagery provides operational SOC tracking without expensive field campaigns

Leverage machine learning for spatial insights—XGBoost models identify carbon-deficient zones for targeted interventions

Document carbon sequestration for markets—high-accuracy SOC maps support carbon credit verification and additionality claims

Plan for the long term—conservation agriculture outcomes appear after 5–10 years of continuous practice

⚠️ Consider soil texture—clayey soils sequester carbon faster than sandy soils, requiring adapted management strategies

Advancing Global Soil Carbon Monitoring

The Sentinel-1/2 + XGBoost framework offers an economically accessible, technically robust tool for routine SOC surveillance. By proving effectiveness across temperate and tropical conditions, our work provides a replicable model for:

  • National soil carbon inventories
  • Regional carbon accounting frameworks
  • Farm-scale carbon credit verification
  • Climate-smart agriculture extension services

As agriculture faces mounting pressure to mitigate climate change while enhancing productivity, scalable SOC monitoring becomes essential. Remote sensing and machine learning transform carbon measurement from laboratory-bound snapshots into continuous, high-resolution spatial assessments—empowering farmers, policymakers, and researchers to make evidence-based decisions for sustainable land management.


Research Publication: Beisekenov, N., Banakinaou, W., Ajayi, A.D., Hasegawa, H., & Tadao, A. (2025). Remote sensing-based soil organic carbon monitoring using advanced machine learning techniques under conservation agriculture systems. Smart Agricultural Technology, 11, 101036. doi:10.1016/j.atech.2025.101036

Study Sites: Niigata University experimental field, Japan (37°52'N, 138°56'E) and Agbelouve research station, Togo (6°42'N, 1°15'E)

Key Methods: Sentinel-1 SAR + Sentinel-2 MSI fusion; XGBoost, Random Forest, and SVM comparison; five-fold cross-validation; active learning for sampling optimization; variance partitioning analysis

Data Availability: Satellite data freely available through Copernicus Open Access Hub; field measurements provided in supplementary materials

Open Access: This research is published under CC BY license, supporting open science and reproducible research in soil carbon monitoring.

View Full Research Paper
Banakinaou Wiyao

Banakinaou Wiyao

PhD Candidate in Environmental Science & Technology

Researching soil carbon sequestration and conservation agriculture at Niigata University, Japan. Exploring sustainable farming through machine learning and biogeochemical modeling.

Copyright © 2025 Banakinaou Wiyao. All rights reserved.
From soil to table, Rooted in quality, Served with care.