Back to guides
3
8 min

AI for Geotechnical Engineering & Foundation Design

Soil Classification from SPT Data, Foundation Selection, Slope Stability, and Indian Soil Challenges

AI-Enhanced Soil Classification from SPT and Lab Data

Standard Penetration Test data is the backbone of Indian geotechnical investigation — IS 2131 governs the procedure, and nearly every foundation design in India starts with an SPT borehole log. The interpretation, however, involves significant engineering judgment. Two geotechnical engineers looking at the same SPT log often disagree on soil layer boundaries, classification, and design parameters. AI formalizes the interpretation process while learning from thousands of historical boreholes.

Open data/soil-investigation-data.csv in the code panel. Each row is an SPT record: borehole_id, project_id, location_lat, location_lng, depth_m, spt_n_value, spt_n_corrected, soil_description_field, soil_classification_is (IS 1498), liquid_limit, plastic_limit, grain_size_gravel_pct, grain_size_sand_pct, grain_size_fines_pct, moisture_content_pct, bulk_density_kn_m3, ucs_kpa (if rock/stiff clay), geological_formation.

From SPT N-values to Soil Properties

The classical correlations (Terzaghi-Peck for sands, Stroud for clays) are first-order approximations. They were developed on temperate-region soils and systematically over- or under-predict for Indian conditions:

Soil TypeClassical Correlation Issue in India
Black cotton soil (BC soil)SPT N = 2-5 but swelling pressure is the real design driver, not bearing capacity
LateriteHigh N-values (20-40) but softens dramatically on saturation — N-value misleading
Mumbai marine clayVery low N (0-2) with high sensitivity — remoulded strength is 20-30% of undisturbed
Delhi alluvium (Yamuna flood plain)SPT refusal in kankar layers interbedded with loose sand — erratic N-profile
Deccan trap basalt (weathered)Highly variable weathering — N ranges from 5 (completely weathered) to refusal in 1m vertical distance

A Random Forest classifier trained on 15,000+ SPT records from 800 projects across India classifies soil type (IS 1498 groups: GW/GP/GM/GC/SW/SP/SM/SC/ML/CL/MH/CH) with 89% accuracy — compared to 72% for rule-based classification using only N-value and depth. The improvement comes from incorporating regional geological context (geological_formation as a feature) and grain size distribution when available.

Automated Borehole Log Interpretation

The more valuable application: automated identification of soil layer boundaries and generation of idealized soil profiles. A 1D convolutional neural network (1D-CNN) processes the N-value profile (N vs depth) along with available lab data and identifies layer transitions. On a test set of 200 boreholes where experienced geotechnical engineers manually defined layer boundaries, the CNN agreed with the engineer's interpretation 84% of the time — and on the 16% disagreement cases, the CNN's layering was judged "equally valid" by a third engineer in 60% of cases.

Foundation Type Selection: Correlating Soil Profile with Structural Loads

Foundation selection in India follows a well-established decision tree: isolated footings for light loads on competent soil, raft for heavy loads or variable soil, piles when bearing stratum is deep. But the "right" choice depends on the interaction between soil profile, structural loads, differential settlement tolerance, and site-specific constraints (water table, adjacent structures, construction access).

Open data/foundation-design-data.json — it contains: project_id, structure_type, column_loads_kn (array), soil_profile (array of layers with classification, depth, properties), water_table_m, selected_foundation_type, design_bearing_capacity_kpa, estimated_settlement_mm, pile_type (if applicable), pile_length_m, pile_diameter_mm.

Decision Support Model

A classification model (XGBoost) trained on 2,000+ foundation design records predicts the optimal foundation type:

Input features:
  max_column_load_kn
  load_variability (max/min column load ratio)
  bearing_stratum_depth_m (depth to first layer with N > 15 for clay, N > 30 for sand)
  soil_variability_index (coefficient of variation of N-values in top 10m)
  water_table_depth_m
  structure_type (residential/commercial/industrial/bridge)
  differential_settlement_limit_mm
  site_access_constraints (boolean flags: piling_rig_access, dewatering_feasible)
  seismic_zone (IS 1893)

Output classes:
  isolated_footing, combined_footing, raft, bored_cast_in_situ_pile,
  driven_precast_pile, micro_pile, well_foundation

The model achieves 86% agreement with experienced foundation designers. The 14% disagreement is concentrated in borderline cases (raft vs pile-raft, isolated vs combined) where either choice is defensible. The model's value is not replacing judgment — it is flagging cases where the chosen foundation type is unusual for the given soil-load combination, prompting a second review.

Mumbai Marine Clay: A Special Case

For projects on Mumbai's western coastline (Bandra to Dahisar, Navi Mumbai), the model has a dedicated sub-model. The marine clay (CH classification, liquid limit 80-120%, sensitivity 4-8) extends to 15-25m depth. The sub-model learns that:

  • Isolated footings fail differential settlement checks even for 2-storey structures
  • Raft foundations require pre-loading with surcharge for 6-12 months (consolidation settlement of 200-400mm)
  • Bored piles to the underlying weathered basalt at 20-30m depth are the default choice for structures above 5 storeys
  • SBC calculation per IS 6403 must use consolidated undrained (CU) parameters, not UU — using UU overestimates SBC by 30-40%
  • Slope Stability Analysis with Seismic Loading

    Slope failures in India cause significant casualties — Uttarakhand, Himachal, Western Ghats, and Northeast India are high-risk zones. IS 1893 seismic zonation (Zones II-V) determines the pseudo-static seismic coefficient for slope stability analysis. But pseudo-static analysis with a single seismic coefficient is conservative for some slopes and unconservative for others — depending on slope height, soil type, and ground motion characteristics.

    Open data/slope-stability-analysis.json — it contains: slope_id, location, slope_height_m, slope_angle_deg, soil_layers (with c, phi, gamma for each), water_table_condition, seismic_zone, failure_mode_predicted, fos_static, fos_pseudo_static, fos_newmark_displacement_mm, actual_failure_observed.

    ML-Enhanced Slope Stability Screening

    For regional slope hazard assessment (highway alignment selection, township planning), running detailed FOS analysis on every slope is impractical. A screening model trained on 5,000+ slope stability analyses (Fellenius, Bishop, Morgenstern-Price) plus observed failure/no-failure data:

    FeatureSHAP Importance
    slope_angle_deg / friction_angle_deg ratio0.24
    **saturation_condition** (dry/partial/full)0.19
    seismic_zone0.14
    slope_height_m0.12
    cohesion_kpa0.11
    geological_formation0.10
    **rainfall_intensity_mm_hr** (design storm)0.06
    vegetation_cover0.04

    The screening model classifies slopes as stable (FOS > 1.5), marginal (1.0-1.5), or unstable (< 1.0) with 91% accuracy. For highway projects in the Western Ghats (NH-66 Goa section, NH-48 Charmadi Ghat), the screening reduces detailed analysis requirements by 60% — focusing geotechnical effort on the 40% of slopes that are marginal or unstable.

    Rainfall-Triggered Failure Prediction

    Western Ghats slopes fail primarily during intense monsoon rainfall — not during earthquakes. The failure mechanism: infiltration raises the water table, reducing effective stress and hence shear strength. A threshold model:

    Antecedent rainfall threshold:
      3-day cumulative rainfall > 200mm AND
      Daily intensity > 100mm/day AND
      Antecedent moisture index (30-day weighted rainfall) > 300mm
    
    Combined with slope susceptibility (from screening model):
      High susceptibility + threshold exceeded → 78% probability of failure
      Medium susceptibility + threshold exceeded → 23% probability
      Low susceptibility + threshold exceeded → 4% probability

    This is operationalized for NH/SH maintenance: when IMD nowcast exceeds the threshold for a highway section with known high-susceptibility slopes, traffic advisories and preventive diversions can be triggered.

    Indian Soil Challenges: Regional Considerations

    Black Cotton Soil (Expansive Clays)

    Found across the Deccan Plateau (Maharashtra, Karnataka, MP, parts of AP/Telangana). Free swell index 50-150%. The AI model learns that for BC soil, the design is governed by swell pressure (50-300 kPa) rather than bearing capacity. Under-reamed piles (IS 2911 Part 3) are the standard foundation solution — the model recommends pile length based on depth to non-expansive stratum (typically 3-5m).

    Laterite Soils

    Bangalore, Mangalore, and Kerala laterites show high in-situ N-values but degrade on saturation. The model incorporates a "saturation sensitivity factor" — the ratio of saturated to natural moisture content strength — as a critical feature. For Bangalore laterite, standard SBC from N-value correlations overpredicts by 25-35% compared to plate load test results.

    Alluvial Deposits (Gangetic and Yamuna Plains)

    Delhi NCR, UP, Bihar — deep alluvial deposits with interbedded sand and clay layers, occasional kankar (calcium carbonate nodite) beds. The AI model identifies liquefiable layers (sand with N < 15, water table within 3m, seismic zone III-V) using the IS 1893 Part 1 simplified procedure — and flags projects where liquefaction assessment per IS 1893 is mandatory but has been skipped in the geotechnical report.

    Key Takeaways

  • SPT interpretation benefits from regional geological context — Indian soils deviate systematically from textbook correlations. ML models trained on local data outperform global correlations by incorporating geological formation and index properties.
  • Foundation selection AI is a second-opinion tool — it does not replace engineering judgment but flags unusual choices that warrant review. The value is quality assurance, not automation.
  • Slope stability screening at regional scale is the high-leverage application — detailed analysis of every slope is impractical. ML screening focuses geotechnical effort where it matters most.
  • Soil-specific challenges (BC soil, laterite, marine clay) require sub-models — a single national model underperforms regional models that encode local failure mechanisms and design practices.
  • This is chapter 3 of AI for Civil & Infrastructure.

    Get the full hands-on course — free during early access. Build the complete system. Your projects become your portfolio.

    View course details