AI for Geotechnical Engineering & Foundation Design
Soil Classification from SPT Data, Foundation Selection, Slope Stability, and Indian Soil Challenges
AI-Enhanced Soil Classification from SPT and Lab Data
Standard Penetration Test data is the backbone of Indian geotechnical investigation — IS 2131 governs the procedure, and nearly every foundation design in India starts with an SPT borehole log. The interpretation, however, involves significant engineering judgment. Two geotechnical engineers looking at the same SPT log often disagree on soil layer boundaries, classification, and design parameters. AI formalizes the interpretation process while learning from thousands of historical boreholes.
Open data/soil-investigation-data.csv in the code panel. Each row is an SPT record: borehole_id, project_id, location_lat, location_lng, depth_m, spt_n_value, spt_n_corrected, soil_description_field, soil_classification_is (IS 1498), liquid_limit, plastic_limit, grain_size_gravel_pct, grain_size_sand_pct, grain_size_fines_pct, moisture_content_pct, bulk_density_kn_m3, ucs_kpa (if rock/stiff clay), geological_formation.
From SPT N-values to Soil Properties
The classical correlations (Terzaghi-Peck for sands, Stroud for clays) are first-order approximations. They were developed on temperate-region soils and systematically over- or under-predict for Indian conditions:
| Soil Type | Classical Correlation Issue in India |
|---|---|
| Black cotton soil (BC soil) | SPT N = 2-5 but swelling pressure is the real design driver, not bearing capacity |
| Laterite | High N-values (20-40) but softens dramatically on saturation — N-value misleading |
| Mumbai marine clay | Very low N (0-2) with high sensitivity — remoulded strength is 20-30% of undisturbed |
| Delhi alluvium (Yamuna flood plain) | SPT refusal in kankar layers interbedded with loose sand — erratic N-profile |
| Deccan trap basalt (weathered) | Highly variable weathering — N ranges from 5 (completely weathered) to refusal in 1m vertical distance |
A Random Forest classifier trained on 15,000+ SPT records from 800 projects across India classifies soil type (IS 1498 groups: GW/GP/GM/GC/SW/SP/SM/SC/ML/CL/MH/CH) with 89% accuracy — compared to 72% for rule-based classification using only N-value and depth. The improvement comes from incorporating regional geological context (geological_formation as a feature) and grain size distribution when available.
Automated Borehole Log Interpretation
The more valuable application: automated identification of soil layer boundaries and generation of idealized soil profiles. A 1D convolutional neural network (1D-CNN) processes the N-value profile (N vs depth) along with available lab data and identifies layer transitions. On a test set of 200 boreholes where experienced geotechnical engineers manually defined layer boundaries, the CNN agreed with the engineer's interpretation 84% of the time — and on the 16% disagreement cases, the CNN's layering was judged "equally valid" by a third engineer in 60% of cases.
Foundation Type Selection: Correlating Soil Profile with Structural Loads
Foundation selection in India follows a well-established decision tree: isolated footings for light loads on competent soil, raft for heavy loads or variable soil, piles when bearing stratum is deep. But the "right" choice depends on the interaction between soil profile, structural loads, differential settlement tolerance, and site-specific constraints (water table, adjacent structures, construction access).
Open data/foundation-design-data.json — it contains: project_id, structure_type, column_loads_kn (array), soil_profile (array of layers with classification, depth, properties), water_table_m, selected_foundation_type, design_bearing_capacity_kpa, estimated_settlement_mm, pile_type (if applicable), pile_length_m, pile_diameter_mm.
Decision Support Model
A classification model (XGBoost) trained on 2,000+ foundation design records predicts the optimal foundation type:
Input features:
max_column_load_kn
load_variability (max/min column load ratio)
bearing_stratum_depth_m (depth to first layer with N > 15 for clay, N > 30 for sand)
soil_variability_index (coefficient of variation of N-values in top 10m)
water_table_depth_m
structure_type (residential/commercial/industrial/bridge)
differential_settlement_limit_mm
site_access_constraints (boolean flags: piling_rig_access, dewatering_feasible)
seismic_zone (IS 1893)
Output classes:
isolated_footing, combined_footing, raft, bored_cast_in_situ_pile,
driven_precast_pile, micro_pile, well_foundationThe model achieves 86% agreement with experienced foundation designers. The 14% disagreement is concentrated in borderline cases (raft vs pile-raft, isolated vs combined) where either choice is defensible. The model's value is not replacing judgment — it is flagging cases where the chosen foundation type is unusual for the given soil-load combination, prompting a second review.
Mumbai Marine Clay: A Special Case
For projects on Mumbai's western coastline (Bandra to Dahisar, Navi Mumbai), the model has a dedicated sub-model. The marine clay (CH classification, liquid limit 80-120%, sensitivity 4-8) extends to 15-25m depth. The sub-model learns that:
Slope Stability Analysis with Seismic Loading
Slope failures in India cause significant casualties — Uttarakhand, Himachal, Western Ghats, and Northeast India are high-risk zones. IS 1893 seismic zonation (Zones II-V) determines the pseudo-static seismic coefficient for slope stability analysis. But pseudo-static analysis with a single seismic coefficient is conservative for some slopes and unconservative for others — depending on slope height, soil type, and ground motion characteristics.
Open data/slope-stability-analysis.json — it contains: slope_id, location, slope_height_m, slope_angle_deg, soil_layers (with c, phi, gamma for each), water_table_condition, seismic_zone, failure_mode_predicted, fos_static, fos_pseudo_static, fos_newmark_displacement_mm, actual_failure_observed.
ML-Enhanced Slope Stability Screening
For regional slope hazard assessment (highway alignment selection, township planning), running detailed FOS analysis on every slope is impractical. A screening model trained on 5,000+ slope stability analyses (Fellenius, Bishop, Morgenstern-Price) plus observed failure/no-failure data:
| Feature | SHAP Importance |
|---|---|
| slope_angle_deg / friction_angle_deg ratio | 0.24 |
| **saturation_condition** (dry/partial/full) | 0.19 |
| seismic_zone | 0.14 |
| slope_height_m | 0.12 |
| cohesion_kpa | 0.11 |
| geological_formation | 0.10 |
| **rainfall_intensity_mm_hr** (design storm) | 0.06 |
| vegetation_cover | 0.04 |
The screening model classifies slopes as stable (FOS > 1.5), marginal (1.0-1.5), or unstable (< 1.0) with 91% accuracy. For highway projects in the Western Ghats (NH-66 Goa section, NH-48 Charmadi Ghat), the screening reduces detailed analysis requirements by 60% — focusing geotechnical effort on the 40% of slopes that are marginal or unstable.
Rainfall-Triggered Failure Prediction
Western Ghats slopes fail primarily during intense monsoon rainfall — not during earthquakes. The failure mechanism: infiltration raises the water table, reducing effective stress and hence shear strength. A threshold model:
Antecedent rainfall threshold:
3-day cumulative rainfall > 200mm AND
Daily intensity > 100mm/day AND
Antecedent moisture index (30-day weighted rainfall) > 300mm
Combined with slope susceptibility (from screening model):
High susceptibility + threshold exceeded → 78% probability of failure
Medium susceptibility + threshold exceeded → 23% probability
Low susceptibility + threshold exceeded → 4% probabilityThis is operationalized for NH/SH maintenance: when IMD nowcast exceeds the threshold for a highway section with known high-susceptibility slopes, traffic advisories and preventive diversions can be triggered.
Indian Soil Challenges: Regional Considerations
Black Cotton Soil (Expansive Clays)
Found across the Deccan Plateau (Maharashtra, Karnataka, MP, parts of AP/Telangana). Free swell index 50-150%. The AI model learns that for BC soil, the design is governed by swell pressure (50-300 kPa) rather than bearing capacity. Under-reamed piles (IS 2911 Part 3) are the standard foundation solution — the model recommends pile length based on depth to non-expansive stratum (typically 3-5m).
Laterite Soils
Bangalore, Mangalore, and Kerala laterites show high in-situ N-values but degrade on saturation. The model incorporates a "saturation sensitivity factor" — the ratio of saturated to natural moisture content strength — as a critical feature. For Bangalore laterite, standard SBC from N-value correlations overpredicts by 25-35% compared to plate load test results.
Alluvial Deposits (Gangetic and Yamuna Plains)
Delhi NCR, UP, Bihar — deep alluvial deposits with interbedded sand and clay layers, occasional kankar (calcium carbonate nodite) beds. The AI model identifies liquefiable layers (sand with N < 15, water table within 3m, seismic zone III-V) using the IS 1893 Part 1 simplified procedure — and flags projects where liquefaction assessment per IS 1893 is mandatory but has been skipped in the geotechnical report.
Key Takeaways
This is chapter 3 of AI for Civil & Infrastructure.
Get the full hands-on course — free during early access. Build the complete system. Your projects become your portfolio.
View course details