Decision Defense Report

Source: docs/architecture/DECISION_DEFENSE_REPORT_MANUAL.html

Manual Index Client UI

Authorship And Creator Position

Johann F.R Wilhelm is recorded here as the author and creator of the project direction, problem framing, and intellectual build intent.

Assistant contributions in this repository are implementation assistance, documentation assistance, and engineering structure produced inside that user-directed frame.

This report is an engineering and provenance statement for internal and colleague review. It does not replace formal legal registration or legal counsel.

System Boundary

This build should be described as a user-directed proof-of-value / proof-of-concept built on open data.

  • It is not a production deployment.
  • It is not a reserves-booking engine.
  • It is not an integration into proprietary internal software.
  • It is a database-backed normalization and decision-support system built to test whether defendable payload-screening can be achieved from open data alone.

Objective

The user objective was broader than a simple feature request. The build was intended to test whether open subsurface data could be turned into a defendable payload-screening system, how difficult that would be in practice, and what would have to exist before such a system could become part of a production, monetizable application.

The user-directed investigation asked the system to prove several things at once:

  • use open field data,
  • normalize it into a source-of-truth data model,
  • cross-reference wells, logs, production, technical context, WITSML, seismic, and models,
  • produce payload-screening outputs that a technical team can inspect and challenge,
  • show whether the approach can be defended technically rather than only demonstrated visually,
  • surface what is still difficult, partial, or approximation-grade,
  • and reveal what would be required to move from draft development into a product-grade application later.

This means the project should be described as an investigation into what can be used, what is worth canonicalizing, what signals are strong enough for decision support, and what additional governance or engineering would be needed before commercialization.

Several important facts surfaced during this investigation and were then folded back into the draft development project:

  • open data alone is enough to build a serious canonical proof-of-concept if provenance is preserved,
  • CRS and ECEF handling are central, not peripheral, to defensible spatial truth,
  • cross-domain well identity resolution is one of the hardest and most important problems,
  • package-level canonical coverage matters because “present” and “decision-ready” are not the same thing,
  • full artifact representation is necessary even when full semantic decode is not yet complete,
  • ranking and barrel estimates are useful only if their heuristic nature is stated explicitly.

The build therefore optimizes for provenance, reproducibility, explainability, and technical honesty rather than novelty claims.

End-To-End Decision Flow

Open Source Data

Volve packages, reports, logs, models, seismic, WITSML

Validation Gate

Inventory, readiness, parser debt, package coverage

Canonical Load

Typed tables in PostgreSQL across wells, tops, production, technical, seismic, model, artifacts

Derived Interpretation

QC, pay events, bypassed-pay, penetrations, support rows, geometry transforms

Decision Outputs

Reopening ranking and low/mid/high remaining-barrel estimates

Stage 1. Source Evidence Raw bundles staged in MinIO with package-level provenance.
Stage 2. Canonical Truth Facts loaded into typed PostgreSQL tables under explicit schemas.
Stage 3. Derived Logic Deterministic transforms and rule-based heuristics computed from canonical truth.
Stage 4. Human Review Outputs are inspectable, exportable, and challengeable by the team.
Decision Step What Happens Why It Is Defendable
Validation Confirms what is present, what is partial, and what is blocked. Prevents silent assumptions and exposes missing coverage explicitly.
Canonicalization Moves source data into typed domain tables. Separates source-authored data from ad hoc script output.
Linking Connects wells to aliases, logs, WITSML, production, reports, and models. Cross-domain conclusions can be traced back to concrete supporting records.
Derived Interpretation Computes QC, pay intervals, bypassed candidates, support rows, and penetrations. Derived logic is explicit and inspectable rather than hidden.
Ranking Scores wells relative to the loaded portfolio. Outputs are broken into components instead of a single opaque number.
Barrel Estimate Produces low/mid/high screening-grade ranges. Communicates uncertainty honestly rather than claiming reserve-grade certainty.

Why The Database Matters

The database is not just storage. It is the mechanism that makes the result defendable.

Schema Role Decision Value
raw Source bundles and objects Proves what entered the system.
ops Canonical operational truth Provides the direct record base for decisions.
semantic Entity projection Supports graph-style query and future AI use without changing source truth.
audit Traceability and evidence Preserves how each result was created.
core Users, access, tenancy Separates operational governance from domain facts.

The central defense is simple: the DB preserves source truth, derived truth, and decision outputs as separate layers.

CRS And ECEF Source Of Truth

CRS normalization is not decoration in this build. It is part of the truth model.

  • WGS84 coordinates are normalized into stable spatial fields.
  • ECEF coordinates are computed deterministically.
  • Transform chains are preserved so spatial assumptions can be inspected later.

This matters because wells, trajectories, surfaces, penetrations, and payload intervals cannot be defended if their spatial basis is hidden or inconsistent.

Spatial Concern How The Build Handles It Why It Matters
CRS resolution Tracks CRS and transform context instead of discarding it. Prevents mixed-coordinate ambiguity.
WGS84 normalization Creates a common geographic reference frame. Allows cross-domain spatial joins.
ECEF conversion Computes deterministic Cartesian coordinates. Supports consistent geometry and distance logic.
Depth and spatial linkage Connects tops, penetrations, trajectories, and intervals through canonical records. Improves defendability of “where is the payload?” questions.

Schema To Decision Map

Decision Topic Main Tables What The Tables Prove
Which well is which? ops.canonical_wells, ops.canonical_well_aliases, ops.canonical_well_source_links Stable well identity and alias provenance.
Where is the well? ops.canonical_well_locations Spatial anchor for cross-domain joins.
What reservoir intervals are present? ops.canonical_formation_tops, ops.canonical_structural_surfaces, ops.canonical_reservoir_bodies Static subsurface context.
What was actually completed? ops.canonical_completion_intervals Completion/perforation context for bypassed-pay logic.
What happened over time? ops.canonical_production_records, ops.canonical_dynamic_reservoir_context, ops.canonical_technical_daily_reports, ops.canonical_witsml_* Dynamic and operational support.
What do the logs say? ops.well_logs, ops.well_qc_cards, ops.well_interpretations, ops.well_pay_events, ops.well_bypassed_candidates Petrophysical support and interval-level interpretation.
Which wells access which bodies? ops.canonical_well_reservoir_penetrations Reservoir-access evidence.
What is the ranked decision? ops.well_reopening_targets Explicit score components and rank.
What is the barrel range? ops.remaining_barrel_estimates Low/mid/high screening-grade payload estimate.

Human vs AI vs Derived

Class Examples How To Describe It
Human-authored source data production values, tops, completions, WITSML XML fields, technical reports, artifact metadata Directly extracted from open source packages.
Human-directed build intent use Volve, use MinIO, process all packages, export outputs, make the build defendable Product direction came from Johann F.R Wilhelm.
Assistant-designed implementation schema layout, migrations, workflow phases, scoring structure, coverage audit Engineering mechanics supplied under user direction.
Deterministic transform WGS84, ECEF, counts, geometry envelopes, canonical inventories Mechanically derived and reproducible.
Rule-based heuristic scores, confidence labels, pay candidates, remaining barrels Authored logic, not hidden model inference.
Runtime AI/ML inference none found in canonicalization/scoring path Current live outputs are not generated by runtime LLM decision-making.

IP And Copyright Position

Area Current Position Defendable Statement
Project direction and build framing User-originated Project intent and narrowing decisions are attributed to Johann F.R Wilhelm.
Repository implementation Mixed authorship Code and docs include assistant-generated implementation under user direction.
Open dataset handling Provenance preserved Normalization does not erase source-license constraints.
Exports and reports Improved but not fully closed Metadata-first posture is stronger than reproducing source text, but legal/policy closure is still separate from engineering documentation.

Colleague Defense Summary

  1. The build is grounded in real open source data, not synthetic examples.
  2. The database separates source facts, canonical truth, and derived outputs.
  3. CRS and ECEF are explicit parts of the source-of-truth model.
  4. Cross-reference and ranking are built from inspectable schema and explicit logic.
  5. The runtime path is deterministic or rule-based, not live LLM decision-making.
  6. The result is defendable as a user-directed POC for decision support, not as a production reserves system.

Supporting Artifacts

Schema provenance ledger:

  • docs/operations/BUILD_SCHEMA_PROVENANCE_AUDIT_20260330.json
  • docs/operations/BUILD_SCHEMA_PROVENANCE_AUDIT_20260330.md
  • docs/operations/BUILD_SCHEMA_PROVENANCE_AUDIT_20260330.csv