CA Well Test Pack Guide

Source: docs/operations/CA_WELL_TEST_PACK_GUIDE.md

Manual Index Client UI

CA Well Test Pack Guide

Purpose

Build a real California test pack for oil/drilling prioritization and audit-ready POC validation.

The pack includes:

Run

From repository root:


python3 scripts/dev/fetch_ca_well_test_pack.py

Or via make:


make fetch-ca-well-pack

Build stage-2 OCR queue from the fetched links:


python3 scripts/dev/build_ca_well_stage2_queue.py

Run stage-2 link status checks (pending/failed/successful):


python3 scripts/dev/run_ca_stage2_download_status.py \
  --pack-root data/external/ca_well_test_pack \
  --max-check 250

Run stage-2 OCR extraction status checks (moves OCR from pending -> successful/failed):


python3 scripts/dev/run_ca_stage2_ocr_status.py \
  --pack-root data/external/ca_well_test_pack \
  --max-check 250 \
  --max-pages 2 \
  --min-chars 40

Run full API workflow (upload + register-job execution + report):


python3 scripts/dev/run_ca_well_pack_api_workflow.py

Run full self-validation with expected-vs-actual comparison:


python3 scripts/dev/run_well_poc_self_validation.py \
  --source-tag ca_well_test_pack_compare \
  --stage2-max-links 60 \
  --stage2-status-max-check 60 \
  --expected-outcome docs/operations/WELL_POC_EXPECTED_BASELINE_CA_RUN.json \
  --workflow-output docs/operations/CA_WELL_TEST_PACK_INGEST_REPORT_COMPARE.json \
  --output-json docs/operations/WELL_POC_SELF_VALIDATION_RESULT_COMPARE.json \
  --output-md docs/operations/WELL_POC_SELF_VALIDATION_RESULT_COMPARE.md

Run audit alignment validation (processed stage1 vs audit manifest):


python3 scripts/dev/validate_ca_well_audit_alignment.py \
  --audit-manifest data/external/ca_well_test_pack/audit/manifest.json \
  --workflow-report docs/operations/CA_WELL_TEST_PACK_INGEST_REPORT_COMPARE_20260303.json \
  --output-json docs/operations/CA_WELL_AUDIT_ALIGNMENT_REPORT.json \
  --output-md docs/operations/CA_WELL_AUDIT_ALIGNMENT_REPORT.md

Default output:

data/external/ca_well_test_pack

Typical Drilling-Focused Run


python3 scripts/dev/fetch_ca_well_test_pack.py \
  --county Kern \
  --county Ventura \
  --well-type OG \
  --well-type DG \
  --max-wells 12000 \
  --max-wst 12000 \
  --max-wcr-rows 15000 \
  --max-geologic-rows 15000 \
  --max-pdf-link-rows 15000

Outputs

normalized/:

audit/:

stage2/:

docs/operations/:

Validation Included

Notes