Tag

#health-check

8 entries across blog, tutorials, videos, podcasts, and code.

June 9, 2026 Blog

Stop Re-Asking the LLM the Same Question

Incremental distillation: hash the inputs, skip unchanged checks, and cut ~64% of LLM calls on an unchanged re-run — with an honest note on what is not gated yet.
June 9, 2026 Tutorial

Tutorial 3 — Memory & Cost: Stop Re-Diagnosing (and Re-Paying for) the Same Problems

Add cross-run memory and incremental skipping so the tool stops re-diagnosing the same issues and an unchanged re-run costs a fraction of the first.
June 7, 2026 Blog

Teaching an Audit Tool to Remember

Cross-audit lineage and supersession: immutable history, and the what-changed-since-last-time report a returning user actually wants.
June 7, 2026 Tutorial

Tutorial 2 — Build an LLM-as-Judge: Catching Hallucinations Before They Ship

Build the LLM-as-judge verification layer: a cheap deterministic grounding check plus a multi-model consensus vote — including the substring-grounding bug in its natural habitat.
June 4, 2026 Blog

Cheap Checks Before Expensive Ones: The Two-Part Judge Pattern

The two-part judge pattern in ~40 lines: a deterministic filter first, multi-model consensus second. It generalizes to RAG, extraction, and code review.
June 4, 2026 Tutorial

Tutorial 1 — Foundations: Structured Output + a Multi-Provider LLM Harness

Replace free-form LLM prose with structured Finding objects over a multi-provider harness — the checkable spine the rest of the build depends on.
June 2, 2026 Blog

I Asked an LLM to Diagnose My Database. Then I Asked Another LLM if It Was Lying.

The origin story of the judge: the model invented a shared_buffers value. The fix was not a bigger model — it was cheap grounding plus a second-opinion jury.
June 2, 2026 Tutorial

Tutorial 0 — We Vibe-Coded a Database Doctor. Does It Actually Work?

We vibe-coded a 100-point Postgres health check. Does it actually work? The quickstart, the two engines, and an honest gut-check of where the naive version fails.

#health-check

Stop Re-Asking the LLM the Same Question

Tutorial 3 — Memory & Cost: Stop Re-Diagnosing (and Re-Paying for) the Same Problems

Teaching an Audit Tool to Remember

Tutorial 2 — Build an LLM-as-Judge: Catching Hallucinations Before They Ship

Cheap Checks Before Expensive Ones: The Two-Part Judge Pattern

Tutorial 1 — Foundations: Structured Output + a Multi-Provider LLM Harness

I Asked an LLM to Diagnose My Database. Then I Asked Another LLM if It Was Lying.

Tutorial 0 — We Vibe-Coded a Database Doctor. Does It Actually Work?