Notes

Writing

Occasional long-form on building and evaluating AI tooling — and the craft of software that lasts.

Build the thing, then prove it works

a multi-source investigation pipeline, and the checks that keep it honest

I built a pipeline that investigates problems the way a person would: by pulling evidence out of every system that might hold a piece of the answer, lining it up, and checking its own work before it reports back. This is how it's put together, and how I decide whether to trust it — blind evaluations across a range of models, run often enough to catch the day it stops working.