Systems / Builds

Follow the white rabbit.

Deep-dives into AI infrastructure, local model workflows, custom automation, retrieval layers, and the interfaces between tools. Each note documents a system that was built, tested, and shipped—or learned from in the attempt.

RAG evaluation harness for retrieval quality

A structured evaluation framework that measures retrieval precision, answer faithfulness, and latency across document chunks.

Custom automation layer over GitHub Actions

Replacing brittle CI pipelines with a custom orchestration layer that handles dependency graphs, artifact caching, and rollback strategies.