Writing

Technical writing on systems I have built. Infrastructure, ML pipelines, and the engineering decisions behind them.

Why single-document inference wastes 85% of GPU capacity, and how type-based grouping, page-count batch sizing, and dual-path routing brought utilization from 15% to 70%+ with zero OOM incidents.

Read →

20246 minIn Progress

HIPAAAWSSecurity

HIPAA-Compliant Infrastructure Without the Overhead

How we built compliant-by-default infrastructure at Archv. Column-level encryption, row-level access control, immutable audit logs, and zero-trust service mesh. All automated, no manual checklists.

Read →

20255 minIn Progress

MLOpsMonitoringProduction

ML Monitoring That Catches Failures

Aggregate accuracy dashboards hide localized failures. We built alerting on data drift, prediction confidence drops, and silent model degradation. Alerts fire before users report problems.

Read →

Writing

GPU Batching Strategies for Document Classification

HIPAA-Compliant Infrastructure Without the Overhead

ML Monitoring That Catches Failures