Skip to content
Back to Work
002

Optum

Production ML for Healthcare at UnitedHealth Group

Role

Software Engineer, AI/ML

Duration

Feb 2026 – Present

Team

Enterprise AI/ML Team

Status

Current

Private Repo

Overview

Building production ML pipelines at UnitedHealth Group, the largest healthcare company in the US. Owning model deployment, monitoring, and data processing across AWS and Kubernetes. The systems I work on process millions of healthcare records daily to flag care gaps, automate claims review, and surface clinical insights.

Problem

Healthcare data is fragmented. Clinical records, claims, pharmacy data, and lab results live in separate systems. Manual review dominates: nurses document by hand, claims require human approval, and care gaps go undetected until patients show up in the ER. The challenge is building ML systems that run reliably at this scale while meeting HIPAA and SOC 2 requirements.

Approach

  • 01Own end-to-end ML pipeline development: data ingestion, feature engineering, model training, deployment, and monitoring
  • 02Deploy models to production on AWS with Kubernetes orchestration and automated rollback on performance degradation
  • 03Build data processing jobs handling millions of healthcare records with strict SLA requirements
  • 04Implement model observability: drift detection, prediction confidence tracking, and alerting for production anomalies
  • 05Work with data scientists to translate research models into production-grade services with latency and throughput guarantees
  • 06Maintain HIPAA compliance across all data pipelines with encryption, access controls, and audit logging

Design Decisions

Technology Stack

Languages

PythonSQL

ML/AI

PyTorchScikit-learnMLflowFeature Stores

Infrastructure

AWSKubernetesDockerTerraform

Data

SparkAirflowPostgreSQLS3

Compliance

HIPAASOC 2EncryptionAudit Logging

Impact

Scale

150M+

Patients served by UnitedHealth Group healthcare services

Records

Millions

Healthcare records processed daily through ML pipelines

Uptime

99.9%+

Production model availability with automated failover

Next Case Study

MedVanta Platform