Architect and Senior SRE focused on observability, distributed systems, and developer tooling. I write Python, Bash, and IaC and I care about keeping things running.
Manual My Oracle Support (MOS) Service Request triage and stakeholder notifications slowed mean time to resolution for critical Sovereign Cloud incidents.
Outcome › Automated SR creation, severity triage, and notifications via Python + OCI APIs; measurably reduced MTTR across low-side and high-side government realms.
CAB Workflow Automation
Python
OCI APIs
GitHub Actions
Buildkite
Manual Change Advisory Board review created a deployment backlog for OCI service teams, slowing safe release velocity.
Outcome › Engineered automation pipelines that eliminated manual change request processing, cut the review queue, and accelerated deployments for the Sovereign Cloud org.
Twilio Resilience Platform
PagerDuty
Rollbar
Prometheus
Python
Ansible
Production outages required manual triage that extended downtime windows and risked SLA breach across distributed services.
Outcome › Cut downtime 40% with automated monitoring, runbooks, and recovery tooling; sustained 99.99% availability and reduced manual operational effort by 50%.