Platform SRE

TD SYNNEX

🌍 100% Remote Full time

Job Description

Role purpose:

  • Ensure reliability, operability, and continuous improvement of TD SYNNEX enterprise platforms across hybrid cloud and on‑prem environments.
  • Run engineering‑driven operations focused on automation, Infrastructure‑as‑Code (IaC), observability, and toil reduction.
  • Serve as the L3 escalation for complex incidents; continuously improve platform run posture and readiness for L1/L2 execution.

Core responsibilities:

  • Platform reliability (hybrid cloud + on‑prem): Own L3 reliability posture; define SLOs/KPIs; lead operability gates and production readiness; maintain runbooks/SOPs.
  • Automation & IaC: Design/build operational automation (health checks, remediation workflows); develop Terraform/Ansible configurations; script with Python (preferred), PowerShell, and/or Bash; integrate with ITSM for auditable self‑service and controlled remediation.
  • Incident/problem/RCA (L3): Lead diagnosis, stabilization, an...