DevOps Engineer
Who we are looking for
We are looking for a DevOps Engineer reporting directly to the VP, Core Data Services - Software Engineering & Infrastructure Development. The DevOps Engineer is responsible for building, automating, securing, and optimizing cloud infrastructure on Microsoft Azure using Infrastructure as Code (IaC), configuration management, observability, and cost-aware engineering practices. The position suits a hands-on engineer who can work closely with the Cloud Center of Excellence (CCoE), platform teams, and application development teams to create scalable, reliable, and repeatable delivery patterns across environments.
What you will be responsible for
As a DevOps Engineer, you will:
- Design, provision, and manage Azure infrastructure using Terraform and reusable IaC modules, ensuring standardization, scalability, and repeatability across development, test, staging, and production environments.
- Automate configuration management, software setup, patching, and post-provisioning tasks using Ansible playbooks and role-based automation patterns.
- Build, enhance, and maintain CI/CD pipelines using Azure DevOps, GitHub Actions, Jenkins, or similar tools to support reliable and secure application and infrastructure delivery.
- Implement and manage observability capabilities including monitoring, alerting, logging, dashboards, and operational visibility using tools such as Azure Monitor, Log Analytics, Grafana, Prometheus, Datadog, or equivalent platforms.
- Partner with application teams to enable deployment automation, environment consistency, release reliability, and faster recovery from incidents.
- Collaborate with CCoE and cloud governance stakeholders to align infrastructure patterns with enterprise standards for security, tagging, policy, compliance, and platform architecture.
- Drive FinOps practices by improving cloud cost visibility, resource rightsizing, usage optimization, lifecycle automation, and governance controls that reduce waste without compromising resilience.
- Troubleshoot platform, deployment, and environment issues across cloud infrastructure and delivery pipelines, and perform root cause analysis for reliability improvements.
- Create high-quality technical documentation, SOPs, runbooks, automation patterns, and reusable templates for engineering teams.
- Build internal automation assistants using LangGraph and LangChain for guided operational workflows, runbook execution support, knowledge retrieval, and infrastructure change assistance.
- Integrate Claude models, plugins, and tool-enabled workflows into engineering automation where governed AI assistance can improve productivity in troubleshooting, documentation, RCA drafting, or deployment decision support.
- Define guardrails, approval checkpoints, observability, and fallback logic for AI-assisted or agentic infrastructure operations.
What we value
Education & Preferred Qualifications
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 3-5 years of experience in DevOps, Cloud Engineering, SRE, or Platform Engineering roles with strong hands-on delivery ownership.
- Proven ability to work in a fast-paced environment, stay flexible, meet tight deadlines, and organize and prioritize work.
- Strong hands-on experience with Microsoft Azure services and cloud operations.
- Solid expertise in Terraform, including module design, remote state handling, environment strategy, and infrastructure lifecycle management.
- Strong working knowledge of Ansible for configuration management, orchestration, and automation.
- Good understanding of IaC principles, CI/CD, Git-based workflows, release automation, and environment standardization.
- Experience with monitoring, logging, and alerting tools such as Azure Monitor, Log Analytics, Prometheus, Grafana, ELK, Datadog, or similar platforms.
- Familiarity with containers and orchestration platforms such as Docker, Kubernetes, and AKS is strongly preferred.
- Exposure to scripting with PowerShell, Bash, or Python for automation and operational tooling.
- Understanding of cost optimization, tagging strategy, governance, and operational efficiency in cloud environments.
- Awareness of agentic AI frameworks such as LangChain and LangGraph, plus practical exposure to Claude-based workflows, is highly desirable for teams building next-generation automation capabilities.
Additional requirements
- Certification in Azure and Terraform.
- Ability to multitask and meet aggressive timelines, with a strong work ethic.
Work Requirement
- Hybrid Work – 4 days a week
- Regular work hours
About State Street
Across the globe, institutional investors rely on us to help them manage risk, respond to challenges, and drive performance and profitability. We keep our clients at the heart of everything we do, and smart, engaged employees are essential to our continued success.
We are committed to fostering an environment where every employee feels valued and empowered to reach their full potential. As an essential partner in our shared success, you’ll benefit from inclusive development opportunities, flexible work-life support, paid volunteer days, and vibrant employee networks that keep you connected to what matters most. Join us in shaping the future.
As an Equal Opportunity Employer, we consider all qualified applicants for all positions without regard to race, creed, color, religion, national origin, ancestry, ethnicity, age, disability, genetic information, sex, sexual orientation, gender identity or expression, citizenship, marital status, domestic partnership or civil union status, familial status, military and veteran status, and other characteristics protected by applicable law.
Discover more information on jobs at StateStreet.com/careers