Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/pThqEU4RcdrXqLPm7
Back to the job results

Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs)

5 days ago 2026/08/27
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Meet the Team

Cisco’s Cloud Collaboration Technology Group (CCTG) builds and operates large-scale, cloud-native collaboration platforms including Webex. The team focuses on delivering highly reliable, observable, and scalable infrastructure powering millions of users globally. You will work at the intersection of platform engineering, SRE, and observability, enabling engineering teams to operate resilient distributed systems.
 



Your Impact

As a Cloud Engineer (Grade 10), you will design and operate observability and infrastructure platforms supporting Webex microservices at scale. This role combines deep hands-on engineering with production ownership, where you will independently drive reliability, automation, and performance improvements across distributed systems.



  • Design, build, and operate observability platforms (logging, metrics, tracing) for microservices
  • Manage and optimize Kubernetes clusters across multi-region production environments
  • Own and enhance CI/CD pipelines using Argo CD, Helm, and GitOps workflows
  • Implement and manage infrastructure-as-code using Terraform on AWS
  • Operate and scale monitoring ecosystems (OpenSearch/ELK, Prometheus, Grafana, Splunk, Kafka)
  • Build automation for proactive detection and remediation of production issues
  • Lead incident response, participate in on-call rotations, and drive post-incident improvements
  • Ensure system security and compliance through patching and vulnerability management
  • Collaborate with cross-functional teams to improve system reliability and scalability
  • Contribute to distributed system design and platform engineering initiatives

Core Technical Skills:
  • Kubernetes administration
  • CI/CD with Argo CD and Helm
  • Docker and container ecosystems
  • Terraform or IaC tools
  • Kafka or streaming systems
  • Linux/Unix expertise
  • Monitoring and alerting systems

Minimum Qualifications

As a part of core tech 90% of our work in based out of these skills.



  • 8+ years of experience in DevOps, SRE, or platform engineering roles in production environments
  • Hands-on experience operating Kubernetes at scale (multi-cluster, thousands of pods, Helm-based deployments)
  • Strong expertise in observability tools (at least two): Prometheus, Grafana, OpenSearch/Elasticsearch, Splunk, Loki, or Logstash
  • Proven experience with Infrastructure-as-Code (Terraform or equivalent) on AWS
  • Proficiency in scripting or programming (Python, Golang, or Bash) for automation and CI/CD integration

Preferred Qualifications
  • Experience managing Kafka / AWS MSK clusters and high-throughput streaming systems
  • Hands-on experience with OpenTelemetry and distributed tracing frameworks
  • Familiarity with security standards (ISO 27001, SOC 2, FedRAMP) and container hardening tools
  • Experience with GitOps workflows (Argo CD), Helm bundles, and progressive delivery (canary/blue-green)
  • Experience using AI tools (Copilot, Claude, LLM-based agents) for code generation, troubleshooting, or incident automation

#WeAreCisco

#WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all.



Our passion is connection—we celebrate our employees’ diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best, but be their best.



We understand our outstanding opportunity to bring communities together and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer—80 hours each year—allows us to give back to causes we are passionate about, and nearly 86% do!



Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!




Why Cisco? 

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.



Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere. 



We are Cisco, and our power starts with you. 





This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.