Senior Site Reliability Engineer (SRE) – Observability & Kubernetes (8–12 yrs)

- Cisco
- Bengaluru· India

5 days ago 2026/08/27

Complete Questionnaire

Apply on company site

Other Business Support Services

Create a job alert for similar positions

Job alert turned off. You won’t receive updates for this search anymore.

Undo

Job description

Meet the Team

Cisco’s Cloud Collaboration Technology Group (CCTG) builds and operates large-scale, cloud-native collaboration platforms including Webex. The team focuses on delivering highly reliable, observable, and scalable infrastructure powering millions of users globally. You will work at the intersection of platform engineering, SRE, and observability, enabling engineering teams to operate resilient distributed systems.

Your Impact

As a Cloud Engineer (Grade 10), you will design and operate observability and infrastructure platforms supporting Webex microservices at scale. This role combines deep hands-on engineering with production ownership, where you will independently drive reliability, automation, and performance improvements across distributed systems.

Design, build, and operate observability platforms (logging, metrics, tracing) for microservices
Manage and optimize Kubernetes clusters across multi-region production environments
Own and enhance CI/CD pipelines using Argo CD, Helm, and GitOps workflows
Implement and manage infrastructure-as-code using Terraform on AWS
Operate and scale monitoring ecosystems (OpenSearch/ELK, Prometheus, Grafana, Splunk, Kafka)
Build automation for proactive detection and remediation of production issues
Lead incident response, participate in on-call rotations, and drive post-incident improvements
Ensure system security and compliance through patching and vulnerability management
Collaborate with cross-functional teams to improve system reliability and scalability
Contribute to distributed system design and platform engineering initiatives

Core Technical Skills:

Kubernetes administration
CI/CD with Argo CD and Helm
Docker and container ecosystems
Terraform or IaC tools
Kafka or streaming systems
Linux/Unix expertise
Monitoring and alerting systems

Minimum Qualifications

As a part of core tech 90% of our work in based out of these skills.

8+ years of experience in DevOps, SRE, or platform engineering roles in production environments
Hands-on experience operating Kubernetes at scale (multi-cluster, thousands of pods, Helm-based deployments)
Strong expertise in observability tools (at least two): Prometheus, Grafana, OpenSearch/Elasticsearch, Splunk, Loki, or Logstash
Proven experience with Infrastructure-as-Code (Terraform or equivalent) on AWS
Proficiency in scripting or programming (Python, Golang, or Bash) for automation and CI/CD integration

Preferred Qualifications

Experience managing Kafka / AWS MSK clusters and high-throughput streaming systems
Hands-on experience with OpenTelemetry and distributed tracing frameworks
Familiarity with security standards (ISO 27001, SOC 2, FedRAMP) and container hardening tools
Experience with GitOps workflows (Argo CD), Helm bundles, and progressive delivery (canary/blue-green)
Experience using AI tools (Copilot, Claude, LLM-based agents) for code generation, troubleshooting, or incident automation

#WeAreCisco

#WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all.

Our passion is connection—we celebrate our employees’ diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best, but be their best.

We understand our outstanding opportunity to bring communities together and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer—80 hours each year—allows us to give back to causes we are passionate about, and nearly 86% do!

Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.

This job post has been translated by AI and may contain minor differences or errors.

Apply on company site Email to Friend Complete Questionnaire

Compare your profile with other applicants

Cancel

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.

MANAGE

Job alert created for this search. You’ll receive updates when new jobs match.

Manage alerts

Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.