https://bayt.page.link/v1TUmrkCw1dqRip19

Back to the job results

Site Reliability Devops Engineer

- InnovaziT (A Happiest Minds company)
- Abu Dhabi · UAE

30+ days ago 2026/05/17

Complete Questionnaire

Full time · 5 - 10 Years of Experience

50-99 Employees

Get the Bayt App

Download the Bayt App to manage your real time conversation with the recruiter

Download App

Create a job alert for similar positions

Job alert turned off. You won’t receive updates for this search anymore.

Undo

Job description

About the Role

We’re looking for a talented Site Reliability Engineer (SRE) to keep our systems running smoothly, reliably, and at scale. Through smart automation, deep observability, and a calm head in a crisis, you’ll help us balance speed, compliance, and stability, working alongside DevOps, Cloud, Quality Engineering, and Product teams to drive continuous improvements in performance, security, and resilience.

What You Will Be Doing

• Define and implement SLIs / SLOs and error budgets for business-critical digital banking services.

• Build actionable observability (metrics, logs, traces, dashboards, and alerts) using Dynatrace, Prometheus, Grafana, and ELK, while reducing alert fatigue.

• Leverage AI-driven insights and anomaly detection (Dynatrace Davis AI or equivalent AIOps platform) to proactively predict and resolve reliability issues before impact.

• Lead incident management — from on-call triage and root-cause analysis to blameless postmortems with actionable follow-ups.

• Improve deployment safety with robust rollout / rollback strategies, canary and blue-green deployments, and production readiness reviews.

• Support and optimize microservices-based architectures, ensuring service reliability, scalability, and inter-service resilience.

• Conduct capacity planning, performance tuning, and resilience testing, optimizing for both reliability and cost efficiency.

• Automate operational toil — from runbooks and remediation scripts to proactive health checks and self-healing workflows.

• Collaborate with DevOps to embed reliability gates and validations into CI / CD pipelines (GitHub Actions, Jenkins, GitLab CI / CD or Azure DevOps). • Own and evolve the observability and AIOps stack, driving intelligent automation and predictive alerting capabilities.

• Maintain high-quality documentation, playbooks, and operational standards across environments.

• Ensure operational compliance and security alignment with internal controls and regulatory standards.

• Analyze system performance, availability, and cost data to continually optimize operations.

• Provide reliability support and escalation guidance for critical production systems during major incidents

Skills

Experience and Qualifications

• 5+ years of experience in SRE or DevOps roles, building and managing large-scale, high-availability systems across banking, fintech, e-commerce, or other data-intensive digital ecosystems.

• Bachelor’s degree in Computer Science or equivalent technical experience.

• Strong experience with Linux environments and performance troubleshooting.

• Proven expertise in Terraform and Infrastructure as Code (IaC) methodologies.

• Proficiency with Kubernetes and container orchestration in microservices environments.

• Hands-on experience with AWS (preferred); exposure to Azure or GCP is an advantage.

• Deep knowledge of Dynatrace (AIOps, Davis AI), Prometheus, Grafana, and the ELK stack.

• Experience implementing AI / ML-driven reliability or automation solutions (AIOps, anomaly detection, predictive alerting).

• Practical understanding of CI / CD pipelines (GitHub Actions, Jenkins, GitLab CI / CD or Azure DevOps).

• Experience with Kafka, RabbitMQ, Redis, Aurora, and RDS databases.

• Strong scripting or programming skills in Python, Bash, or Go.

This job post has been translated by AI and may contain minor differences or errors.

Preferred candidate

Years of experience

5 - 10 years

Loading... Email to Friend Complete Questionnaire

Compare your profile with other applicants

Cancel

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.

MANAGE

Job alert created for this search. You’ll receive updates when new jobs match.

Manage alerts

Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.

People who applied to this job also applied to

Senior Software Engineer
Silver Rabbit LLC
Aiea · USA

19 days ago · Easy Apply
DevOps Engineer
TASC Outsourcing
Dubai · UAE

18 days ago · Easy Apply
مطور ويب (Web Developer – AI-Focused)
Hikma Institute
Khobar · Saudi Arabia

10 days ago · Easy Apply

People who viewed this job also viewed

Software Engineer – Kuwait
Diyar United Company
Al Kuwait · Kuwait

30+ days ago · Easy Apply
Service Fulfilment Developer | Retail | IKEA RSO Jebel Ali
Al Futtaim Group
Dubai · UAE

30+ days ago
System Analyst
InnovaziT (A Happiest Minds company)
Abu Dhabi · UAE

30+ days ago · Easy Apply
Site Reliability Devops Engineer
InnovaziT (A Happiest Minds company)
Abu Dhabi · UAE

30+ days ago · Easy Apply
DevOps Engineer
TASC Outsourcing
Dubai · UAE

18 days ago · Easy Apply

See other jobs by
InnovaziT (A Happiest Minds company)

HR Business Analyst
InnovaziT (A Happiest Minds company)
Abu Dhabi · United Arab Emirates

25 days ago · Easy Apply
Senior Devsecops Engineer
InnovaziT (A Happiest Minds company)
Abu Dhabi · United Arab Emirates

30+ days ago · Easy Apply

View all jobs

Upgrade to Premium

Promote your job application to the top.