Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/VUN9DfzM4ke2oyzCA
Back to the job results

Principal Software Engineer (SRO)

24 days ago 2026/06/30
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Roles and Responsibilities

What you will be doing?
As part of the Application Observability (AppO) team, your responsibilities will include:
1. Defining and refining monitoring and alerting rules, both for the team and organisation wide
2. Work together with other teams (Platform and Observability Backend) to enhance performance and fulfil user stories
3. Leading projects such as Grafana's migration from on-premises data centers to AWS by planning, defining requirements, supervising and implementing
4. Improving the deployment of services using Git workflows and ArgoCD
5. Proposing and validating performance and user experience improvements for AppO services
6. Addressing issues, implementing preventive measures and managing postmortems and related improvement tasks
7. Analysing performance, identifying anomalies and defining, documenting and implementing corrective measures
Ensuring compliance with the SLA
8. Additionally, you will participate in the on-call rotation for team services, which requires the ability to resolve issues (using runbooks) knowledge on skill like (Elasticsearch, ThanosKafka, OpenTelemetry, Grafana and Docker)
Three KEY domain exposure:
1. DevOps
2. Platform Engineering
3. Application Observability



Additional Responsibilities

* Good knowledge on software configuration management systems
* Strong business acumen, strategy and cross-industry thought leadership
* Awareness of latest technologies and Industry trends
* Logical thinking and problem-solving skills along with an ability to collaborate
* Two or three industry domain knowledge
* Understanding of the financial processes for various types of projects and the various pricing models available
* Client Interfacing skills
* Knowledge of SDLC and agile methodologies
* Project and Team management



Technical Requirements

* Technology->DevOps->Site Reliability Engineering (SRE)



Job Description

We are on the lookout for a skilled Principal software engineer(Lead Role) with a strong background in DevOps and platform engineering to join our Application Observability team. This team plays a critical role in managing stateful services within the Service Reliability and Observability (SRO) department.
The SRO department provides innovative observability solutions and standardised methods to enhance the efficiency and reliability of IT systems, simplifying tasks for both infrastructure and software engineers.
The Application Observability (AppO) team focuses on driving observability forward by utilizing tools such as Open Telemetry, Elastic Stack, Prometheus and Grafana.


This job post has been translated by AI and may contain minor differences or errors.
You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.