DevOps Engineer (Observability)

30+ days ago 2026/07/16
Other Business Support Services

Job description

Our Company


At Teradata, we believe that people thrive when empowered with better information. That's why we built the most complete cloud analytics and data platform for AI. By delivering harmonized data, trusted AI, and faster innovation, we uplift and empower our customers, and our customers' customers, to make better, more confident decisions. The world's top companies across every major industry trust Teradata to improve business performance, enrich customer experiences, and fully integrate data across the enterprise.


What You'll Do
  1. Working on a team of professionals, you will manage, configure, and support observability tooling for Teradata's product offerings across all three major cloud service providers (AWS, Azure, and Google Cloud).
  2. You will define, configure, and deploy monitoring to measure performance, scalability, reliability, and resiliency, and alert when critical thresholds are crossed.
  3. You will build concise, impactful dashboards displaying infrastructure-level and application-level telemetry for both internal and external audiences.
  4. You will monitor all the layers of Teradata's application stack, from the customer-facing interface all the way through the backend, including all services, network layers, databases, and cloud service provider integrations.
  5. You will constantly seek to reduce mean-time-to-discover and mean-time-to-recover through improvements to Teradata's observability tooling.
Who You'll Work With
  1. You'll work closely with product engineering and cloud operations personnel to help administer all aspects of Teradata's observability tooling in pre-production and production environments.
  2. You'll work with security and compliance teams to help provide evidence necessary to meet Teradata's compliance obligations.
  3. You'll report to a Sr. Manager, Site Reliability Engineering.
What Makes You A Qualified Candidate
  1. Experience with at least one major cloud service provider (AWS, Azure, and/or Google Cloud), preferably all three.
  2. 2+ years of administrative-level experience with Grafana or an equivalent observability tool, including but not limited to onboarding users; defining group policies; authoring monitors, alerts, and dashboards; and integration with other enterprise applications such as ServiceNow.
  3. Experience with an infrastructure-as-code (IaC) cloud provisioning tool, preferably Terraform.
  4. Strong scripting skills with a modern programming language such as Python.
  5. Experience with a configuration management tool such as Ansible or Puppet.
  6. Experience with a build/deployment automation tool such as Jenkins or Bamboo.
  7. Experience with at least one modern source control tool, preferably Git.
  8. Experience with at least one modern defect tracking tool, preferably Jira.
  9. Familiarity with both SQL and NoSQL databases, and use cases for each.
  10. Experience administering Linux-based systems.
What You'll Bring
  1. 3 to 4 years of experience in the software industry in a DevOps or site reliability engineering role.
  2. A passion for constant, iterative improvement over the status quo.
  3. An in-depth understanding of infrastructure-level and application-level monitoring principles and practices, across both production and non-production environments.
  4. An understanding of enterprise software deployment and security/compliance principles.
  5. Proficiency with multi-layered technical troubleshooting and root-cause analysis.
  6. The ability to quickly and comprehensively decompose a problem, identifying dependencies and defining tasks.
  7. The ability to work both independently and collaboratively in a fast-paced environment, and adjust as priorities change.
  8. The flexibility to work on a globally distributed team managed from the United States.

This job post has been translated by AI and may contain minor differences or errors.
