Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/DBmf2muaMtysY31Q9
Back to the job results

DevOps Engineer SE II - GCP & AI

30+ days ago 2026/09/03
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Responsibilities: Infrastructure Ownership: Own Helpshift production services and ensure complete monitoring coverage, troubleshoot and fix production issues.
Infrastructure as Code (IaC): Design and maintain scalable GCP infrastructure using Terraform o AI Orchestration & LLMOps: Build deployment pipelines for AI agents, managing vector databases (e.
g., Vertex AI Search, Pinecone, Weaviate, ElasticSearch) and model endpoints.
Security (DevSecOps): Implement "Security-by-Design," including IAM least-privilege access, secret management (Secret Manager), and automated vulnerability scanning for AI workloads.
CI/CD Excellence: Architect high-velocity pipelines for both traditional microservices and AI model prompts/configurations.
Design, implement, and maintain secure CI/CD pipelines for automating deployment, configuration, and testing processes.
Observability: Set up comprehensive monitoring for system health and LLM-specific metrics (latency, token usage, and cost) Cloud Governance: Optimise GCP costs and manage resource quotas, especially for GPU/TPU-intensive AI tasks.
Cross Cloud Deployment: Establish & Optimise the connectivity among apps deployed in different cloud environments (AWS <> GCP) Requirements Relevant experience of 6+ years and above Expert-level Google Cloud Platform (GCP) administration skills: GKE, Cloud Run, Vertex AI, GCS, NEG etc Experience deploying Vector Databases (Pinecone, Weaviate, ElasticSearch or Vertex Search) and managing API rate limits/throttling for LLM providers.
Setting up Cloud Monitoring/Logging specifically for AI metrics: token consumption, inference latency, and model error rates.
In-depth knowledge of running/managing UNIX-like operating systems (we use Ubuntu) Strong knowledge of networking protocols, security architectures, and identity and access management (IAM) principles.
Experience with containerisation technologies (e.
g., Docker, Kubernetes) and securing containerised environments.
Proficiency in Python and Bash Experience in designing and building solutions that are highly scalable, fault tolerant and cost-effective Experience with IaaC tools like Ansible, Terraform.
Ability to analyse bottlenecks in architecture and quickly debug to reach a resolution for issues Have an automation mindset and ability to reason and work with complex systems.
Excellent communication and documentation skills Quick learner and good mentor for junior team members
This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.