Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/MP3gERbGp3XHHixe9
Back to the job results

Manager Systems Administration

2 days ago 2026/09/11
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Overview About Business Unit: At the core of all that Epsilon does is a team that sets the foundation of our IT infrastructure. The team drives innovation and efficiency through pioneering technology across Epsilon's platforms and business verticals. From being the first point of contact for infrastructure needs to final deployment, the team provides end-to-end solutions for our client-facing platforms. ETS supports all aspects of revenue-generating platforms for Epsilon and sets the architectural direction for our enterprise deployments. By adopting the newest technologies, such as Cloud, Automation, and Artificial Intelligence, the team is at the front of redefining our digital business and capturing new opportunities. Why we are looking for you: You need to oversee operational activities, proactively identify and implement service improvements within the Operations Centre. You should have good knowledge in Kubernetes, with hands-on experience using platforms, such as Google Kubernetes Engine (GKE) or Amazon Elastic Kubernetes Service (EKS) You understand are have experience setting up of enterprise observability, alerting, and manage incident command at scale. You should have a deep understanding of Linux, networking protocols (TCP/IP, DNS), and distributed systems. Strong skills in monitoring tools (e.g., OpsRamp, ThousandEyes, Grafana, Prometheus) and establishing observability practices. Cross-functional abilities to collaborate skills to align with product and engineering partners. You should define and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs). You should act as an escalation point for critical production outages. What you will enjoy in this role: Leading a critical Operations Command Center supporting enterprise-scale infrastructure and services Develop internal processes and procedures to enhance operational effectiveness based on industry best practices, creating and maintaining an operational centre of excellence. Working closely with global clients and internal partners in a high-visibility role Proactive in identifying service improvements and operational efficiencies Driving continuous service improvement and operational excellence initiatives Operating in a fast-paced, dynamic environment that values accountability and leadership Click here to view how Epsilon transforms marketing with 1 View, 1 Vision and 1 Voice. Responsibilities Lead, mentor, and upskill a high‑performing DevOps / SRE‑oriented operations team, fostering strong engineering ownership Partner with engineering teams to improve CI/CD reliability, release safety, and change automation. Own the observability strategy across metrics, logs, and traces to provide real‑time and predictive insights into system health. Ensure effective use and optimization of monitoring, alerting, and log analytics platforms (e.g., ELK, Splunk, Zabbix, OpsRamp). Continuously tune alerts to minimize noise and improve signal quality, enabling faster detection and resolution. Ensure consistent observability of platforms and services to maintain optimal client product uptime. Ensure incident, problem, and change processes are lightweight, automated, and outcome‑focused, aligned with ITIL where appropriate. Drive service continuity, resilience testing, and disaster recovery readiness. Maintain consistent connect with peer teams for smooth operational efficiencies. Participate in RCA of issues and address any monitoring/process gaps noticed during any major incident. Enables Operations teams through coaching and hands-on guidance to build capabilities and achieve strategic goals. Ensure all incidents and requests are processed using documented processes and adhere to SLA/OLA obtainment Maintain consistent ticket quality through regular quality assessments. Present management level reporting through weekly and monthly summary of incidents/tickets/projects and any challenges observed. Evaluates production change proposals and ensures smooth, well governed implementation through deliberate risk‑ aware‑ decisions. Ensures service continuity plans are compatible with operational delivery centre operations and that plans are tested regularly Review and analyse all Operations management toolsets for best of bread environmental event management. Client coordination for preparing Operational run books Maintains operational logs and journals on all events, reported issues, warnings, alerts, and alarms, recording and classifying all messages. Maintains operational documentation, processes, management, and diagnostic tools, ensuring that services are maintained at the agreed levels. Ensure that maintenance tasks are completed as per procedural documentation for scheduled BAU tasks and client-specific infrastructure. Qualifications Bachelor’s degree in engineering, Computer Science, IT, or equivalent field 12 - 15 years of related experience Strong verbal/written communication Command Centre/NOC/SOC experience Familiarity with application lifecycle and IT Service Management concepts. Additional Information Epsilon is a global data, technology and services company that powers the marketing and advertising ecosystem. For decades, we’ve provided marketers from the world’s leading brands the data, technology and services they need to engage consumers with 1 View, 1 Vision and 1 Voice. 1 View of their universe of potential buyers. 1 Vision for engaging each individual. And 1 Voice to harmonize engagement across paid, owned and earned channels. Epsilon’s comprehensive portfolio of capabilities across our suite of digital media, messaging and loyalty solutions bridge the divide between marketing and advertising technology. We process 400+ billion consumer actions each day using advanced AI and hold many patents of proprietary technology, including real-time modeling languages and consumer privacy advancements. Thanks to the work of every employee, Epsilon has been consistently recognized as industry-leading by Forrester, Adweek and the MRC. Epsilon is a global company with more than 9,000 employees around the world. Our pillars aren't just words. They're how we show up every day. People centricity: We focus on employee well-being in an environment where colleagues truly care about each other. Collaboration: We work together, support one another, and collectively achieve goals. Growth: There are endless opportunities for growth through learning, development and career advancement. Innovation: We drive progress through cutting-edge solutions and forward-thinking approaches. Flexibility: We’ve created a balance between work and personal life, and we encourage adaptability to solve problems creatively. Our values guide us to create value for our clients, our people and consumers. Act with integrity Work together to win together Innovate with purpose Respect all voices Empower with accountability These pillars and values are our foundation—shaping our culture, guiding our decisions, and uniting us in common purpose. Epsilon is an Equal Opportunity Employer. Epsilon is committed to promoting diversity, inclusion, and equal employment opportunities by using reasonable efforts to attract, recruit, engage and retain qualified individuals of all ethnicities and backgrounds, including, but not limited to, women, people of color, LGBTQ individuals, people with disabilities and any other underrepresented groups, traits or characteristics.

This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.