Submitting more applications increases your chances of landing a job.
Here’s how busy the average job seeker was last month:
Opportunities viewed
Applications submitted
Keep exploring and applying to maximize your chances!
Looking for employers with a proven track record of hiring women?
Click here to explore opportunities now!You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for
Would You Be Likely to Participate?
If selected, we will contact you via email with further instructions and details about your participation.
You will receive a $7 payout for answering the survey.
Valeo is a tech global company, designing breakthrough solutions to reinvent the mobility. We are an automotive supplier partner to automakers and new mobility actors worldwide. Our vision? Invent a greener and more secured mobility, thanks to solutions focusing on intuitive driving and reducing CO2 emissions. We are leader on our businesses, and recognized as one of the largest global innovative companies.
Mission
The SysOps/SRE Lead is responsible for the implementation, architectural reliability, scalability, and operational health of the R&D infrastructure worldwide. You will bridge the gap between development and operations by defining and applying engineering practices to system administration.
As a Lead, you are the owner of the platform’s and the architect of the team’s operational excellence, ensuring that our global toolchain remains resilient under pressure. while Also planning, organizing, leading and controlling SRE Teams Implementation timeline and standards adherence.
Responsibilities
- Operational Excellence: Define and manage the On-Call schedules and rotations, ensuring global coverage models are sustainable, fair, and optimized for rapid incident response.
- Incident Leadership: Serve as the primary escalation point for critical infrastructure and critical applications production incidents. Lead "Blameless Post-Mortems" to identify root causes and ensure permanent remediation, also manage and investigate normal Application incidents according to the defined SLA/SLO.
- Health Oversight: Define, implement and Maintain daily oversight of system health monitors and error budgets. You are expected to identify systemic weaknesses and performance regressions before they impact the end-user.
- Documentations: Ensure Excellent documentation of implementations, runbooks, and best practices.
- Planning: Drive the definition and planning and implementation of Program Increment (PI) related infrastructure activities. You will translate business requirements into technical roadmaps and ensure the team remains aligned with organizational milestones.
- Delivery Excellence: Ensure the team meets defined schedules and technical benchmarks. You will balance the "50/50" SRE split—ensuring that at least half of the team’s capacity is dedicated to engineering improvements rather than reactive toil.
- Standards & Compliance: Define and enforce standards for Cloud Center, security patching, and infrastructure-as-code (IaC). You hold the final approval for production readiness.
- Infrastructure Evolution: Oversee the design of secure, multi-region AWS environments, focusing on high availability and cost-optimization. by fine tuning Systems/applications/infrastructure resources for best performance, while keeping cost lower
- Observability Framework: Standardize our monitoring stack (Prometheus, Grafana, ELK) to move from basic alerting to proactive, functional observability.
- Mentorship: Provide technical mentorship to junior and mid-level engineers, fostering a culture of automation, documentation, and continuous improvement.
- Reporting and Governance: Define and report KPIs to asses platform health and service level End-2-End.
Qualifications/Technical Skills Required
- Excellent knowledge in Windows/Linux Administration
- Excellent knowledge in AWS Preferred (Or any similar Cloud Providers)
- Excellent knowledge in Terraform and Ansible
- Excellent knowledge in Docker or any containerization technology
- Excellent knowledge in Jenkins
- Excellent knowledge in Kubernetes
- Excellent knowledge in observability tools
- Excellent knowledge in Git (Source control tools)
- Very Good knowledge in any scripting language for automations
- very good knowledge in SAFe/agile Methedologies
- English Language is a must
- Professional Experience of 8+ years is required
Job:
Organization:
Schedule:
Employee Status:
Job Type:
Job Posting Date:
Join Us !
Being part of our team, you will join:
- one of the largest global innovative companies, with more than 20,000 engineers working in Research & Development
- a multi-cultural environment that values diversity and international collaboration
- more than 100,000 colleagues in 31 countries... which make a lot of opportunity for career growth
- a business highly committed to limiting the environmental impact if its activities and ranked by Corporate Knights as the number one company in the automotive sector in terms of sustainable development
More information on Valeo: https://www.valeo.com
You'll no longer be considered for this role and your application will be removed from the employer's inbox.