Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/3KXxNF6xRyBTLWyVA
Back to the job results

Site Reliability Engineer II

21 days ago 2026/08/09
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Play a key role in ensuring system reliability at one of the world’s most iconic and largest financial institutions.


As a Site Reliability Engineer II at JPMorgan Chase within the Asset and Wealth Managment , you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you’ll also have the opportunity to collaborate with cross functional teams to continually improve your level of knowledge about JPMorgan Chase’s business and relevant technologies.


Job responsibilities


  • Designs, develops, tests, and delivers automation solutions using Python, with a focus on AI integration to optimize operational workflows.
  • Collaborates with application teams to establish and monitors Site Reliability Engineering principles, including SLOs and SLIs, and implement data-driven approaches to improve service levels.
  • Leads automated software upgrades, change management, release management, and self-healing solutions, utilizing AWS as the primary cloud platform.
  • Manages application deployment, incident resolution, capacity planning, and reporting to ensure stable and high-performing platforms.
  • Conducts performance testing, identify bottlenecks, and drive continuous optimization and capacity improvements.
  • Defines and implements best-in-class monitoring frameworks for end-to-end observability and noiseless alerting, using industry-standard tools such as Prometheus, Grafana, Splunk, and AWS-native services.
  • Facilitates blameless post-mortems and ensure permanent closure of incidents.
  • Coaches and mentors team members, manage small to medium projects independently, and contribute to larger initiatives.
  • Writes high-quality, maintainable code following software engineering best practices.
  • Proactivelys identify and eliminate manual toil through automation and systems engineering.
  • Implements and enhances observability patterns, service level indicators, and alerting solutions for optimal transparency.

Required qualifications, capabilities, and skills


  • Formal training or certification on software engineering concepts and 2+ years applied experience
  • Advanced understanding of application monitoring stacks (Metrics, Events, Traces, Alerts, Logs) and end-to-end observability for AWS infrastructure and applications.
  • Strong experience with AWS services, Kubernetes, containerization and expertise in networking technologies and maintaining AWS-based infrastructure.
  • Proficiency in Python programming and AI implementation with Good SQL skills and database experience.
  • Experience with CI/CD tools (Jenkins, Git, Terraform, etc.) and automation tools (Ansible, Puppet).
  • Knowledge of resiliency patterns, self-healing, chaos engineering, and performance monitoring.
  • Strong grasp of SRE concepts (SLOs, SLIs, Error Budgets) and thorough knowledge of web services (Apache, Tomcat, SOAP, REST).
  • Experience working in Agile environments and delivering IT projects on time and within budget.
  • Ability to articulate technical strategies to management and collaborate effectively within large teams.
  • Familiarity with observability tools (Grafana, Prometheus, Datadog, Splunk, Dynatrace) and site reliability practices.
  • Eagerness to learn and apply new methodologies and technologies to enhance system effectiveness and ability to create impact at scale through automation and AI-driven solutions.


Preferred qualifications, capabilities, and skills


  • General knowledge of financial services industry

This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.