Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/MJiijf5Bd8U5ePWv5
Back to the job results

Senior Machine Learning Engineer / Technical Lead

Today 2026/09/10
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

The Role We are looking for a hands-on Senior ML Engineer to lead the development and operation of production-grade AI systems across LLMs, OCR, and voice.
This role combines deep technical ownership with leadership of a high-performing team of junior engineers, as well as direct engagement with stakeholders to shape AI solutions.
Responsibilities: * Lead and mentor a team of highly talented junior ML engineers through: * Code reviews, design reviews, and technical direction * Enforcement of strong software engineering and ML best practices * Design, deploy, and operate scalable AI systems with a focus on reliability and performance * Lead production deployment of LLMs and multimodal systems (RAG, OCR, voice) * Own **model performance end-to-end**, combining evaluation, observability, and hardware optimization: * Build evaluation pipelines (benchmarks, regression testing, LLM-as-judge) * Implement deep observability (tracing, latency, error tracking) * Optimize GPU utilization (multi-GPU serving, batching, quantization, memory tuning) * Continuously improve throughput, latency, and cost efficiency * Architect and manage GPU infrastructure: * Model serving, load balancing, and scaling strategies * Hardware-aware deployment and performance tuning * Build and maintain robust MLOps pipelines: * Model/version management, CI/CD, automated testing, and rollback strategies * Monitoring and feedback loops for continuous improvement * Engage directly with clients and stakeholders to: * Gather and clarify business requirements * Translate non-technical needs into well-defined technical problems * Communicate solutions, trade-offs, and progress through clear documentation, reports, and proposals * Contribute hands-on to system design, implementation, debugging, and production incident resolution * Proven experience deploying LLMs in production * Strong experience with GPU-based inference and optimization * Solid backend engineering skills (Python, APIs, distributed systems) * Experience with MLOps and production ML systems * Experience with OCR/document AI and/or voice systems (STT/TTS) * Experience with Docker and Kubernetes * Strong understanding of modern AI architectures (RAG, vector DBs, agent workflows) * Experience mentoring or leading engineers * Strong communication skills with the ability to bridge business and technical domains Nice to Have: * Experience with open-weight models (Qwen, Llama, DeepSeek, Gemma) * Experience with on-prem / sovereign AI deployments * Experience with LoRA / fine-tuning * Multilingual or Arabic NLP experience
This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.