Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!

We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.

https://bayt.page.link/yUStdPRApjzBaEpF6

Back to the job results

Senior Software Engineer Modelzoo

- NXP
- India

21 days ago 2026/10/11

Complete Questionnaire

Apply on company site

Other Business Support Services

Create a job alert for similar positions

Job alert turned off. You won’t receive updates for this search anymore.

Undo

Job description

Job Title: Software Engineer (2-3 years experience)

Job Summary

We're looking for a skilled and motivated Machine Learning Software to join our team. The ideal candidate will have a solid foundation in deep learning and a strong interest in optimizing and deploying ML models on specialized hardware. This role involves implementing model optimizations, with a particular focus on quantization, to improve the performance of machine learning inference on target platforms.

Key Responsibilities

Model Porting & Deployment: Port and deploy deep learning models from frameworks like PyTorch and TensorFlow to proprietary or commercial ML accelerator hardware platforms.
Performance Optimization: Analyze and improve the performance of ML models for target hardware, focusing on latency and throughput.
Quantization: Contribute to model quantization efforts (e.g., INT8) to reduce model size and accelerate inference while maintaining model accuracy.
Profiling & Debugging: Use profiling tools to identify and fix performance bottlenecks in the ML inference pipeline on the accelerator.

Required Qualifications

Technical Skills:

Proficiency in deep learning frameworks such as PyTorch and TensorFlow.
Hands-on experience with deploying and optimizing models on GPUs or other specialized accelerators.
Some experience with model quantization (Post-Training Quantization).
Strong proficiency in C++ and Python.
Experience with GPU programming models like CUDA/cuDNN is a plus.
Familiarity with ML inference engines and runtimes (e.g., TensorRT, OpenVINO, TensorFlow Lite).
Foundational understanding of computer architecture principles.

Version Control: Proficient with Git and collaborative development workflows.
Education: Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.

Preferred Qualifications

Knowledge of hardware-aware model design.
Familiarity with compiler technologies for deep learning.
Experience with real-time or embedded systems.
Knowledge of cloud platforms (AWS, GCP, Azure).
Experience with CI/CD pipelines for ML models.

More information about NXP in India...

#LI-2734

This job post has been translated by AI and may contain minor differences or errors.

Apply on company site Email to Friend Complete Questionnaire

Compare your profile with other applicants

Cancel

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.

MANAGE

Job alert created for this search. You’ll receive updates when new jobs match.

Manage alerts

Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.