Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/qSRyLgrB12J2afg66
Back to the job results

Computer Vision Research Internship: Image to Sequence Modeling (e.g. Transformers)

30+ days ago 2026/05/14
General Engineering Consultancy
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Duration: Minimum 6 months; ideally 9–12 months, depending on the candidate’s experience


Scandit gives people superpowers. Whether enabling delivery drivers to make quicker deliveries, matching a patient with their medication, or allowing retailers to make store operations more efficient, our technology automates workflows and provides actionable insights to help businesses in a variety of industries. Join us, as we continue to expand, grow and innovate, and help take Scandit to the next level.


About the Internship

We are offering a research-focused internship aimed at advancing machine learning methods for complex visual understanding tasks. The project centers on deep learning architectures for image-to-sequence modelling, such as Transformers, attention mechanisms, and modern sequence and representation-learning frameworks, to address challenging and highly structured computer vision problems. This project contributes to long-term research efforts aimed at achieving even higher performance, robustness, and generalization in large-scale visual applications.


What you will do

You will work closely with experienced ML researchers and engineers on cutting-edge research at the intersection of computer vision and sequence modeling. Your work will include:


  • Designing and experimenting with new ML architectures for structured visual data.
  • Evaluating alternative modeling paradigms (e.g., encoder–decoder, hybrid Transformer models, sequence-based representations).
  • Investigating techniques for improving robustness, generalization, and multi-view reasoning.
  • Running systematic experiments, ablations, and error analyses to validate research hypotheses.

This project provides opportunities for novel model design, extensive experimentation, and scholarly research. You will contribute to long-term innovation in our technology, with potential real-world impact for millions of users. An ideal position for experienced master’s students, PhD collaborations, or candidates preparing for a research career in industry or academia.


Who you are

MSc or PhD student in Computer Science, Machine Learning, Artificial Intelligence, or a related field with a strong research focus. Candidates should have a solid foundation in machine learning theory, neural networks, and computer vision.


Essential Skills:


  • Proficiency in Python and deep learning frameworks such as PyTorch.
  • Practical experience designing, training, and evaluating neural networks, including CNNs and Transformer-based architectures.
  • Strong analytical and problem-solving abilities, with the capability to interpret experimental results and iterate effectively.
  • Familiarity with research best practices, including reproducibility, controlled experiments, and ablation studies.

Desirable Skills:


  • Prior research experience in computer vision, pattern recognition, sequence modeling, or image-to-sequence architectures.
  • Experience training large-scale models or working with foundation-style architectures.
  • Contributions to publications, preprints, or open-source machine learning projects.

Strong communication skills and the ability to work independently in a research-oriented environment.


What We Offer
  • We are certified as a “Great Place to Work” in 10 countries!
  • A highly skilled team and a fun environment where you can put your enthusiasm for computer vision challenges and cutting-edge technologies to use
  • Hackathons, summer parties, company outings and other regular events
  • Office in the city center of Zurich
Who We Are

Could your code give superpowers? Whether enabling delivery drivers to make quicker deliveries, matching a patient with their medication or allowing retailers to make store operations more efficient, our technology automates workflows and provides actionable insights to help businesses in a variety of industries. This means we have no shortage of technical challenges for engineers like you. Join us, as we continue to expand, grow and innovate, and help take Scandit to the next level.


“Everybody is welcome here” - Is a celebrated component of our DNA.


At Scandit we strive to create an inclusive environment that empowers our employees. We believe that our products and services benefit from our diverse backgrounds and experiences and are proud to be a safe space for all.


All qualified applications will receive consideration for employment without regard to race, colour, nationality, religion, sexual orientation, gender, gender identity, age, physical [dis]ability or length of time spent unemployed.


#LI-MB1


#Engineering


#Hybrid


This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.