Job description
Project Role : Large Language Model Architect
Project Role Description : Architect large language models (LLMs) that can process and generate natural language. Design neural networks whose parameters are trained on large quantities of unlabeled text data.
Must have skills : Large Language Models (LLMs)
Good to have skills : NA
Minimum 3 years of experience is required
Educational Qualification : 15 years of full-time education
Summary:
As a Large Language Model Architect, your day involves designing and structuring advanced language models capable of understanding and generating human-like text. You will focus on creating neural network configurations that efficiently process vast amounts of unstructured text data. Your role includes continuous refinement of model architectures to enhance performance and adaptability, collaborating with various teams to ensure the models meet project goals and real-world application needs. This position requires a thoughtful approach to managing complex data flows and optimizing computational resources to deliver innovative language solutions.
Roles & Responsibilities:
- Perform independently and become a subject-matter expert (SME).
- Participate actively in and contribute to team discussions.
- Contribute solutions to work-related problems.
- Collaborate with cross-functional teams to align model design with project objectives and operational requirements.
- Continuously evaluate and improve model efficiency and accuracy through experimentation and analysis.
- Document architectural decisions and model development processes to support knowledge sharing and future enhancements.
- Assist junior team members by providing guidance and support in understanding model design principles.
Professional & Technical Skills:
- Must-Have Skills: Proficiency in Large Language Models (LLMs).
- Experience in designing and tuning neural network architectures for natural language processing tasks.
- Strong understanding of language model training techniques using large-scale unlabeled datasets.
- Familiarity with optimization methods to improve model performance and reduce computational costs.
- Ability to analyze and interpret model outputs to guide iterative improvements.
- Knowledge of scalable computing environments and resource management for training large models.
Additional Information:
- The candidate should have a minimum of 3 years of experience in Large Language Models (LLMs).
- This position is based at our Bhubaneswar office.
- 15 years of full-time education is required.
This job post has been translated by AI and may contain minor differences or errors.