Job description
Our customer's product is an AI-powered platform that helps businesses make better decisions and work more efficiently.
It uses advanced analytics and machine learning to analyze large amounts of data and provide useful insights and predictions.
The platform is widely used in various industries, including healthcare, to optimize processes, improve customer experiences, and support innovation.
It integrates easily with existing systems, making it easier for teams to make quick, data-driven decisions to deliver cutting-edge solutions.
Employment-based cooperation Awesome projects with an impact Comprehensive insurance for you and your family (health, life, and accident) Paid PTO policy (vacation, sick leaves, and public holidays) Tech equipment provided Udemy courses, workshops, trainings & expert knowledge-sharing Clear career path Flexible hours & work setup Bachelor’s or Master’s degree in Computer Science or a related field.
Strong Python coding skills - 7+ years.
2+ years of hands-on experience with machine learning and production LLM systems.
Experience building backend APIs with FastAPI, async patterns, rate limiting, and SQLAlchemy - 3+ years.
Experience designing maintainable and extensible systems using dependency injection, interfaces, and abstract base classes.
Experience with vector databases such as Pinecone, Weaviate, or Chroma, as well as hybrid search.
Strong understanding of RAG architectures, including retrieval, reranking, context assembly, and response generation.
Hands-on experience with LangChain and LangGraph for building and orchestrating LLM workflows.
Advanced Python skills, including async/await, type hints, Pydantic, and SOLID principles.
MLOps experience with MLflow, model versioning, and A/B testing; experience with Langfuse is a plus.
Experience in NLP and computer vision, including document understanding, OCR, and GPT-4 Vision.
Experience building feature pipelines, real-time and batch inference systems, and model serving.
Hands-on experience with Hugging Face is required; experience with LlamaIndex is a plus.
Nice to have skills: Understanding of DevOps, CI / CD including: Docker containerization, Azure DevOps pipelines or GitHub Actions, Kubernetes (nice to have); Data security including: Multi-tenant data isolation, Secure key management (Azure Key Vault), Audit trail implementation; Experience in designing on cloud platform including: Azure (strongly preferred): Azure OpenAI, Blob Storage, Key Vault, Container Registry, AWS or GCP; Experience in data engineering in Big Data systems including: Large-scale data processing, ETL/ELT pipelines.
Responsibilities: Design and build scalable backend systems, APIs, and microservices with FastAPI; Write high-quality backend code using Python, SQL, async/await, and solid OOP principles; Apply software best practices to ensure reliability, scalability, and on-time delivery; Implement dependency injection, layered architectures, and SOLID design patterns; Integrate Azure OpenAI (GPT-4, GPT-4 Vision) with robust retry and error handling; Build LLM observability with Langfuse (prompts, tokens, cost, latency); Develop prompt management with versioning, fallbacks, and cost optimization strategies; Orchestrate async workflows using Celery for complex pipelines; Design multi-tenant architectures with strict data isolation; Integrate third-party APIs (Veeva Vault, Adobe PDF Services, OCR); Troubleshoot systems using structured logging and distributed tracing; Document APIs and changes using OpenAPI/Swagger.
This job post has been translated by AI and may contain minor differences or errors.