Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/F84LZCD4n1fxsY427
Back to the job results

AI Retrieval & Agent Platform Engineer

12 hours ago 2026/10/31
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

We're building the next generation of enterprise AI infrastructure — and we need you to help make it intelligent, fast, and reliable. As an AI Retrieval & Agent Platform Engineer , you'll be a core contributor to our AI Factory , designing the retrieval and agent connectivity layer that powers our AI-driven decision-making at scale. If you're passionate about RAG pipelines, vector databases, and agent tooling — this is your role. How You’ll Make an Impact (responsibilities of role) Vector DB & Hybrid Retrieval Stand up and tune vector databases (Pinecone/Weaviate/Qdrant/AWS-native) for similarity search at scale. Design hybrid retrieval combining vector semantic search with graph context and business logic filters; implement re-ranking. Manage embedding lifecycle (choice, diversity, refresh cadence, cold-start strategies). RAG & Contextualization Build RAG pipelines pulling structured/unstructured context; implement chunking, metadata, and guardrails. Integrate graph-derived context windows for multi-hop reasoning in agent workflows. Agent Connectivity (MCP) & Tooling Implement MCP-based tool discovery/invocation for agent ↔ system interaction. Wrap enterprise systems (Snowflake/MongoDB/SharePoint/ERP) as reusable tools/skills with clear schemas/capabilities. Represent tool capabilities & dependencies as graph processes for orchestration; collaborate with graph team. Observability & Feedback Loops Instrument agent KPIs (latency, accuracy, relevance, cost/execution); implement tracing across retrieval/graph layers. Build dashboards and automated feedback loops (e.g., low relevance → retraining/embedding refresh; failures → rule updates). Optimize cloud architecture for performance, cost, security; maintain SLOs. Cloud & Performance Deploy and scale retrieval services, vector stores, and agent endpoints on AWS (IAM, VPC, S3, Lambda, EKS/ECS, DynamoDB). Conduct performance profiling, caching strategies, and cost optimization (e.g., batch upserts, ANN index selection, sharding). What You Bring (required qualification and skill sets) Bachelor’s/Master’s in CS , Data Science, Engineering, or related field. 3–10 years in IR/Retrieval systems, vector DBs, or agent platform engineering . Hands-on with Pinecone/Weaviate/Qdrant (at least one in production), embeddings, ANN indexes, and hybrid ranking. Experience building RAG pipelines and contextualization strategies with LLMs. Strong Python 3.10 + backend engineering skills (FastAPI, FastMCP) Own CI/CD pipelines via GitLab CI and containerized deployments Exposure to Neo4j / AWS Neptune Expertise in bedrock/AWS AgentCore Familiarity with graph queries (Cypher/Gremlin/SPARQL) to leverage semantic context. AWS infrastructure knowledge and provisioning IAC as Preferred Qualifications Experience with MCP or equivalent agent–tool interoperability patterns ; skill registries and capability discovery. Observability stack: OpenTelemetry, Prometheus/Grafana, distributed tracing; KPI-driven optimization. Knowledge of LangChain/LlamaIndex, vector re-ranking, prompt caching, and safety/guardrail mechanisms. Exposure to Neo4j/Neptune/TigerGraph; event streaming (Kafka/Kinesis) for ingestion/update triggers. Hands-on with LangGraph, LlamaIndex, re-ranking, prompt caching, and guardrail mechanisms
This job post has been translated by AI and may contain minor differences or errors.
You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.