Submitting more applications increases your chances of landing a job.

Here’s how busy the average job seeker was last month:

Opportunities viewed

Applications submitted

Keep exploring and applying to maximize your chances!

Looking for employers with a proven track record of hiring women?

Click here to explore opportunities now!
We Value Your Feedback

You are invited to participate in a survey designed to help researchers understand how best to match workers to the types of jobs they are searching for

Would You Be Likely to Participate?

If selected, we will contact you via email with further instructions and details about your participation.

You will receive a $7 payout for answering the survey.


User unblocked successfully
https://bayt.page.link/C9e98xZHdRkDuN4J7
Back to the job results
Remote
Other Business Support Services
Create a job alert for similar positions
Job alert turned off. You won’t receive updates for this search anymore.

Job description

Job Summary: As an AI Architect you will build AI-native products.
You’ll lead cross-functional Innovation Delivery Squads—owning outcomes end-to-end across web, mobile, AI agents, and streaming backends.
You’re a hands-on technical leader who can scope, architect, staff, and ship; then run the product safely at scale.
Job Responsibilities: Stand up and run squads (Discovery → Prototype → Product → Platform & SRE).
Design and ship RAG/agent systems: pick models (e.
g., Anthropic Claude, OpenAI, Google, or open-weights like Llama/Mistral), define tools/functions, and choose retrieval (default Postgres + pgvector, scale to Weaviate/Qdrant/Pinecone when needed).
Operate AI safely: evals & guardrails, structured outputs (JSON/Schema), PII redaction, refusal policies, cost/latency budgets, and LLM observability.
Own delivery outcomes: SLOs, quality, cost, velocity; release with feature flags and canaries.
Be client-facing: discovery, scoping, SoW, roadmap, QBRs.
Hire/coach Tech Leads, EMs, and PMs; level up practices.
8–12+ yrs engineering; 4+ yrs leading multi-team delivery; shipped production web/mobile systems at scale.
Shipped at least one production AI app using Claude/GPT/Gemini/Llama/Mistral, backed by retrieval (pgvector or a vector DB) and a basic eval/guardrail pipeline.
Implemented orchestration (LangGraph/DSPy or Temporal for durable workflows), rerankers (e.
g., Cohere/Jina/Voyage), and prompt/tool versioning.
Built with modern cloud + data: serverless/K8s, Terraform, OpenTelemetry, feature flags/experimentation.
Excellent client communication and commercial sense (SoWs, staffing, utilization).
Tech stack (you have hands on experience) Models: Anthropic Claude; OpenAI; Google; open-weights (Llama, Mistral).
Orchestration & agents: LangGraph (or DSPy) for graphs; Temporal for durable, long-running tasks and SLAs.
Retrieval: Postgres + pgvector (default); Weaviate/Qdrant/Pinecone when scale/ops require; hybrid search with OpenSearch/Typesense.
Embeddings / rerankers: OpenAI/Voyage/E5/BGE; Cohere/Jina/Voyage rerank.
Guardrails & evals: JSON/Pydantic schemas, red-team sets, promptfoo/Ragas/DeepEval; content/PII filters.
Observability: OpenTelemetry traces incl.
prompt/tool spans; Langfuse/Arize Phoenix (or equivalent) + Sentry/Grafana.
App & data: Next.
js 15 (RSC), TypeScript/Go/Python; Postgres; Kafka/Redpanda/NATS; dbt/lakehouse optional.
Ops: Cloud Run/ECS/K8s; Terraform/OpenTofu; GitHub Actions; LaunchDarkly/Unleash; Statsig/GrowthBook.

This job post has been translated by AI and may contain minor differences or errors.

You’ve reached the maximum limit of 15 job alerts. To create a new alert, please delete an existing one first.
Job alert created for this search. You’ll receive updates when new jobs match.
Are you sure you want to unapply?

You'll no longer be considered for this role and your application will be removed from the employer's inbox.