Astro Sirens is an IT staffing agency based in Austin, Texas.
We connect talented professionals from around the world with U.
S. companies, offering exciting opportunities to work on innovative, high-impact projects.
We are currently seeking a Senior AI/ML Engineer with strong experience in modern AI technologies—including Large Language Models (LLMs), Generative AI, and intelligent agent systems—to design and deploy cutting-edge AI solutions.
Responsibilities Design, develop, and deploy AI/ML solutions leveraging LLMs, NLP, and Generative AI Build and optimize Retrieval-Augmented Generation (RAG) pipelines using vector databases and embedding models Develop agentic AI systems (multi-step reasoning, tool use, orchestration frameworks) Fine-tune, prompt-engineer, and evaluate large language models for production use cases Build scalable, end-to-end ML/AI pipelines including data ingestion, preprocessing, model training, and deployment Integrate AI solutions into applications via APIs and microservices Collaborate with cross-functional teams (data engineers, product managers, and business stakeholders) to define AI-driven solutions Implement model monitoring, evaluation frameworks, and guardrails (bias, hallucination mitigation, safety) Optimize models and pipelines for performance, scalability, and cost-efficiency in cloud environments Translate complex AI outputs into actionable insights for both technical and non-technical audiences Contribute to AI best practices, architecture decisions, and internal tooling Mentor junior engineers and guide teams on modern AI development patterns Paid Time Off (PTO) Work From Home Professional development opportunities Training & Development Programs Collaborative and inclusive company culture Competitive salary and performance-based bonuses Bachelor’s or Master’s degree in Computer Science, Data Science, AI, Statistics, or a related field 5+ years of experience in machine learning, data science, or applied AI roles Strong proficiency in Python and ML/AI ecosystems Hands-on experience with LLMs and GenAI frameworks (e.
g., OpenAI APIs, Hugging Face, LangChain, LlamaIndex, or similar) Solid experience with NLP techniques and transformer-based models Experience building RAG pipelines and working with vector databases (e.
g., Pinecone, Weaviate, FAISS) Experience designing or working with agentic workflows (tool calling, multi-agent systems, reasoning chains) Strong understanding of ML fundamentals (supervised/unsupervised learning, deep learning, evaluation metrics) Experience deploying models into production environments (APIs, batch/real-time systems) Familiarity with MLOps/LLMOps practices (model versioning, CI/CD, monitoring, prompt/version management) Strong SQL skills and experience with relational databases Experience with cloud platforms such as AWS, GCP, or Azure Understanding of AI safety, ethics, and data privacy considerations Strong communication skills and ability to work with U.
S.-based stakeholders Preferred Qualifications Experience with fine-tuning LLMs (LoRA, PEFT, or similar techniques) Familiarity with evaluation frameworks for LLMs (e.
g., human-in-the-loop, automated evals) Experience with Docker, Kubernetes , and scalable AI deployments Background in multi-modal AI (text, image, audio models) Experience with big data tools like Spark or distributed data processing Exposure to cost optimization strategies for LLM-based systems