AI Applications Architect, AI Services

Remote Full-time
Job Description: • Design and own cloud-native architectures (AWS/Azure) for agentic AI workloads using Kubernetes/EKS, Terraform, Docker, serverless APIs, AWS Batch, and async orchestration frameworks (Celery, Step Functions, EventBridge, StoneBranch). • Define agentic system patterns using LangChain, LangGraph, Autogen, LlamaIndex, Pinecone, and other multi-agent frameworks; ensure consistency of prompt/tool design, memory/state handling, and workflow orchestration. • Architect vector database, RAG, embeddings pipelines, and model-serving endpoints (LLM/SLM) with strong emphasis on scalability and latency management. • Establish platform-wide standards for API gateway patterns, identity and auth (OAuth2, Cognito, Vault), secrets management, event contracts/schemas, and data governance. • Ensure holistic observability across multi-agent systems: tracing, metrics, logging, SLO/SLA definitions, synthetic checks, and incident response playbooks. • Lead architecture reviews, threat modeling, and performance benchmarking for agentic workloads. • Guide engineering teams through architectural decisions, distributed design principles, and production-readiness standards. • Mentor engineers in Kubernetes/EKS, async programming, multi-agent orchestration, cloud-native development, and responsible AI practices. • Provide input on hiring, onboarding, and talent development to grow AHEAD’s agentic engineering bench. • Partner with Delivery Leads to ensure architecture is executable, scalable, and aligned with timelines. • Champion automation, IaC, CI/CD, model deployment workflows, runbooks, and platform governance. • Lead sprint-level architectural alignment, backlog refinement, retrospectives, and post-incident reviews. • Work with Product Owners and client stakeholders to shape roadmaps, define technical scope, and convert ambiguous problem statements into actionable designs. • Communicate architectural decisions clearly to both technical and business audiences, balancing constraints, risks, and tradeoffs. • Embed platform security, compliance, cost optimization, and data integrity into all architectural decisions. Requirements: • 6+ years designing and delivering cloud-native, event-driven, or distributed architectures at scale (AWS/Azure). • Deep hands-on experience with: • Kubernetes/EKS, Docker, Terraform, and cloud infrastructure patterns • Python, FastAPI, async frameworks, serverless APIs • Vector DBs (Pinecone, Elasticsearch, pgvector) and RAG/LLM integration workflows • Agentic AI frameworks (LangChain, LangGraph, Autogen, CrewAI, LlamaIndex) • Strong knowledge of security, identity, devsecops pipelines, and secrets management in cloud environments. • Proven leadership experience guiding engineering teams, performing code/design reviews, and enforcing architectural best practices. • Excellent communication, stakeholder alignment, and documentation skills. • Experience operating LLMs/SLMs in production (NIMs, Bedrock, OpenAI, Azure OpenAI). • Experience with GPU clusters, inference optimization, or model-serving architectures (Ray, Triton, KServe). • Consulting or client-facing architecture experience. Benefits: • Medical, Dental, and Vision Insurance • 401(k) • Paid company holidays • Paid time off • Paid parental and caregiver leave • Plus more! See benefits for additional details. Apply tot his job
Apply Now →

Similar Jobs

**Experienced Entry-Level Remote Part-Time Apple Home Advisor: Customer Support Specialist for Apple Devices and Services**

Remote Full-time

Workday Architect -- GA

Remote Full-time

Architect of Software Engineering

Remote Full-time

Azure Technical Architect with Spanish Language-- CDC5712484

Remote Full-time

Sr. Software Engineer/Architect: WFM (Remote)

Remote Full-time

Principal Platform Architect- Insurance

Remote Full-time

Staff Security Engineer-Remote

Remote Full-time

Web App Pen Tester (San Diego or Irvine)

Remote Full-time

Penetration Tester Remote / Telecommute Jobs

Remote Full-time

Senior Penetration Tester - Web Application

Remote Full-time

**Experienced Remote Part-Time Data Entry Specialist – Join arenaflex's Dynamic Team for a Competitive Salary & Flexible Hours**

Remote Full-time

Kubernetes Engineer Remote

Remote Full-time

Hematology/Oncology Position-Work from Remote

Remote Full-time

Senior Account Executive, SaaS Sales

Remote Full-time

Global Payroll Customer Implementation Success Manager

Remote Full-time

Mobile App Developer Needed (AI + OCR + SMS Integration) — Safety-Critical Field App

Remote Full-time

Remote Data Entry Specialist - Accurate Data Management for blithequark's Global Financial Excellence

Remote Full-time

WDI Creative Development Inclusive Strategies Intern, Spring 2026 Glendale, CA, USA

Remote Full-time

Machine Learning Engineer -Remote Job at YO IT CONSULTING in United

Remote Full-time

HR Manager - Multi-State Veterinary Practice

Remote Full-time
← Back to Home