Senior AI Engineer, San Francisco, CA, United States

Senior AI Engineer

New Yesterday

At Orion, we’re building the intelligent infrastructure that powers modern financial advisory. As part of our mission to unify planning, investing, and client engagement, we’re looking for a Senior AI Engineer to rapidly deploy cutting-edge large language models (LLMs) and generative AI into production, powering scalable systems and customer experiences.This is a hands-on, product-facing role focused on shipping working AI features. You’ll work closely with product managers, designers, and engineers to build systems that directly impact thousands of advisors and millions of investor accounts.Looking for candidates in the San Francisco, CA area.In this role, you’ll get to:Integrate LLMs and generative AI into advisor facing products and workflowsDesign and build RAG systems using internal and external data sourcesApply techniques like prompt engineering, fine-tuning (e.g. LoRA), and custom embeddings to optimize domain-specific performanceEvaluate and productionize open-source and proprietary models (GPT-4, Claude, LLaMA, Mistral, etc.)Develop APIs and services to deliver AI- powered features at scaleCollaborate across product and engineering teams to deliver rapidly and reliablyContinuously measure AI feature quality: accuracy, latency, and user impactWe’re looking for talent who:Has proven experience working with large language models (e.g. OpenAI GPT, Claude, LLaMa)Has familiarity with embedding models, understanding tokenization, prompt engineering and fine-tuning (e.g., LoRA).Has practical experience with at least 1 vector database: Pinecone, Weaviate, FAISSHas strong proficiency with orchestration tools like LangChain, Haystack or LlamaIndex for building RAG systemsHas ability to design and implement retrieval-augmented generation (RAG) systems.Has experience in data preprocessing, chunking and vectorization pipelines5+ years in software engineering, with 2+ years in applied ML or AIHas deep understanding of LLMs, embeddings, RAG architecture, and vector searchHas strong grasp of prompt design, fine-tuning strategies, and model evaluationHas proficiency with tools such as: LangChain, LlamaIndex, Pinecone, Weaviate, OpenAI, Hugging Face, FastAPI, DockerHas strong engineering discipline and communication skills, especially in cross-functional settings#LI-AP1Salary Range:$125,336.00 - $196,765.00About this Opportunity:At Orion, we’re building the intelligent infrastructure that powers modern financial advisory. As part of our mission to unify planning, investing, and client engagement, we’re looking for a Senior AI Engineer to rapidly deploy cutting-edge large language models (LLMs) and generative AI into production, powering scalable systems and customer experiences.This is a hands-on, product-facing role focused on shipping working AI features. You’ll work closely with product managers, designers, and engineers to build systems that directly impact thousands of advisors and millions of investor accounts.Looking for candidates in the San Francisco, CA area.In this role, you’ll get to:Integrate LLMs and generative AI into advisor facing products and workflowsDesign and build RAG systems using internal and external data sourcesApply techniques like prompt engineering, fine-tuning (e.g. LoRA), and custom embeddings to optimize domain-specific performanceEvaluate and productionize open-source and proprietary models (GPT-4, Claude, LLaMA, Mistral, etc.)Develop APIs and services to deliver AI- powered features at scaleCollaborate across product and engineering teams to deliver rapidly and reliablyContinuously measure AI feature quality: accuracy, latency, and user impactWe’re looking for talent who:Has proven experience working with large language models (e.g. OpenAI GPT, Claude, LLaMa)Has familiarity with embedding models, understanding tokenization, prompt engineering and fine-tuning (e.g., LoRA).Has practical experience with at least 1 vector database: Pinecone, Weaviate, FAISSHas strong proficiency with orchestration tools like LangChain, Haystack or LlamaIndex for building RAG systemsHas ability to design and implement retrieval-augmented generation (RAG) systems.Has experience in data preprocessing, chunking and vectorization pipelines5+ years in software engineering, with 2+ years in applied ML or AIHas deep understanding of LLMs, embeddings, RAG architecture, and vector searchHas strong grasp of prompt design, fine-tuning strategies, and model evaluationHas proficiency with tools such as: LangChain, LlamaIndex, Pinecone, Weaviate, OpenAI, Hugging Face, FastAPI, DockerHas strong engineering discipline and communication skills, especially in cross-functional settings#LI-AP1Salary Range:$125,336.00 - $196,765.00The pay listed in this posting indicates the estimated pay at the time of this posting; however, may vary depending on geographic location, job-related knowledge, skills, and experience. In addition, Orion offers a competitive benefits package which includes health, dental, vision, and disability coverage on day one, 401(k) plan with employer match, paid parentalleave, pet benefits including pawternity leave and pet insurance, student loan repayment and more.About UsAt Orion, we achieve our best work when we support one another, staying personally accountable to each other and the clients we serve. We create a welcoming environment where everyone is respected, valued, and heard. Our commitment to create raving fans ensures we consistently exceed client expectations. Thinking differently is in our DNA—we innovate always, push boundaries, and reject the status quo to deliver transformative outcomes. Together, we support one another and see it through to success, driving our collective achievements and those of our clients. #J-18808-Ljbffr

Apply

Location:: San Francisco, CA, United States