Senior AI Engineer, NeMo Retriever - Model Optimization and MLOps

Join us at NVIDIA as a Senior AI Engineer, NeMo Retriever - Model Optimization and MLOps, and be part of the AI revolution that powers self-driving cars, robotics, co-pilots, and more.

NVIDIA's NeMo Retriever is a collection of NIMs for building multimodal extraction, re-ranking, and embedding pipelines with high accuracy and data privacy. We are seeking an AI Engineer to focus on machine learning development, performance optimization, and MLOps, working on innovative hardware and software platforms for Generative AI, LLM, MLLM, and RAG workflows.

What You'll Be Doing

Develop and maintain NIMs that containerize optimized models using OpenAPI standards with Python or similar performant languages.
Collaborate with partner teams to understand requirements, build and evaluate POCs, and develop production roadmaps.
Enable development of integrated AI Blueprints providing a unified, turnkey experience.
Build and maintain Continuous Delivery pipelines for faster, safer deployments.
Conduct peer reviews focusing on performance, scalability, and correctness.

What We Need To See

Bachelor’s or Master’s Degree in Computer Science, Engineering, or related field (or equivalent experience).
8+ years of relevant experience.
Proficiency in Python and Deep Learning frameworks like PyTorch.
Experience with cloud software delivery, cloud infrastructure, and MLOps tools such as Docker, Kubernetes, Helm.
Familiarity with ML libraries like PyTorch, TensorRT, or TensorRT-LLM.
Deep understanding of NLP, LLM, MLLM, Generative AI, and RAG workflows.
Self-motivated with a passion for learning and sharing knowledge.
Enthusiasm for emerging technologies and innovation.

We offer competitive salaries, benefits, and a diverse, inclusive work environment. The salary range is $184,000 - $356,500, based on experience and location.

NVIDIA is an equal opportunity employer.

Additional Details

Seniority level: Mid-Senior level
Employment type: Full-time
Industries: Hardware, Software, Electronics

Referrals can increase your chances of interview success. We accept applications continuously.

Location:
Washington, DC, United States