Sr. Research Engineer, Machine Learning, AGI Foundations
40 Days Old
Sr. Research Engineer, Machine Learning, AGI Foundations Job ID: 2894216 | Amazon.com Services LLC
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Senior SDE with a strong machine learning background, to lead the development of industry-leading models with multimodal systems.
As a Senior SDE with the AGI team, you will be responsible for leading the development of novel algorithms and modeling techniques to advance the state of the art with multimodal systems. Your work will directly impact our customers and will leverage Amazon’s heterogeneous data sources and large-scale computing resources to accelerate development with multimodal Large Language Models (LLMs) and Generative Artificial Intelligence (Gen AI). You will have significant influence on our overall strategy by working at the intersection of engineering and applied science to scale pre-training workflows and build efficient models. You will drive the system architecture and spearhead the best practices that enable a quality infrastructure.
The ideal candidate is clearly passionate about new opportunities and has a demonstrable track record of success in delivering new features and products. A commitment to teamwork, hustle, and strong communication skills (to both business and technical partners) are absolute requirements. Creating reliable, scalable, and high-performance products requires exceptional technical expertise, a sound understanding of the fundamentals of Computer Science, and practical experience building large-scale distributed systems. This person has thrived and succeeded in delivering high-quality technology products/services in a hyper-growth environment where priorities shift fast.
Key job responsibilities:
Responsible for pre-training multimodal LLMs.
Work closely with Applied scientists to scale pre-training of machine learning models on GPUs while optimizing the training workflows using highly distributed training techniques and frameworks (like FSDP, NVIDIA NeMo, Megatron Core, etc).
Investigate design approaches, prototype new technology, and evaluate technical feasibility.
Work in an Agile/Scrum environment to deliver high-quality software against aggressive schedules.
BASIC QUALIFICATIONS 5+ years of non-internship professional software development experience.
5+ years of programming with at least one software programming language experience.
5+ years of leading design or architecture (design patterns, reliability, and scaling) of new and existing systems experience.
Experience as a mentor, tech lead, or leading an engineering team.
2+ years of expertise in Machine Learning and model training.
PREFERRED QUALIFICATIONS 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience.
Bachelor's degree in computer science or equivalent.
Expertise in training Generative AI vision models.
#J-18808-Ljbffr
- Location:
- San Francisco, CA, United States
- Salary:
- $250,000 +
- Category:
- IT & Technology