AI Performance Software Engineer
6 Days Old
This range is provided by Signify Technology. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.
Base pay range $220,000.00/yr - $300,000.00/yr
Direct message the job poster from Signify Technology
Senior Recruitment Consultant| Focusing On Critical Hires For Embedded Software, Electrical Engineer, Machine Learning, Computer Vision | @Signify… AI Performance Engineer – CUDA & PyTorch Focus
Location: San Fransisco, CA
Compensation: $200,000-$300,000
A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.
This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.
What You’ll Do:
Drive core research and implementation of performance optimizations for modern AI models
Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
Design and build scalable, distributed compute strategies across GPU-based systems
Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency
What We're Looking For:
Strong background in CUDA and low-level GPU performance tuning
Proven experience building with PyTorch and deploying high-performance ML models
Proficiency in Python and C++
Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
Exposure to AI compilers or frameworks like MLIR is a plus
Interest in system design, scalability, and accelerating LLM workloads in real production environments
If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.
Reach out to learn more.
Seniority level Seniority level Mid-Senior level
Employment type Employment type Full-time
Job function Job function Research and Science
Industries IT System Custom Software Development, Software Development, and Research Services
Referrals increase your chances of interviewing at Signify Technology by 2x
Sign in to set job alerts for “Performance Engineer” roles. San Francisco, CA $90,000.00-$250,000.00 1 year ago
AI Engineer & Researcher - Pre-training Scaling, Data, and Eval San Francisco, CA $77,880.00-$116,320.00 4 days ago
San Francisco, CA $140,670.00-$195,400.00 5 days ago
San Mateo, CA $120,000.00-$160,000.00 2 weeks ago
San Francisco, CA $88,000.00-$140,000.00 1 month ago
San Francisco, CA $180,000.00-$198,000.00 20 hours ago
San Francisco, CA $90,000.00-$150,000.00 2 weeks ago
San Francisco, CA $130,000.00-$230,000.00 6 months ago
San Francisco, CA $90,000.00-$150,000.00 2 weeks ago
Founding Engineer (LLM Performance Optimization) San Francisco, CA $140,000.00-$150,000.00 5 days ago
San Francisco, CA $115,000.00-$185,000.00 4 days ago
AI/ML Developer Relations - US (San Francisco) San Francisco, CA $150,000.00-$230,000.00 4 months ago
San Francisco, CA $130,000.00-$160,000.00 5 months ago
Perception Software or Machine Learning Engineer San Francisco, CA $18,000.00-$36,000.00 1 month ago
San Francisco, CA $190,000.00-$250,000.00 14 hours ago
San Francisco, CA $100,000.00-$300,000.00 1 year ago
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr
- Location:
- San Francisco, CA, United States
- Job Type:
- FullTime
- Category:
- IT & Technology