Get AI-powered advice on this job and more exclusive features.
Do you have the following skills, experience and drive to succeed in this role Find out below.
This range is provided by Harrison Clarke. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.
Base pay range $210,000.00/yr - $250,000.00/yr
Join a trailblazing force in the sales automation and enablement industry, revolutionizing how enterprises drive revenue with state-of-the-art AI-powered solutions. Backed by top-tier investors, this rapidly scaling company is seeking a seasoned Site Reliability Engineer to spearhead the design and optimization of high-scale, high-impact infrastructure. Thrive in a dynamic, high-growth environment where your expertise will power a cutting-edge platform used by global enterprises.
About the Role
As an SRE, you’ll take ownership of shaping the infrastructure for a high-scale, AI-driven platform that processes massive data volumes and powers mission-critical workflows. This is a leadership role designed for seasoned platform or DevOps engineers who excel in high-growth environments and are passionate about scaling innovative, large-scale products that redefine industries.
What You'll Own
Architect and lead the development of robust, scalable infrastructure for core services, data pipelines, and AI/ML workloads, supporting millions of transactions across cloud (AWS) and on-prem environments.
Drive the adoption of Infrastructure as Code (IaC) with tools like Terraform and Ansible to ensure automated, consistent, and rapid deployments in a high-growth setting.
Design and optimize CI/CD pipelines using tools like GitHub Actions or ArgoCD to enable fast, reliable releases that keep pace with our rapidly expanding product.
Collaborate with AI teams to deploy and scale complex, GPU-intensive ML workloads, ensuring performance and reliability at enterprise scale.
Mentor engineering teams, establish technical standards, and champion best practices in system reliability, performance, and scalability in a high-velocity environment.
What You Need
10+ years of experience in platform, infrastructure, or DevOps/SRE roles, with 5+ years in a senior or lead capacity, ideally within high-growth tech companies or on large-scale, high-impact products.
Proven expertise in architecting and scaling infrastructure for rapidly growing platforms, with deep knowledge of AWS (or similar), Kubernetes, and IaC tools (Terraform, Ansible).
Advanced proficiency in scripting (e.g., Python, Bash) and CI/CD systems, with a track record of building pipelines for high-throughput, high-availability systems.
Hands-on experience with observability tools (Prometheus, Grafana, OpenTelemetry) and managing secure, compliance-driven environments at scale.
Strong leadership skills, with a demonstrated ability to drive infrastructure strategy, foster cross-team collaboration, and thrive in the fast-paced, dynamic environment of a high-growth company.
Why Join?
Solve high-impact problems that shape enterprise decision-making for a rapidly growing customer base.
Work with cutting-edge AI/ML technologies and NVIDIA DGX clusters, powering a platform that scales to meet global demand.
Lead with technical autonomy in a collaborative, high-caliber team that values innovation and excellence.
Gain equity in a high-growth, early-stage startup backed by world-class investors, with the opportunity to make a lasting impact on a market-leading product.
This Role May Not Suit You If:
You prefer stable, predictable environments over the fast-paced, ever-evolving world of a high-growth startup.
You’re not excited about hands-on coding alongside strategic leadership in a dynamic, high-scale product environment.
Seniority level Seniority levelMid-Senior level
Employment type Employment typeFull-time
Job function Job functionEngineering, Design, and Information Technology
IndustriesSoftware Development, Robotics Engineering, and IT System Custom Software Development
Referrals increase your chances of interviewing at Harrison Clarke by 2x
Inferred from the description for this job Medical insurance
Vision insurance
401(k)
Paid maternity leave
Child care support
Pension plan
Paid paternity leave
Student loan assistance
Disability insurance
Tuition assistance
Get notified about new Site Reliability Engineer jobs in San Francisco, CA .
San Francisco, CA $160,000.00-$180,000.00 2 weeks ago
Hayward, CA $100,000.00-$150,000.00 6 months ago
San Francisco, CA $200,000.00-$250,000.00 3 days ago
San Francisco, CA $175,000.00-$250,000.00 1 month ago
San Francisco, CA $150,000.00-$250,000.00 1 year ago
Distributed Systems Software Engineer - Public Cloud (Mid/Senior/Lead/Principal) San Francisco, CA $125,700.00-$334,600.00 1 day ago
San Francisco, CA $218,000.00-$240,000.00 2 weeks ago
Associate Site Reliability Engineer/Site Reliability Engineer Redwood City, CA $116,000.00-$168,000.00 2 weeks ago
San Francisco, CA $125,000.00-$175,000.00 2 months ago
Platform Engineer — Infra / Reliability Specialist San Francisco, CA $150,000.00-$300,000.00 11 months ago
Foster City, CA $160,000.00-$250,000.00 5 months ago
San Francisco, CA $160,000.00-$300,000.00 5 months ago
Senior Site Reliability Engineer, Supply San Francisco, CA $130,000.00-$238,000.00 2 weeks ago
Novato, CA $98,400.00-$145,620.00 1 month ago
Oakland, CA $130,000.00-$180,000.00 1 month ago
San Francisco, CA $150,000.00-$300,000.00 11 months ago
San Francisco, CA $200,000.00-$240,000.00 2 days ago
San Francisco, CA $150,000.00-$250,000.00 10 months ago
Site Reliability Engineer — GPU InfrastructureSite Reliability Engineer - Field Operations Redwood City, CA $129,000.00-$169,000.00 2 days ago
San Francisco, CA $133,800.00-$200,600.00 6 days ago
San Francisco, CA $120,000.00-$150,000.00 5 days ago
San Francisco, CA $149,998.00-$250,000.00 5 days ago
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr