Lead Site Reliability Engineer

New Yesterday

Salary: Lead Site Reliability Engineer (SRE) Location: Dallas
Join CellPoint Digital: Shape the Future of Payments with Us! At CellPoint Digital, were revolutionizing the way businesses in the air, travel, and hospitality sectors manage their payments. With our Leading Payment Orchestration Platform, were turning payments into a strategic advantage, helping clients optimize their payment experience to boost profits, maximize approvals, lower costs and take control of their payment, resulting in more money to the bottom line. We believe payments should be a strategic asset, delivering financial, customer, and operational value. Our vision is to unify the payment ecosystem, opening up a world of opportunities for leading brands in the air, travel, and hospitality industries. We transform the payment supply chain from a cost center into a profit engine, turning every transaction into an opportunity for growth and competitive advantage. At our core, we're innovators and problem-solvers united by five key values: Mission First, Ownership, Trust & Transparency, Driven, and One Team. We're ambitious professionals who embrace accountability and transform payments together. Our diverse community spans the globe, with hubs in Buenos Aires, Bogota, Copenhagen, Dallas, Dubai, London, Mexico, Miami, Pune, Singapore, and Sofia, along with remote team members worldwide. We celebrate the unique perspectives and experiences that make our team extraordinary. Join us as a Site Reliability Engineer (SRE) on our mission to turn payments into possibilities! Lead Site Reliability Engineer (SRE) As an SRE at CellPoint Digital, youll be a key player in ensuring our payment platform runs reliably, securely, and at scaleprocessing thousands of payments per second. Working closely with our Product, Development, and Architecture teams, youll blend hands-on operational excellence with a software engineering mindset to drive automation, observability, and reliability across our global infrastructure. Youll be a leader of this function so youll be able to develop and grow youre leadership capability, and already have; Strategic ownership not just execution. Leadership and mentoring guiding a team and setting direction. Cross-functional influence aligning infrastructure goals with business and engineering objectives. Ownership of reliability vision driving long-term improvements, not just day-to-day operations. Your Impact As a Lead Site Reliability Engineer, you will drive the vision and execution of reliability across our platform. You will be accountable for the performance, scalability, and resilience of our systems, while also mentoring and enabling a high-performing team of engineers. You will: Provide technical and strategic leadership to ensure the health and reliability of our production environment. Architect and drive development of software and systems that improve infrastructure reliability and operational excellence. Lead initiatives that improve system uptime, performance, and delivery velocity across our payment platforms. Partner cross-functionally to influence product, platform, and architectural decisions with a reliability-first mindset. Guide and refine incident response practices, ensuring issues are swiftly resolved and learnings are systematized. Define and evolve our SRE strategy, embedding reliability principles into engineering culture and delivery processes. Champion a metrics-driven approach to system health using SLAs, SLOs, and error budgets. Partner closely with Product, Architecture, and Platform Engineering to shape infrastructure roadmaps aligned with company goals. Lead operational excellence by enabling automated, scalable solutions to prevent incidents and reduce toil. Mentor and upskill other SREs and engineers through technical leadership and coaching. Skills you will have fine-tuned: Proven experience as a senior or lead-level SRE owning the reliability of large-scale, distributed systems. Strong collaboration with Product, Engineering, and Architecture teams to define SLAs and system reliability targets. Leadership in release engineering, working alongside Release Managers to ensure smooth and reliable deployments. Expertise in incident management and postmortem culture; you champion learning from failures to drive systemic improvements. Track record of proactive reliability engineering, including building tools, automation, and monitoring systems to prevent outages. You design systems that are resilient by default and promote a culture of boringly reliable operations. You optimize infrastructure performance with data-driven decisions, rather than gut feel. You lead by example, documenting and automating so your knowledge scales across the team. You thrive in complex environments, debugging across the stack, from network to application. You look ahead, planning for infrastructure growth and scaling challenges before they become urgent. Deep knowledge of our modern stack or equivalents: Cloud Provider: Google Cloud Platform (GKE, Apigee, Cloud Storage, Load Balancers, etc.) Kubernetes and container orchestration Infrastructure as Code: Terraform, Config Connector, Helm, Skaffold CI/CD Pipelines: GitHub Actions, Cloud Build/Deploy Observability: Grafana, Prometheus Durable Execution: Temporal or similar platforms What's in it for you: We offer you the opportunity to be an innovator, challenge the status quo, and redefine the payments category Competitive salary in a fast-growing start-up Rewards & Recognition system Opportunity for personal and professional growth in a dynamic industry Work from anywhere in the world; we're a fully distributed company, and we provide the tools, culture, and support to make your work setup work for you Joining a scaling company that is growing and an opportunity to have great impact Occasional travel to Europe (UK, Copenhagen, Bulgaria)
What makes CellPoint Digital a leader in the payment landscape isnt just our technology - its our people and how we work together. Weve built a global community where diverse talents and perspectives unite to create innovative solutions. When you join us, you become part of something bigger: a collaborative culture that crosses borders and disciplines, bringing out the best in every team member to deliver breakthrough results for our clients and partners. Together, we are transforming the payments industry - challenging, supporting, and inspiring one another in the process.
Location:
Irving
Category:
Technology

We found some similar jobs based on your search