About the Role
We are seeking a skilled engineer with exceptional DevOps skills to join our team. Responsibilities include automating and scaling Big Data and Analytics technology stacks on Cloud infrastructure, building CI/CD pipelines, setting up monitoring and alerting for production infrastructure, and keeping our technology stacks up to date.
What you'll be doing:
Develop best practices for cloud infrastructure provisioning and disaster recovery, and guide developers on their adoption
Scale Big Data and distributed systems
Collaborate on system architecture with developers for optimal scaling, resource utilization, fault tolerance, reliability, and availability
Conduct low-level systems debugging, performance measurement & optimization on large production clusters and low-latency services
Create scripts and automation that can react quickly to infrastructure issues and take corrective actions
Participate in architecture discussions, influence the product roadmap, and take ownership of and responsibility for new projects
Collaborate and communicate with a geographically distributed team
We're excited if you have:
Bachelor’s degree, or equivalent work experience
8+ years of experience in DevOps or Site Reliability Engineering
Experience with cloud infrastructure such as Amazon Web Services (AWS), Google Cloud Platform (GCP), Microsoft Azure, or other public cloud platforms. GCP is preferred.
Experience with at least 3 of the following technologies/tools: Big Data / Hadoop, Kafka, Spark, Airflow, Presto, Druid, OpenSearch, HAProxy, or Hive
Experience with Kubernetes and Docker
Experience with Terraform
Strong background in Linux/Unix
Experience with system engineering around edge cases, failure modes, and disaster recovery
Experience with shell scripting, or equivalent programming skills in Python
Experience working with monitoring and alerting tools such as Grafana and PagerDuty, and participating in on-call rotations
Experience with Chef, Puppet, or Ansible
Experience with Networking, Network Security, and Data Security
Benefits
Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Our employees can take time off work for vacation and other personal reasons to balance their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.
The Roku Culture
Roku is a great place for people who want to work in a fast-paced environment where everyone is focused on the company's success rather than their own. We try to surround ourselves with people who are great at their jobs, who are easy to work with, and who keep their egos in check. We appreciate a sense of humor. We believe a smaller number of very talented people can do more, at lower cost, than a larger number of less talented people. We're independent thinkers with big ideas who act boldly, move fast, and accomplish extraordinary things through collaboration and trust. In short, at Roku you'll be part of a company that's changing how the world watches TV.
We have a unique culture that we are proud of. We think of ourselves primarily as problem-solvers, which itself is a two-part idea. We come up with the solution, but the solution isn't real until it is built and delivered to the customer. That penchant for action gives us a pragmatic approach to innovation, one that has served us well since 2002.