System Engineer (kubernetes)

New Today

We are seeking an experienced On-Prem Kubernetes Systems Engineer to manage and support our self-hosted Kubernetes infrastructure. This role will involve maintaining cluster health, troubleshooting both application and infrastructure-level issues, and developing new features and automation to improve the performance and reliability of services running in the Kubernetes environment.
Check you match the skill requirements for this role, as well as associated experience, then apply with your CV below. Key Responsibilities: • Deploy, maintain, and scale on-premises Kubernetes clusters. • Troubleshoot and resolve issues across the full stack-containers, pods, nodes, networking, and storage. • Collaborate with application developers to identify and fix deployment and runtime issues. • Develop tools, operators, and automation to streamline workflows and cluster operations. • Monitor cluster performance and resource usage; implement enhancements to improve reliability and scalability. • Write and maintain Helm charts, manifests, and CI/CD pipelines for Kubernetes-based applications. • Stay current with Kubernetes ecosystem trends and recommend improvements. • Document infrastructure components, design decisions, and troubleshooting procedures. Required Qualifications: • Proven experience managing on-prem Kubernetes clusters (Kubeadm, RKE, etc.). • Proficiency in container technologies like Docker and containerd. • Strong understanding of Linux systems, networking, and storage as they relate to Kubernetes. • Experience with monitoring tools (Prometheus, Grafana, etc.). • Familiarity with CI/CD tools (e.g., GitLab CI, Jenkins, ArgoCD, FluxCD). • Experience writing Helm charts, Kustomize, or other templating tools. • Strong troubleshooting and debugging skills across application and infrastructure layers. • Scripting skills (Bash, Python, or Go preferred). Preferred Qualifications: • Experience with service mesh (Istio, Linkerd), ingress controllers (NGINX, Traefik), or internal DNS. • Familiarity with GitOps and Infrastructure as Code principles. • Exposure to secure Kubernetes practices (RBAC, PodSecurityPolicies, NetworkPolicies). • Experience supporting high-availability and disaster recovery configurations
Location:
Austin, TX
Category:
Healthcare And Medical

We found some similar jobs based on your search