Position responsibilities:
Design and develop Infrastructure-As-Code solutions to streamline the management and orchestration of our multi-cloud production environment utilizing many different AWS / GCP services.
Work closely with developers and architects to implement state of the art ML and AI technologies.
Research and evaluate emerging and existing technologies and develop tools and processes to reduce the operational toil on the infrastructure team.
Implement SRE principles to enhance security, availability, reliability and cost optimization across the organization.
Position Requirements:
At least 5 years of experience in a Devops / Cloud Infrastructure engineer role, developing highly available and scalable infrastructure for SaaS products and microservice architectures.
Significant experience deploying and operating Kubernetes in production at scale – a must.
Significant experience with cloud infrastructure automation tools such as Terraform, Pulumi, Spinnaker, Helm and Ansible. – a must.
Full proficiency in at least one programming language (preferably Python).
Experience developing custom monitoring solutions utilizing technologies such as Prometheus, Graphite and InfluxDB.
Experience building beautiful and efficient CI/CD pipelines utilizing IaaC and GitOps principles.
Solid IT fundamentals – TCP/IP, IPSec, Linux, Containers.
Very strong analytic and troubleshooting skills.
Tech Stack:
AWS / GCP
Python
Kubernetes – EKS /GKE
Terraform
Helm
Knative /Airflow/Kubeflow
Prometheus
Jenkins
ArgoCD