Description | About Us: Soum is on a mission to revolutionize e-commerce in the MENA region and beyond by building the most convenient and trustworthy re-commerce marketplace in the region. We're reshaping how C2C marketplaces operate making buying and selling second-hand products seamless and reliable. Since our launch in July 2021 Soum has become one of the fastest-growing startups in the region achieving rapid expansion in both our team and product offerings. In recognition of our impact and growth we were proudly named one of the Top 10 LinkedIn Startups in Saudi Arabia for 2024.Role Overview: We are looking for a Mid level DevOps Engineer/Tech interested in building performant stable and resilient infrastructure. You will be responsible for architecting and building the infrastructure as well as coordinating with the teams responsible for other layers of the product infrastructure. Building a stable infrastructure is a highly collaborative effort and as such a strong team player with a commitment to perfection is required. ➡ Key Responsibilities: ➡ Take responsibility for the scalability stability and availability of our low-latency & mission-critical systemsEnhance the CI/CD pipeline using Github ActionsAble to generate and maintain helm manifests for KubernetesMaintain our IaCAble to setup and maintain monitoring via NewRelic Prometheus & GrafanaAbility to debug infrastructure and detect bottlenecksTake responsibility for company-wide whole tech infrastructure for all environments.Responsibility for managing vpn load balancers and firewallsResponsible for high availability and disaster recoveryImplementation of secure and stable infrastructureResponsibility for continuous improvement and cost optimizationResponsibility of 24x7 monitoring and resolving infrastructure tier incidents and escalate to upper tiers where necessaryResponsibility for investigating and clear documentation of incident reports Qualifications: ➡ Excellent understanding in AWS cloud Knowledge of containerized environments with Kubernetes & DockerAbility to develop well managed infrastructure setupDeep knowledge of Linux internals networking routing & protocolsStrong hands-on building High availability infrastructure using EKS on AWSStrong understanding in IaC specifically TerraformStrong understanding in GitOps specifically ArgoCDStrong understanding in securing production infrastructure Experience: ➡ Minimum 2 years of DevOps experience in production applicationsAt least 1 year working with TerraformAt least 1 year with monitoring tools (NewRelic Prometheus Grafana etc.)Experience designing and architecting high-performing applicationsBackground in Agile development practicesExperience in a fast-growing startup (preferred)AWS Certifications (preferred) ➡ |