Senior Site Reliability Engineer

2 weeks ago


Cape Town, Western Cape, South Africa Sana Commerce Full time
Company Description

At Sana Commerce, we're committed to creating an inclusive environment because we know our diverse workforce is one of our greatest strengths.

What started in 2007 with a pizza and a plan has grown into a fast-moving SaaS company that helps manufacturers, distributors, and wholesalers thrive amid the complexity of B2B commerce.

Our mission? To transform the way businesses buy and sell, so they can grow, build stronger relationships, and make the most of digital commerce. Join us and take ownership of your career in a dynamic, fast-moving environment.

At Sana Commerce, we're looking for a Senior Site Reliability Engineer to strengthen our reliability, observability, and automation capabilities across our Azure and Kubernetes-based platforms. This role blends hands-on operational excellence with engineering practices, ensuring uptime today while building the systems that make tomorrow more resilient.

This SRE position focuses on engineering reliability in everything we do: automating repetitive tasks, improving monitoring signals, running deep root cause analysis, and shaping systems for scalability. You'll be the engineer others look to during critical incidents, and the one raising the bar on how we prevent them in the first place.

What you'll get:

  • The opportunity to make an impact at a fast-growing SaaS scale-up;
  • A global and customized onboarding program (rated 9.1/10 by previous hires);
  • A hybrid working model – 3 days from the office, 2 days from home.
Job Description

What you'll be doing

  • Lead incident response and root cause analysis by driving deep investigations, educating the team, and delivering actionable post-incident insights that prevent recurrence.
  • Manage Kubernetes and Azure environments by owning cluster configurations and platform usage, and by ensuring availability, cost efficiency, and security best practices.
  • Develop observability and monitoring strategies with Dynatrace, Honeycomb, Elasticsearch, Kibana/Grafana, and Azure Monitor to measure performance and user impact, and continuously refine alerts and dashboards.
  • Implement and maintain edge and CDN integrations (Fastly WAF, bot management, CDN) to enhance performance, security, and reliability of customer-facing services.
  • Write and debug automation scripts in PowerShell, Bash, Python, or C#, ensuring logging, rollback, and versioning practices make the platform more resilient and self-healing.
  • Drive Infrastructure-as-Code adoption with Terraform, Bicep, and ARM to standardize environments, automate deployments, and reduce manual interventions.
  • Optimize system and application performance through deep monitoring, dump analysis, and right-sizing of resources to eliminate bottlenecks and maximize efficiency.
  • Collaborate across teams to break down complex problems, contribute to CI/CD and SDLC improvements, and embed reliability into development and release pipelines.
  • Participate in the on-call rotation by taking ownership of incidents, coordinating responses, and ensuring sustainable fixes rather than temporary workarounds.
Qualifications

What you bring

  • 5+ years of experience in SRE, DevOps, or Cloud Infrastructure, with demonstrated ownership of large-scale systems.
  • Strong hands-on knowledge of Microsoft Azure services and practical experience operating Azure Kubernetes clusters in production.
  • Expertise in Dynatrace, Honeycomb, Elasticsearch, Kibana/Grafana, and Azure Monitor (KQL). Able to design actionable monitoring that leads to prevention, not just detection.
  • Proficient in at least one programming/scripting language (PowerShell, Bash, Python, or C#). Strong debugging and logging practices.
  • Hands-on experience with Infrastructure-as-Code (Terraform, Bicep, or ARM) to automate and manage cloud infrastructure.
  • Solid understanding of TCP/IP protocols and experience troubleshooting network issues in distributed systems.
  • Ability to go beyond surface fixes, identify patterns, and engineer permanent improvements.
  • Strong communicator who can work with cross-functional teams and explain complex issues simply.
  • Microsoft Certified: Azure Administrator Associate
  • CKA: Certified Kubernetes Administrator

Who we are:

So, what does it mean to be a part of the Sana Commerce team?

At Sana Commerce, our values guide how we work, collaborate, and drive success.

  • Champions of Our League. "We deliver lasting success, balancing quick wins and long-term value."

    We take pride in our unique product and extensive B2B knowledge and continuously strive to improve. No matter our role, we bring value every day, helping our customers and partners succeed.
  • Supercharge Our Customers. "We're revolutionizing B2B commerce together, helping our customers to lead and succeed."

    Our customers are at the heart of everything we do. We go beyond solutions, providing the tools and support they need to grow.
  • Determined to Grow. "We embrace challenges, growing and raising the bar for ourselves and our industry."

    We take on challenges, seek feedback, and keep learning. Every setback is a chance to improve and move forward.
  • Bold Together. "We dare to be bold because we have each other's back."

    We collaborate across teams and time zones, challenge the status quo, and support each other to achieve the best outcomes.




