AWS Site Reliability Engineer

16 hours ago


Cape Town, Western Cape, South Africa Prescient Full time R80 000 - R120 000 per year

Purpose of role:We're looking for an AWS Site Reliability Engineer (SRE) to help us build and operate highly reliable, secure, and scalable cloud platforms. This role is ideal for someone who thrives at the intersection of software engineering, cloud infrastructure, and operations, and enjoys automating everything.As an AWS SRE, you'll be a key player in shaping our cloud environment, mentoring engineers, and ensuring our AWS workloads are secure, cost-efficient, and always available.Duties and responsibilities:You will be responsible for building and operating resilient infrastructure, automating operational processes, and driving continuous improvements in system performance and availability. This role requires a balance of hands-on technical expertise, problem-solving skills, and a passion for delivering highly reliable services that support critical business operations.Reliability & UptimeDesign, implement, and maintain highly available and resilient AWS cloud infrastructure.Monitor system health and performance, ensuring services meet SLAs.Respond to and resolve production incidents, performing root cause analysis and implementing long-term fixes.Automation & ScalabilityBuild automation for deployment, monitoring, scaling, and recovery using Infrastructure as Code (Terraform, AWS CDK, CloudFormation).Automate repetitive operational tasks to reduce toil and improve system reliability.Implement CI/CD pipelines to ensure smooth and reliable delivery of applications.Monitoring & ObservabilityConfigure and manage observability solutions (CloudWatch, Grafana, etc.).Define and track Service Level Indicators (SLIs) and Objectives (SLOs).Develop proactive alerting and anomaly detection mechanisms.Security & ComplianceApply AWS security best practices, including IAM governance, secrets management, encryption, and compliance monitoring.Work closely with InfoSec teams to ensure systems adhere to regulatory standards (e.g., PCI DSS, POPIA, GDPR, ISO27001).Perform regular audits of cloud resources, ensuring alignment with organizational policies.Performance & Cost OptimizationContinuously optimize cloud infrastructure for performance, efficiency, and cost-effectiveness.Analyse usage patterns and right-size resources or recommend reserved/spot instances where appropriate.Provide visibility into AWS spend and assist teams in cost governance.Incident & Problem ManagementDrive post-incident reviews, documenting learnings and improving runbooks.Develop self-healing and fault-tolerant systems to minimize impact of failures.Collaboration & Continuous ImprovementPartner with development teams to embed reliability, scalability, and observability into applications.Advocate and implement SRE best practices across the organization.Mentor engineers on AWS, DevOps, and reliability engineering practices.Required experience:Strong experience (> 5 years) with AWS services (EC2, ECS/EKS, Lambda, RDS, DynamoDB, S3, CloudFront, VPC, Route 53, IAM).Expertise in Infrastructure as Code (Terraform, AWS CDK, CloudFormation).Proficiency in monitoring & observability tools (CloudWatch, Grafana, ELK/OpenSearch).Experience with CI/CD pipelines (GitHub Actions, GitLab CI, AWS Code Pipeline).Knowledge of containerization & orchestration (Docker, Kubernetes, ECS, EKS).Strong scripting/coding skills (Python, Bash, Go, etc.).Experience with incident management & on-call operations.Required Qualifications:AWS Professional certifications.Experience running Kubernetes/EKS in production.Knowledge of compliance frameworks (ISO27001, SOC2, PCI-DSS, POPIA).Key competencies:Problem-solving mindset with focus on root cause analysis and prevention.Strong communication skills to collaborate across engineering, security, and business teams.Ability to prioritize reliability, scalability, and performance in production systems.Continuous improvement mindset, with passion for automation and efficiency.Why this role:As an AWS Site Reliability Engineer, you'll be at the centre of our mission to deliver secure, reliable, and scalable digital services. This is not just about keeping systems running — it's about designing for resilience, driving automation, and enabling our business to innovate at speed while staying safe and compliant.



  • Cape Town, Western Cape, South Africa LexisNexis Full time R1 000 000 - R2 500 000 per year

    About Our TeamLexisNexis Legal & Professional, which serves customers in more than 150 countries with 11,800 employees worldwide, is part of RELX, a global provider of information based analytics and decision tools for professional and business customers. Our company has been a long-time leader in deploying AI and advanced technologies to the legal market to...


  • Cape Town, Western Cape, South Africa Mind Detect Full time

    Our ultra-modern, scaling, payments platform client is seeking aSite Reliability Engineer(SRE) to join their world-class Engineering team, located inCape Town(hybrid). Due to their unique market positioning and backing by world-leading payment companies, VCs and fintech platforms alike, they are set for high growth and expansion in the coming years.AsSRE,...


  • Cape Town, Western Cape, South Africa Electrum Software Full time R750 000 - R1 200 000 per year

    Electrum is a next-generation payment software technology company.Since 2012, we've delivered trusted, enterprise-grade, cloud-native software to optimise financial transaction processing. Our deep expertise has established us as a respected partner in high-volume, low-value payment schemes, enabling clients to deliver services to millions of South Africans...


  • Cape Town, Western Cape, South Africa Zensar Technologies Full time R1 200 000 - R2 400 000 per year

    Zensaris hiring for a skilled and proactiveSite Reliability Engineer(SRE) with8 to 10 yearsexperience.The SRE will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure.This role blends software engineering with IT operations, to build fault-tolerant, self-healing systems and drive continuous improvement...

  • AWS Cloud Engineer

    15 hours ago


    Cape Town, Western Cape, South Africa DYNAMIC VISUAL TECHNOLOGIES LIMITED Full time R900 000 - R1 200 000 per year

    Company DescriptionDYNAMIC VISUAL TECHNOLOGIES LIMITED is a computer software company headquartered in Johannesburg, South Africa. The company offers innovative software solutions and services to meet various industry needs. With a focus on delivering quality and excellence, DYNAMIC VISUAL TECHNOLOGIES LIMITED provides a platform for professionals to...

  • AWS DevOps Engineer

    1 week ago


    Cape Town, Western Cape, South Africa DT Projects SA Full time R75 000 - R750 000 per year

    Job Title:AWS DevOps EngineerSalary:R75,000 per month (gross basic) + pension contributionArea:Cape TownType:Onsite initially, then move to HybridStart Date:5 January 2026SummaryWe're looking for an AWS DevOps Engineer to help shape and protect a cloud-native engineering environment built on AWS. You'll take ownership of core infrastructure, refine CI/CD...


  • Cape Town, Western Cape, South Africa Amazon Web Services (AWS) Full time R1 200 000 - R3 600 000 per year

    DescriptionAWS is hiring a Product Manager, to develop, maintain and improve our operational business model for the new Skills Center program. The AWS Skills Center team engages both Individual Learners and Organizations in targeted audience segments and geographies across EMEA.In this role you will be responsible for managing the business KPIs and...


  • Cape Town, Western Cape, South Africa Sana Commerce Full time R1 500 000 - R2 500 000 per year

    Company DescriptionWhat started in 2007 with a pizza and a plan has grown into a fast-moving SaaS company empowering manufacturers, distributors, and wholesalers to thrive in complex B2B commerce.Our mission is simple: help businesses build stronger relationships through seamless digital commerce.At Sana Commerce, you'll join a team that's bold,...

  • Site Reliability

    6 days ago


    Cape Town, Western Cape, South Africa Canonical - Jobs Full time R80 000 - R120 000 per year

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers,...


  • Cape Town, Western Cape, South Africa Canonical - Jobs Full time R600 000 - R1 200 000 per year

    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and...