Site Reliability Engineer

1 week ago


WorkFromHome, South Africa Risingsun Softsol Full time

Risingsun is hiring an SRE (Site Reliability Engineer).

Work model: Hybrid – 2/3 days at the office per week
Open to visa holders: No – only SA citizens or SA ID holders
Employment type: 12-month contract, renewable
Location: JHB
Client type: Banking

We are looking for a skilled and proactive Site Reliability Engineer (SRE) with 5+ years of experience. The SRE will be responsible for ensuring the reliability, scalability, and performance of our systems and infrastructure. This role blends software engineering with IT operations to build fault-tolerant, self-healing systems and drive continuous improvement across our technology stack.

Required Skills & Qualifications

  • Proficiency in Core Java technology
  • Hands-on experience with Kubernetes and container orchestration
  • Strong understanding of CI/CD pipelines and tools (GitLab CI/CD, Jenkins)
  • Familiarity with monitoring tools and batch processing
  • Excellent problem-solving and communication skills
  • Ability to work in on-call rotations and respond to incidents effectively

Key Responsibilities

  • System Reliability & Availability: Design and maintain fault-tolerant architectures using redundancy, load balancing, and failover mechanisms. Monitor system health using observability tools and respond to incidents to minimize downtime.
  • Incident Management: Implement automated alerting and response systems. Conduct blameless postmortems and drive long-term improvements.
  • Automation & Tooling: Automate repetitive tasks using scripting and Infrastructure as Code (IaC) tools such as Terraform and Ansible. Develop and maintain internal tools for deployment, monitoring, and debugging.
  • Performance Monitoring: Use metrics, logs, and traces to identify and resolve performance bottlenecks. Build monitoring systems that alert on symptoms rather than outages.
  • Capacity Planning & Scalability: Analyze traffic patterns and infrastructure load to predict demand. Optimize resource allocation and implement scalable solutions.
  • Collaboration & Culture: Work closely with development, QA, and operations teams to foster a culture of shared responsibility. Promote transparency and continuous feedback loops.

Interested candidates can share their CVs.



  • WorkFromHome, South Africa Robin AI Full time

    Robin AI, City of Cape Town, Western Cape, South Africa. About Robin: Robin is on a mission to rebuild the legal industry, starting with...


  • WorkFromHome, South Africa DuckDuckGo Full time

    1 week ago. Who We Are: Hi, we're DuckDuckGo, the online protection company and remote-first team of 300+ on a mission to raise the standard of trust online. Founded in 2008 and profitable since 2014, our annual revenue now exceeds $100 million USD. Millions use our browser on Mac, Windows, iOS, and Android, our search engine,...


  • WorkFromHome, South Africa Canonical Full time

    Overview: Site Reliability Engineer role at Canonical. Global remote location. Canonical is a leading provider of open source software and operating systems to the enterprise and technology markets, known for Ubuntu and open source infrastructure platforms. We deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying...


  • WorkFromHome, South Africa Sana Commerce Full time

    Company Description: What started in 2007 with a pizza and a plan has grown into a fast-moving SaaS company empowering manufacturers, distributors, and wholesalers to thrive in complex B2B commerce. Our mission is simple: help businesses build stronger relationships through seamless digital commerce. At Sana Commerce, you’ll join a team that’s bold,...


  • WorkFromHome, South Africa DuckDuckGo Full time

    Who We Are: DuckDuckGo is an online protection company and remote-first team dedicated to raising the standard of trust on the web. Your Team & Role: As part of the Site Reliability Team, you will build and maintain world-class infrastructure that serves millions of users. Your work will involve high-level languages such as Perl, Go, and Python, and...


  • WorkFromHome, South Africa Sana Commerce Full time

    6 days ago. Company Description: What started in 2007 with a pizza and a plan has grown into a fast-moving SaaS company empowering manufacturers, distributors, and wholesalers to thrive in complex B2B commerce. Our mission is simple: help businesses build stronger...


  • WorkFromHome, South Africa Luno Full time

    A leading cryptocurrency platform based in Cape Town is seeking a Site Reliability Engineer to build and scale infrastructure, manage containerized environments using Kubernetes, and apply Infrastructure as Code principles. Candidates should have experience in DevOps roles and managing large infrastructure projects. The role offers flexible working options...


  • WorkFromHome, South Africa Canonical Full time

    Overview: Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. The company is a pioneer of global distributed collaboration, with 1200+ colleagues in...


  • WorkFromHome, South Africa Canonical Full time

    Canonical is a leading provider of open‑source software and operating systems for global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our customers include the world's leading public cloud and silicon providers, and...


  • WorkFromHome, South Africa k0deHut Full time

    Site Reliability Engineer (SRE II) (Kubernetes/Python). About the job: Intermediate Site Reliability Engineer (SRE II). Our client is offering the right candidate a great opportunity to join a fast-growing South African fintech that enables...