Mid-Level Site Reliability Engineer
4 weeks ago
Purpose of role
- Site Reliability Engineers work closely with Tech Support teams and product/platform engineering teams. They are responsible for maximising the uptime of their platforms and clients, maintaining and enhancing observability, responding to incidents, and documenting, investigating and fixing the root causes of those incidents. They may also write documentation for the areas in which they specialise, to help upstream teams when future issues arise.
Duties and responsibilities
- Work closely with the Platform and Product engineering teams to ensure that the platform, infrastructure and services are designed and optimised for availability, latency and performance
- Own and configure observability tooling
- Create and tune alerts to ensure we have adequate warning of impending failures, and check alerts as they are raised
- Investigate and resolve support issues escalated from the Tech Support team
- Lead incident response, resolution, root cause investigation, retrospective write-ups and follow-up actions, so we can take every opportunity to learn, improve and make our services more resilient
- Identify patterns in incoming incidents and document these for further investigation
- Collaborate with other SREs and Tech Support to improve processes and share knowledge/best practice
Skills/Experience
- Mid-level experience with responsibility for delivery and automation in an SRE, Platform or DevOps team
- Knowledgeable and comfortable with agile development practices & legacy platforms
- Comes from an engineering background, and is familiar with modern programming languages, ideally Python but others will be accepted
- Experienced (mid-level) at scripting for automation
- Cloud certifications, or demonstrable knowledge
- Is experienced in investigating and resolving technical issues, spanning performance, functionality and system interactions
- Is confident in proposing solutions to technical issues, and is able to communicate the pros and cons of said solutions
- Is capable of documenting causes of underlying issues, creating runbooks for others to follow
- Mid-level experience (competent general usage) with any public cloud provider, e.g. GCP, AWS, Azure (ideally GCP)
- Mid-level experience (competent general usage) of observability, both in terms of best practices and tooling implementation/use (Datadog preferable, others will be accepted)
- Mid-level proficiency in using Infrastructure as Code, such as Terraform or alternatives
- Database experience and ability to understand/write SQL (MySQL/MariaDB preferable)
- Understanding of Linux Operating Systems (Debian preferable)
- Has an understanding of DevSecOps culture and experience in delivering technical outcomes within this culture
- Possesses strong communication and stakeholder management skills, with an ability to communicate complex technical topics to non-technical stakeholders
- Is comfortable with providing limited on-call cover in the evenings and at weekends
Job details
- Employer: Potentiam Limited
- Job title: Mid-Level Site Reliability Engineer
- Job location: Cape Town, Western Cape, South Africa
- Employment type: Full time
- Deadline: May 10, 2025