Site Reliability Engineer
21 hours ago
Site Reliability Engineer (Datadog)
Recruiter: Data Centrix
Job Ref: JHB006874/LD
Date posted: Friday, November 14, 2025
Location: Johannesburg, South Africa
SUMMARY:
Are you a Site Reliability Engineer with solid Datadog experience? Our client in the Warehousing and Logistics sector is looking to employ an engineer to support the design, implementation, and optimization of Datadog monitoring solutions across infrastructure, applications, and services.
POSITION INFO:
Qualifications and Experience:
- Datadog Certified Fundamentals – Must have
- Degree in Information Technology or Computer Science
- Management of operations on virtualized and distributed infrastructures
- Management of operations in environments with clustering, replication, and load balancing
- ITIL Practitioner (V3) / ITIL Specialist (V4)
- Experience working with Windows Server - Advantage
- 1–3 years of experience working with a modern monitoring/observability tool, ideally Datadog (or alternatives like Prometheus, Grafana, New Relic, or Dynatrace).
- Experience in:
  - Deploying and configuring monitoring agents
  - Creating dashboards and monitors
  - Parameterizing tags and labels for proper data correlation
- Basic familiarity with cloud platforms (AWS, Azure, or GCP) and container environments (Docker/Kubernetes)
- Experience working with Centreon - Advantage
- Strong interest in monitoring, DevOps, SRE, or cloud infrastructure
- Knowledge of basic scripting (e.g., Bash, Python) is a plus
Duties:
- Support the design, implementation, and optimization of Datadog monitoring solutions across infrastructure, applications, and services.
- Work alongside DevOps, infrastructure, and application teams to ensure complete observability using custom dashboards, alerts, and tagging strategies.
- Assist in the deployment and onboarding of new systems into the monitoring ecosystem.
- Serve as the go-to person for building visualizations, improving signal-to-noise ratios in alerting, and aligning monitoring with business objectives.
- Ideal for a young and motivated engineer looking to grow within observability and cloud-native monitoring.
- Deploy and configure Datadog agents across various environments (cloud and on-prem).
- Create and customize dashboards, monitors, and alerts for systems, services, containers, and applications.
- Implement tagging strategies to organize, filter, and correlate metrics and logs effectively.
- Integrate Datadog with various platforms (AWS, Azure, GCP, Kubernetes, Docker, etc.) to collect telemetry data.
- Collaborate with developers, DevOps, and infrastructure teams to identify key business and system metrics to monitor.
- Continuously tune and optimize monitors to reduce false positives and improve actionable alerting.
- Document dashboards, alert logic, best practices, and knowledge for cross-team enablement.
- Analyze incidents and outages post-mortem to identify monitoring gaps and enhance visibility.
- Assist in evangelizing observability practices within the organization and contribute to monitoring-as-code efforts (e.g., Terraform for Datadog resources).
- Stay up to date with new Datadog features and industry trends in observability and monitoring.
-
Site Reliability Engineer
2 weeks ago
Johannesburg, Gauteng, South Africa Nedbank Full time R1 800 000 - R2 500 000 per year
Requisition Details & Talent Acquisition Consultant: REQ Keabetswe Modise
Closing Date: 05 December 2025
Job Family: Information Technology
Career Stream: Application Development
Leadership Pipeline: Manage Self: Professional
Job Purpose: To serve as an IT professional specialising in Site Reliability Engineering (SRE) at Nedbank, contributing to the strategic...
-
Site Reliability Engineer
2 weeks ago
Johannesburg, Gauteng, South Africa ExecutivePlacements - The JOB Portal Full time R900 000 - R1 200 000 per year
Site Reliability Engineer (Datadog)
Recruiter: Data Centrix
Job Ref: JHB006874/LD
Date posted: Tuesday, October 7, 2025
Location: Johannesburg, South Africa
SUMMARY: Are you a Site Reliability Engineer with solid Datadog experience? Our client in the Warehousing and Logistics sector is looking to employ an Engineer to Support the design, implementation, and optimization...
-
Principal Site Reliability Engineer
5 days ago
Johannesburg, Gauteng, South Africa Deimos Full time R120 000 - R180 000 per year
Deimos is a Cloud-native Developer and Security Operations technology services company. We help companies of all sizes adopt the Cloud for improved service delivery to their clients. We're a fully remote African-based team of engineers who are passionate about implementing engineering best practices. We leverage the latest technologies while building...
-
Site Reliability Engineer
5 days ago
Johannesburg, Gauteng, South Africa Hire Resolve Full time R250 000 - R500 000 per year
A fintech company committed to making life simpler and more secure for African communities through innovative financial and technology solutions is seeking a proactive and skilled Site Reliability Engineer (SRE) to be the guardian of their systems' uptime, performance, and scalability. This is a unique opportunity to build the foundation of infrastructure...
-
Senior Site Reliability Engineer
5 days ago
Johannesburg, Gauteng, South Africa Boardroom Appointments Full time R2 000 000 - R2 500 000 per year
Duties and responsibilities: Being part of the integral foundation of the company platform, you will get to apply your knowledge and experience to the various SRE projects at our company. Building and scaling infrastructure, version control systems and CI/CD processes on the company platform by applying and enforcing Infrastructure as Code, which is accessed...
-
Site Reliability Engineer
5 days ago
Johannesburg, Gauteng, South Africa k0dehut Full time R500 000 - R1 200 000 per year
Intermediate Site Reliability Engineer (SRE II)
Our Client is offering the right candidate a great opportunity to join a fast growing South African fintech that enables seamless and innovative end-to-end customer onboarding services that drive conversion rates, prevent fraud, reduce risk and costs. They provide automated and easy to implement solutions that...
-
Support Reliability Engineer
20 hours ago
Johannesburg, Gauteng, South Africa Iress Full time R60 000 - R120 000 per year
See yourself being part of a large, transformational change? This could be the role for you.
At Iress, we make things happen. We believe technology should help people perform better every day. Since our beginning in 1993, people across financial services have trusted us to take their performance to the next level. From the world's most established financial...
-
Johannesburg, Gauteng, South Africa Cummins Full time R1 200 000 - R2 400 000 per year
Job Description
We are looking for a talented Systems Reliability Engineering Technical Specialist to join our team specializing in Engineering for our Distribution Business Unit in Johannesburg, Gauteng.
In this role, you will make an impact in the following ways:
Reliability Strategy Development: Develop and implement reliability programs and initiatives for...
-
Junior to Mid-level skilled SharePoint x2
1 week ago
Johannesburg, Gauteng, South Africa Gig Engineer Full time R250 000 - R500 000 per year
Gig Engineer is seeking 2x Junior to Mid-level skilled SharePoint developers for its client in the banking sector.
Contract Duration: From 1 February to 31 December 2026
Location: The resources would initially be onsite to meet and integrate with the existing SharePoint team; thereafter, they would be remote unless required onsite for meetings and/or...
-
Site Engineer
5 days ago
Johannesburg, Gauteng, South Africa Hire Resolve Full time R400 000 - R800 000 per year
Hire Resolve is currently seeking an experienced Site Engineer to join a reputable construction company in Johannesburg, South Africa. The successful candidate will be responsible for overseeing and managing various building projects in the area, ensuring they are completed on time and within budget.
Responsibilities: Does all or some of the following:...