Data Engineer

3 days ago


Remote South Africa Sand Tech Holdings Limited Full time R250 000 - R500 000 per year

ABOUT SAND

Sand Technologies is a global leader in digital transformation, empowering leading organisations and governments worldwide to achieve their digital aspirations. 

We offer a comprehensive suite of services, including enterprise AI solutions, data science, software engineering, and IoT, delivered from our centres in the Americas, Europe, and Africa. 

Our training programmes, in partnership with organisations like the Mastercard Foundation, Amazon Web Services, Holberton, and ALX cultivate the next generation of agile digital leaders.

Through recent strategic acquisitions, Sand Technologies has further strengthened its capabilities in advanced analytics and intelligent software development, enhancing our ability to solve our clients' most pressing challenges across telecom, utilities, healthcare, and insurance industries. 

We believe in harnessing technology to deliver real impact and value, helping organisations bridge the gap between their current reality and digital future.

ABOUT THE ROLE

Sand Technologies focuses on cutting-edge cloud-based data projects, leveraging tools such as Databricks, DBT, Docker, Python, SQL, and PySpark to name a few. We work across a variety of data architectures such as Data Mesh, lakehouse, data vault and data warehouses. Our data engineers create pipelines that support our data scientists and power our front-end applications. This means we do data-intensive work for both OLTP and OLAP use cases. Our environments are primarily cloud-native spanning AWS, Azure and GCP, but we also work on systems running self-hosted open source services exclusively. We strive towards a strong code-first, data as a product mindset at all times, where testing and reliability with a keen eye on performance is a non-negotiable.

JOB SUMMARY

A Data Engineer, has the primary role of designing, building, and maintaining scalable data pipelines and infrastructure to support data-intensive applications and analytics solutions. They closely collaborate with data scientists, analysts, and software engineers to ensure efficient data processing, storage, and retrieval for business insights and decision-making. From their expertise in data modelling, ETL (Extract, Transform, Load) processes, and big data technologies it becomes possible to develop robust and reliable data solutions.

RESPONSIBILITIES

  1. Data Pipeline Development: Design, implement, and maintain scalable data pipelines for ingesting, processing, and transforming large volumes of data from various sources using tools such as databricks, python and pyspark.
  2. Data Modeling: Design and optimize data models and schemas for efficient storage, retrieval, and analysis of structured and unstructured data.
  3. ETL Processes: Develop and automate ETL workflows to extract data from diverse sources, transform it into usable formats, and load it into data warehouses, data lakes or lakehouses.
  4. Big Data Technologies: Utilize big data technologies such as Spark, Kafka, and Flink for distributed data processing and analytics.
  5. Cloud Platforms: Deploy and manage data solutions on cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP), leveraging cloud-native services for data storage, processing, and analytics.
  6. Data Quality and Governance: Implement data quality checks, validation processes, and data governance policies to ensure accuracy, consistency, and compliance with regulations.
  7. Monitoring, Optimization and Troubleshooting: Monitor data pipelines and infrastructure performance, identify bottlenecks and optimize for scalability, reliability, and cost-efficiency. Troubleshoot and fix data-related issues.
  8. DevOps: Build and maintain basic CI/CD pipelines, commit code to version control and deploy data solutions.
  9. Collaboration: Collaborate with cross-functional teams, including data scientists, analysts, and software engineers, to understand requirements, define data architectures, and deliver data-driven solutions.
  10. Documentation: Create and maintain technical documentation, including data architecture diagrams, ETL workflows, and system documentation, to facilitate understanding and maintainability of data solutions.
  11. Best Practices: Continuously learn and apply best practices in data engineering and cloud computing.

QUALIFICATIONS

  • Proven experience as a Data Engineer, or in a similar role, with hands-on experience building and optimizing data pipelines and infrastructure.
  • Proven experience working with Big Data and tools used to process Big Data
  • Strong problem-solving and analytical skills with the ability to diagnose and resolve complex data-related issues.
  • Solid understanding of data engineering principles and practices.
  • Excellent communication and collaboration skills to work effectively in cross-functional teams and communicate technical concepts to non-technical stakeholders.
  • Ability to adapt to new technologies, tools, and methodologies in a dynamic and fast-paced environment.
  • Ability to write clean, scalable, robust code using python or similar programming languages. Background in software engineering a plus.

DESIRABLE LANGUAGES/TOOLS

  • Proficiency in programming languages such as Python, Java, Scala, or SQL for data manipulation and scripting.
  • Strong understanding of data modelling concepts and techniques, including relational and dimensional modelling.
  • Experience in big data technologies and frameworks such as Databricks, Spark, Kafka, and Flink.
  • Experience in using modern data architectures, such as lakehouse.
  • Experience with CI/CD pipelines and version control systems like Git.
  • Knowledge of ETL tools and technologies such as Apache Airflow, Informatica, or Talend.
  • Knowledge of data governance and best practices in data management.
  • Familiarity with cloud platforms and services such as AWS, Azure, or GCP for deploying and managing data solutions.
  • Strong problem-solving and analytical skills with the ability to diagnose and resolve complex data-related issues.
  • SQL (for database management and querying)
  • Apache Spark (for distributed data processing)
  • Apache Spark Streaming, Kafka or similar (for real-time data streaming) 
  • Experience using data tools in at least one cloud service - AWS, Azure or GCP (e.g. S3, EMR, Redshift, Glue, Azure Data Factory, Databricks, BigQuery, Dataflow, Dataproc

  • Data Scientist

    3 days ago


    South Africa Palindrome Data Full time R500 000 - R1 000 000 per year

    Palindrome Data uses contemporary machine learning to revolutionise traditional public health approaches. We currently focus on HIV and Maternal and Child Health programmes, and our work thus far has been in South Africa and the nearby region.We're growing the team and are opening a Data Science position to add new talent to our team and provide an...


  • South Africa Palindrome Data Full time R100 000 - R200 000 per year

    Palindrome Data uses contemporary machine learning to revolutionise traditional public health approaches. We currently have a focus on HIV and Maternal and Child Health programmes, and our work thus far has focused on South Africa and the surrounding region.We're growing the team and need a sharp Data Science Manager to lead our design, management and...

  • Data Engineer

    3 weeks ago


    Remote, South Africa Collab SA Full time

    What we're Looking For Essential: 5+ years of data engineering experience with a focus on AWS Strong proficiency in Python and SQL Hands-on experience with big data technologies (Hadoop, Spark) Knowledge of AWS data services and best practices Experience with Infrastructure as Code (Terraform or CloudFormation) Strong software engineering fundamentals...

  • Data Engineer

    2 weeks ago


    South Africa b29e6a5b-e27b-490f-a5e1-e163acf00d2b Full time R1 200 000 - R2 400 000 per year

    Job Advert SummaryEduvos is looking to employ the services of a Data EngineerPurpose of the roleThe Data Engineer role at Eduvos reports to the National Manager: Data and BI, and is responsible for designing, building, and maintaining scalable data pipelines and architecture. The role includes supporting both batch and real-time data processing to ensure...

  • Data Engineer

    7 days ago


    South Africa Nedbank Private Wealth Full time R250 000 - R600 000 per year

    Job Classification Data Engineer TA: Luthando BowerClosing date: 1 December 2025 Job Family Information Technology Career Stream Data  Leadership Pipeline Manager of Self Professional Job Purpose The purpose of the Data Engineer is to leverage their data expertise and data related technologies, in line with the Nedbank Data Architecture Roadmap,...

  • Data Engineer

    7 days ago


    South Africa Kaleidocode Full time R600 000 - R1 200 000 per year

    Data EngineerRole PurposeWe are seeking an experienced and passionate Data Engineer to design and build robust data infrastructure that facilitates large data volumes. The role requires a blend of data engineering skills (taking data from various datasets and collections to clean, store, transform and analyse) and database administration skills (working...

  • Data Engineer

    1 week ago


    South Africa Nedbank Private Wealth Full time R250 000 - R500 000 per year

    Requisition Details & Talent Acquisition Consultant REQ Keabetswe Modise Job Family Information Technology Career Stream Data  Leadership Pipeline Manager of Self Professional Job Purpose The purpose of the Data Engineer is to leverage their data expertise and data related technologies, in line with the Nedbank Data Architecture Roadmap, to...

  • Data Engineer

    3 days ago


    Remote Position, South Africa Hire Resolve Full time R60 000 - R120 000 per year

    Hire Resolve is seeking a talented Data Engineer on behalf of our client, a leading provider of Customer Relationship Management (CRM) software solutions. This is an exciting opportunity to join a dynamic team that specializes in developing innovative CRM systems, designed to help businesses manage and enhance customer relationships. As a Data Engineer,...

  • Data Engineer

    3 days ago


    Johannesburg, South Africa PBT Group Full time

    PBT Group is seeking an experienced Data Engineer to join our client’s dynamic data team. The successful candidate will work closely with senior stakeholders to design, develop, and maintain robust data pipelines and cloud-based solutions. This role requires a strong understanding of data engineering principles, data lake management, and cloud technologies...

  • Data Engineer

    3 days ago


    Sandton, South Africa Boardroom Appointments Full time R500 000 - R1 200 000 per year

    Data Engineer - 12 Month Contract Key Responsibilities:Data Pipeline DevelopmentDesign, build, and maintain scalable data pipelines.Develop ETL processes to extract, transform, and load data from various sources.Data Integration and ManagementIntegrate data from multiple sources to create a unified data environment.Ensure data quality and consistency across...