Intermediate Data Engineer

4 days ago


Sandton, Gauteng, South Africa k0dehut Full time R500 000 - R1 200 000 per year


Data Engineer

Hybrid / Sandton Jhb

Job Purpose

We are seeking a talented and experienced Data Engineer to join our MLOps team which drives critical business applications. As a key member of our team, you will play a crucial role in designing, building, testing, deploying, and monitoring end-to-end data pipelines for both batch and streaming use cases. You will work closely with data scientists, actuaries, software engineers, and other data engineers to contribute to architecting our Client's modern Machine Learning ecosystem.

Areas of responsibility may include but not limited to:

Data Pipeline Development:

  • Design, build, and maintain ETL pipelines for both batch and streaming use cases.
  • Optimize and refactor existing ETL pipelines to improve efficiency, scalability, and cost-effectiveness.
  • Data visualization and report building.
  • Re-architecting data pipelines for a modern data stack leveraging modern data tools to support actuarial, machine learning, and AI use cases.

Technology Stack:

  • Utilize expertise in Python and SQL for data pipeline development.
  • Using Linux and shell scripting for system automation.
  • Hands-on experience working with Docker and container orchestration tools is advantageous.
  • Knowledge of Spark is advantageous.

Platforms and Tools:

  • Experience working with ETL tools such as Azure Data Factory, dbt, Airflow, Step Functions, etc.
  • Using Databricks, Kafka and Spark Streaming for big data processing across multiple data sources.
  • Working with both relational and NoSQL databases. Knowledge of and experience with high-performance in-memory databases is advantageous.

DevOps and Automation:

  • Working with Azure DevOps to automate workflows and collaborate with cross-functional teams.
  • Familiarity with Terraform for managing infrastructure as code (IaC) is advantageous.
  • Experience working on other big data platforms could be advantageous.
  • Create and maintain documentation of processes, technologies, and code bases.

Collaboration:

  • Collaborate closely with data scientists, actuaries, software engineers, and other data engineers to understand and address their data needs.
  • Contribute actively to the architecture of our Client's modern Machine Learning data ecosystem.

Personal Attributes and Skills

  • Strong proficiency in Python, SQL, and Linux shell scripting.
  • Experience with Spark is advantageous.
  • Previous exposure to ETL tools, relational and NoSQL databases and big data platforms, with experience in Databricks and Azure Data Factory being highly beneficial.
  • Knowledge of DevOps practices and tools, with experience in Azure DevOps being highly beneficial.
  • Familiarity with Terraform for infrastructure automation.
  • Ability to collaborate with cross-functional tech teams as well as business/product teams.
  • Ability to architect data pipelines for advanced analytics use cases.
  • A willingness to embrace a strong DevOps culture.
  • Excellent communication skills.
  • Commitment to excellence and high-quality delivery.
  • Passion for personal development and growth, with a high learning potential.

Education and Experience

  • Bachelor's or Masters degree in Computer Science, Engineering or a related field. Other qualifications will be considered if accompanied by sufficient experience in data engineering.
  • At least 3 years of proven experience as a Data Engineer.



  • Sandton, Gauteng, South Africa Advtech Full time R900 000 - R1 200 000 per year

    ADvTECH is the leading provider of private education in Africa, passionately committed to enriching people's lives and futures. The academic strategy of ADvTECH's School's Division is to ensure that each student progresses incrementally toward mastery in a learning environment that is flexible, inspiring, engaging and challenging. ADvTECH houses a number of...


  • Sandton, Gauteng, South Africa Dis-Chem Life Full time R2 000 000 - R2 500 000 per year

    Purpose of the RoleThe Senior Data Engineer is the architect and builder of Dis-Chem Life's data foundation, creating the infrastructure that turns raw information into a strategic asset. This role goes far beyond moving data from A to B, it is about designing high-performance, future-proof systems that make data accurate, accessible, and truly powerful.By...


  • Sandton, Gauteng, South Africa 36f54d45-3157-4a45-ada8-49709a7ad8f3 Full time R1 200 000 - R2 400 000 per year

    DescriptionSenior Data EngineerLocation: JohannesburgJob Type: Full-time (100%)About ScytaleScytale is a fast-growing B2B SaaS startup transforming cybersecurity compliance for businesses worldwide. Our innovative Compliance-as-a-Service platform simplifies frameworks like SOC 2, ISO 27001, HIPAA, GDPR, and PCI DSS for startups, scale-ups, and enterprises....

  • Senior Data Engineer

    10 hours ago


    Sandton, Gauteng, South Africa Clinix Health Group (Pty) Ltd Full time R800 000 - R1 200 000 per year

    Company description: The Clinix Health Group (CHG) assists communities to live longer, healthier lives through the provision of quality and affordable healthcare. The group provides integrated world-class healthcare systems both internally and externally to the broader ecosystem as a trusted local citizen.Job Title : Senior Data EngineerLocation : Clinix...

  • Data Engineer

    4 days ago


    Sandton, Gauteng, South Africa Boardroom Appointments Full time R500 000 - R1 000 000 per year

    Contract positionMinimum requirements:Bachelors degree or Diploma in Computer Science or a related field requiredHands-on experience in database design and implementation is requiredData warehousing concepts are advantageousAb Initio development experience is essentialAn agile working approach essentialAdobe Martech technology stackExperience in...

  • Data Engineer

    4 days ago


    Sandton, Gauteng, South Africa Boardroom Appointments Full time R100 800 - R350 000 per year

    Key purpose:As a Senior Data Engineer, you'll have the opportunity to work with our datacenter team on building and maintaining an operational data pipeline. You will be responsible for creating, testing, debugging and documenting infrastructure components as well as managing its performance. You should have experience in writing python code that interacts...


  • Sandton, Gauteng, South Africa Itonga Resourcing Full time R600 000 - R1 200 000 per year

    We're Hiring: Intermediate / Senior C# Developer (Dynamics) – Permanent | Sandton Location: Sandton (Permanent) Role: Intermediate / Senior C# Developer (Dynamics) Company: MS Gold Partner, working on custom, enterprise-grade softwareWhat you'll do• Design, develop and enhance bespoke software using C# and Dynamics 365• Build and consume APIs,...


  • Sandton, Gauteng, South Africa Pkasti Technologies Full time R1 600 000 - R2 200 000 per year

    Company DescriptionPkasti Technologies is dedicated to providing exceptional technology solutions tailored to meet the unique needs of businesses. With a team of highly skilled engineers possessing extensive technical knowledge and hands-on experience, Pkasti Technologies specializes in leveraging the latest software standards, tools, platforms, frameworks,...


  • Sandton, Gauteng, South Africa Boardroom Appointments Full time R1 000 000 - R3 000 000 per year

    Key purpose:As a Cloud Data Engineer, you will be responsible for building and maintaining data solutions in the cloud. You'll work with teams across different disciplines to implement new technology and build new systems that support our business goals. You will use your technical expertise as well as your creativity to think through problems creatively and...

  • Senior Data Engineer

    2 weeks ago


    Sandton, Gauteng, South Africa RedScreen (Pty) Ltd. Full time R2 000 000 - R2 500 000 per year

    SENIOR DATA ENGINEERPermanent Position- HybridA technical professional responsible for designing, building, and maintaining a data infrastructure, including databases and data warehouses, to ensure accurate and timely data access for analysis by other teams or applications, requiring a deep understanding of complex data systems and their...