Aws Data Engineer
3 weeks ago
OverviewResponsible for creating and managing the technological part of data infrastructure in every step of data flow. From configuring data sources to integrating analytical tools — all these systems would be architected, built, and managed by a general-role data engineer.Data Architecture and ManagementDesign and maintain scalable data architectures using AWS services for example, but not limited to, AWS S3, AWS Glue and AWS Athena.Implement data partitioning and cataloging strategies to enhance data organization and accessibility.Work with schema evolution and versioning to ensure data consistency.Develop and manage metadata repositories and data dictionaries.Assist and support with defining, setup and maintenance of data access roles and privileges.Pipeline Development and ETLDesign, develop and optimize scalable ETL pipelines using batch and real-time processing frameworks (using AWS Glue and PySpark). Implement data extraction, transformation and loading processes from various structured and unstructured sources.Optimize ETL jobs for performance, cost efficiency and scalability.Develop and integrate APIs to ingest and export data between various source and target systems, ensuring seamless ETL workflows.Enable scalable deployment of ML models by integrating data pipelines with ML workflows. Automation, Monitoring and OptimizationAutomate data workflows and ensure they are fault tolerant and optimized.Implement logging, monitoring and alerting for data pipelines.Optimize ETL job performance by tuning configurations and analyzing resource usage.Optimize data storage solutions for performance, cost and scalability.Ensure the optimisation of AWS resources for scalability for data ingestion and outputs.Deploy machine learning models into productions using cloud based services like AWS Sagemaker. Security, Compliance and Best PracticesEnsure API security, authentication and access control best practices.Implement data encryption, access control and compliance with GDPR, HIPAA, SOC2 etc.Establish data governance policies, including access control and security best practices. DevelopmentTeam Mentorship and CollaborationWork closely with data scientists, analysts and business teams to understand data needs.Collaborate with backend teams to integrate data pipelines into CI / CD.Assist with developmental leadership to the team through coaching, code reviews and mentorship.Ensure technological alignment with B2C division strategy supporting overarching hearX strategy and vision.Identify and encourage areas for growth and improvement within the team.QMS and ComplianceDocument data processes, transformations and architectural decisions.Maintain high standards of software quality within the team by adhering to good processes, practices and habits, including compliance to QMS system, and data and system security requirements.Ensure compliance to the established processes and standards for the development lifecycle, including but not limited to data archival.Drive compliance to the hearX Quality Management System in line with the Quality Objectives, Quality Manual, and all processes related to the design, development and implementation of software related to medical devicesply to ISO, CE, FDA (and other) standards and requirements as is applicable to assigned products.Safeguard confidential information and data. Role RequirementsBachelor's degree in Computer Science or Engineering (or similar)Honors degree in Computer Science or Engineering (or similar)AWS Certified Solutions Architect orAWS Certified Data AnalystMinimum applicable experience5+ years working experienceRequired nature of experienceExperience with AWS services used for data warehousing, computing and transformations i.e. AWS Glue (crawlers, jobs, triggers, and catalog), AWS S3, AWS Lambda, AWS Step Functions, AWS Athena and AWS CloudWatchExperience with SQL and NoSQL databases (e.g., PostgreSQL, MySQL, DynamoDB)Experience with SQL for querying and transformation of dataSkills and Knowledge (essential)Strong skills in Python (especially PySpark for AWS Glue)Strong knowledge of data modeling, schema design and database optimizationProficiency with AWS and infrastructure as codeSkills and Knowledge (desirable)Knowledge of SQL, Python, AWS serverless microservices,Deploying and managing ML models in productionVersion control (Git), unit testing and agile methodologiesThis job description is not a definitive or exhaustive list of responsibilities and is subject to change depending on changing business requirements. Employees will be consulted on any changes. If you do not hear from us within 30 days, please consider your application unsuccessful. #J-18808-Ljbffr
-
AWS Cloud Data Engineer
3 weeks ago
Pretoria, South Africa OttoBauthentic Full timeJOB OVERVIEW / ROLE PURPOSE We are seeking a Senior Data Engineer with expertise in cloud data platforms, big data pipelines, and advanced analytics . In this role, you’ll architect and maintain scalable, high-performance data ecosystems that power machine learning models, BI dashboards, and AI-driven decision-making . You’ll combine hands-on engineering...
-
AWS Cloud Data Engineer
3 weeks ago
Pretoria, South Africa OttoBauthentic Full timeJOB OVERVIEW / ROLE PURPOSEWe are seeking a Senior Data Engineer with expertise in cloud data platforms, big data pipelines, and advanced analytics. In this role, you’ll architect and maintain scalable, high-performance data ecosystems that power machine learning models, BI dashboards, and AI-driven decision-making. You’ll combine hands-on engineering...
-
Aws Cloud Data Engineer
3 weeks ago
Pretoria, South Africa OttoBauthentic Full timeJob Overview / Role Purpose We are seeking a Senior Data Engineer with expertise in cloud data platforms, big data pipelines, and advanced analytics . In this role, you’ll architect and maintain scalable, high-performance data ecosystems that power machine learning models, BI dashboards, and AI-driven decision-making . You’ll combine hands-on engineering...
-
AWS Engineer
6 days ago
Pretoria, South Africa Jordan Human Resource Full timeReference: JHB -ZN-1 Our Client in the IT industry is looking for an AWS Data Engineer. If you meet the below requirements, kindly send us your CV. Duties & Responsibilities ESSENTIAL SKILLS REQUIREMENTS: Above average experience/understanding (in order of importance): Terraform Python 3x SQL - Oracle/PostgreSQL Py Spark Boto3 ETL Docker Linux / Unix Big...
-
Data Engineer | AWS, Python
2 weeks ago
Pretoria, South Africa Jordan Hr Full timeA reputable global company based in Gauteng is seeking a skilled Data Engineer. The ideal candidate will possess a degree and be a Certified AWS Cloud Practitioner. Key skills include proficiency in Terraform, Python, SQL, and experience with data formats and APIs. This full-time role involves designing and validating data processes to ensure accuracy and...
-
Aws Devops Engineer
2 weeks ago
Pretoria, South Africa Vaxowave Full timeTHE JOB AT A GLANCE Join an incredible team to help drive the growth of our AWS Cloud Practice. You will use your cloud computing skills and knowledge to design and implement solutions to challenging client problems. If you have AWS cloud experience, an entrepreneurial drive, and are seeking a unique opportunity to make a difference, Vaxowave is the place...
-
Pretoria, South Africa E-Merge Full timeWere a team of curious minds and caffeine-fueled builders on a mission to turn raw data into real-world impact. We believe in pipelines that dont leak, schemas that actually make sense, and dashboards that dont make your eyes bleed. Our company is scaling fast, and guess what? So is our data. Thats where you come in.Currently in search for a Data Engineer!!!...
-
Aws Engineer Tshwane
6 days ago
Pretoria, South Africa Jordan Human Resource Full timeReference: JHB -ZN-1 Our Client in the IT industry is looking for an AWS Data Engineer. If you meet the below requirements, kindly send us your CV. Duties & Responsibilities ESSENTIAL SKILLS REQUIREMENTS: Above average experience/understanding (in order of importance): Terraform Python 3.x SQL - Oracle/PostgreSQL Py Spark Boto3 ETL Docker Linux / Unix Big...
-
AWS Cloud Engineer
7 hours ago
Pretoria, Gauteng, South Africa Vaxowave Full timeTHE JOB AT A GLANCEJoin an experienced and forward-thinking DevOps team where your expertise in Nexus Repository, Bitbucket, and Bamboo will be central to ensuring robust and efficient software delivery pipelines. If you're passionate about CI/CD, build automation, and cloud technologies like AWS, and want to work in a fast-paced environment where your input...
-
Data Engineer
6 days ago
Pretoria, South Africa E-Merge Full timeWe are looking for talented Data Engineers and Data Scientists to join our team and help us solve some of the most complex and impactful challenges in engineering and digital transformation. As a Data Engineer / Data Scientist, you will work at the intersection of big data, advanced analytics, and cutting-edge technology. You will design, build, and deploy...