AI Agent Evaluation Analyst
1 week ago
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.
At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.
What we do
The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.
Who we're looking for:
We're looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil's advocate.
Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?
This is a flexible, project-based opportunity well-suited for:
- Analysts, researchers, or consultants with strong critical thinking skills.
- Students (senior undergrads / grad students) looking for an intellectually interesting gig.
- People open to a part-time and non-permanent opportunity.
About the project:
We're on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you'll have to balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.
You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you've ever excelled in things like consulting, CHGK, Olympiads, case solving, or systems thinking — you might be a great fit.
What you'll be doing:
- Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
- Identifying inconsistencies, missing assumptions, or unclear decision points.
- Helping define clear expected behaviors (gold standards) for AI agents.
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
- Thinking through complex systems and policies as a human would to ensure agents are tested properly.
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage.
How to get started:
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.
Requirements
- Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.
- Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.
- Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.
- Ability to assess scenarios holistically: What's missing, what's unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings.
We also value applicants who have:
- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.
- Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.
- Exposure to LLMs, prompt engineering, or AI-generated content.
- Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong").
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).
Benefits
- Get paid for your expertise, with rates that can go up to $20/hour depending on your skills, experience, and project needs.
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
- Influence how future AI models understand and communicate in your field of expertise.
-
AI Bot Developer
5 days ago
Johannesburg, Gauteng, South Africa Sabio Group Full time R250 000 - R500 000 per yearAI Bot DeveloperDepartment: Delivery Employment Type: Full TimeLocation: JohannesburgDescription At Sabio Group, we're dedicated to fostering an environment where employees thrive. Since 1998, we've built a dynamic culture that is both challenging and fun, driven by a team of ambitious, knowledgeable individuals who are passionate about leading the CX...
-
AI Bot Developer
1 week ago
Johannesburg, Gauteng, South Africa Sabio Group Full time R104 000 - R208 000 per yearAI Bot DeveloperDepartment: Delivery Employment Type: Full TimeLocation: JohannesburgDescriptionAt Sabio Group, we're dedicated to fostering an environment where employees thrive. Since 1998, we've built a dynamic culture that is both challenging and fun, driven by a team of ambitious, knowledgeable individuals who are passionate about leading the CX...
-
Freelance Economics Expert
20 hours ago
Johannesburg, Gauteng, South Africa Mindrift Full time R1 296 000 - R2 100 000 per yearThis opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.What...
-
AI GPT Specialist
2 weeks ago
Johannesburg, Gauteng, South Africa peopleworth Full time R250 000 - R500 000 per yearAt peopleworth, we support work where people and performance thrive. As part of our Employer Group, we work with a variety of forward-thinking partners and are excited to share this opportunity that sits within our growing group.Role OverviewThis short-term freelance role focuses on creating internal GPT or agent style tools that improve operational...
-
AI Tester
2 weeks ago
Johannesburg, Gauteng, South Africa Sourceworx Full time R250 000 - R750 000 per yearJob Purpose:The AI Tester will be responsible for validating the functionality, performance, and reliability of AI-driven systems and applications. This includes designing and executing test cases for machine learning models, automation flows, and intelligent features integrated into enterprise platforms.Mandatory skills:Experience: At least years in the...
-
Senior AI Specialist
3 days ago
Johannesburg, Gauteng, South Africa Boardroom Appointments Full time R1 200 000 - R2 400 000 per yearMinimum requirements:Advanced Diplomas/National 1st DegreesB.Sc Computer Science, B.Com Informatics, Engineering Degrees (preferred) Candidates should have development experience in generative models such as M365 Co-Pilot, Bing Chat, GitHub Co-Pilot, GPT, and Transformer models. Candidates should identify the use case, define POC generative AI models, and...
-
AI Training Experts
6 days ago
Johannesburg, Gauteng, South Africa e0254c18-3a6f-44fc-8fa4-ef6c5782af5e Full time R1 200 000 - R2 144 000 per yearAI Training ExpertsAbout ProlificProlific is not just another player in the AI space – we are building the biggest pool of quality human data in the world.Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.The roleWe're looking for AI...
-
AI Platform Architecture
5 days ago
Johannesburg, Gauteng, South Africa InfyStrat Full time R2 000 000 - R2 500 000 per yearInfyStrat is seeking an experienced AI Platform Architect to join our innovative team. In this role, you will be responsible for the design, architecture, and implementation of AI platforms that drive data analysis and machine learning initiatives across our organization. You will work closely with data scientists, engineers, and business stakeholders to...
-
QA Agent
1 week ago
Johannesburg, Gauteng, South Africa ExecutivePlacements Full time R250 000 - R450 000 per yearRecruiter:One SparkJob Ref:153014Date posted:Thursday, October 23, 2025Location:Johannesburg, South AfricaSalary:SUMMARY:POSITION INFO:Purpose of the Role:As a Quality Assurance Agent, your mission is to protect and elevate every customer experience, ensuring every interaction reflects the care, trust, and excellence we promise. You will be the guardian of...
-
QA Agent
1 week ago
Johannesburg, Gauteng, South Africa Dis-Chem Life Full time R25 000 - R40 000 per yearPurpose of the Role:As a Quality Assurance Agent, your mission is to protect and elevate every customer experience, ensuring every interaction reflects the care, trust, and excellence we promise. You will be the guardian of our standards, the coach who helps agents shine, and the voice that champions both our customers and our people. By listening deeply,...