Freelance AI Evaluation Engineer at Mindrift | Sweden | RoboApply Jobs

This job posting is no longer active and is not accepting applications.

Freelance AI Evaluation Engineer (Python & Full-Stack)

Mindrift | Remote, Sweden

Remote | Contract | $50/hr

Qualifications

A degree in Computer Science, Software Engineering, or a related field.
5+ years of experience in software development, particularly with Python.
Strong Full-Stack development skills, including React and back-end systems.
Experience in writing functional and integration tests.
Proficiency with Docker and CI/CD pipelines.
Fluent in English (B2 level or higher).

About the job

Please submit your CV in English, including your English proficiency level.

Mindrift is dedicated to connecting talented specialists with project-based AI opportunities from leading technology firms, focusing on the testing, evaluation, and enhancement of AI systems. Please note that participation is project-based and does not constitute permanent employment.

About the Role

As a Freelance AI Evaluation Engineer, you will design and develop challenging coding test cases that rigorously assess AI coding systems:

Review and enhance realistic coding tasks derived from provided production codebases, ensuring they have realistic scope and requirements.
Develop comprehensive functional tests that validate end-to-end behavior and edge cases, going beyond superficial checks.
Create “fair yet challenging” tasks where the AI has all the necessary context but must work to retrieve it (information spread across files and external resources, requiring complex reasoning).
Analyze AI failures to identify specific challenges the model faces versus areas of proficiency.
Refine your work based on feedback from expert QA reviewers who evaluate your submissions against seven quality criteria.

Qualifications

This role is ideal for seasoned developers, software engineers, or test automation specialists interested in part-time, non-permanent projects. Preferred candidates will possess:

A degree in Computer Science, Software Engineering, or a related field.
5+ years of experience in software development, with a strong focus on Python (pytest, async/await, subprocess, file operations).
A solid background in Full-Stack development, adept at building React-based interfaces and robust back-end systems.
Experience writing functional and integration tests, not just executing them.
Proficiency with Docker containers (running evaluations locally in containers).
Understanding of CI/CD processes (experience with GitHub Actions as a user: triggers, labels, reading results).
English proficiency at a B2 level or higher.

How It Works

Apply → Pass qualifications → Join a project → Complete tasks → Get paid.

Effort Estimation

Tasks for this project are estimated to require approximately 20 hours, depending on complexity. This is merely an estimate; you have the flexibility to choose when and how to work. Tasks must be submitted by the deadline and meet specified acceptance criteria to be accepted.

Compensation

On this project, contributors can earn up to $50 per hour, depending on their experience and contribution pace. Other projects on the platform may pay differently, depending on scope, complexity, and required expertise.

About Mindrift

Mindrift specializes in connecting skilled professionals with innovative AI projects from leading technology companies. Our focus is on enhancing AI systems through rigorous testing and evaluation, offering specialists the chance to work on cutting-edge technology while contributing to impactful projects.
