company

Senior Python Engineer for AI Testing Project - Freelance

Toloka AIRemote — United States
Remote Contract $80/hr - $80/hr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Essential Qualifications:5+ years of software engineering experience, primarily focused on Python development. Deep functional testing knowledge, particularly with pytest and black-box testing methodologies. Expertise in Docker and Linux environments, including Bash scripting. Proficient in modern Python tooling and familiar with multiple programming languages. Solid understanding of LLMs for enhancing development processes. B2 level or higher in English proficiency.

About the job

Please submit your CV in English and specify your level of English proficiency.

Mindrift is your gateway to project-based AI opportunities, connecting skilled professionals with top-tier tech companies focused on testing, evaluating, and enhancing AI systems. This is a project-based collaboration, not a permanent position.

About the Role

We are looking for a seasoned Python Engineer with extensive functional testing expertise. The ideal candidate will have robust skills in Linux and Docker, a proficiency for reading code across multiple languages (such as C, Rust, and Go) with the aid of LLMs, and the capability to translate migration requirements effectively. Familiarity with tools like Roo Code or Claude Code to streamline iterative development is essential.

Key Responsibilities

  • Develop and implement functional black box tests for sizable codebases across various programming languages.
  • Set up and oversee Docker environments to guarantee fully reproducible builds and test executions across platforms.
  • Monitor code coverage and develop automated scoring criteria aligning with industry benchmarks.
  • Utilize LLMs (such as Roo Code and Claude) to enhance development cycles, automate repetitive tasks, and elevate overall code quality.

Requirements

  • 5+ years of software engineering experience, primarily in Python.
  • In-depth knowledge of pytest (including fixtures, session-scoped, timeouts) and experience in designing black-box functional tests for CLI tools.
  • Advanced proficiency with Docker (including reproducible Dockerfiles, user contexts, and secure workspaces).
  • Strong skills in Linux & Bash scripting and debugging within containers.
  • Familiarity with modern Python tools (like uv, pyproject.toml, and packaging).
  • Ability to interpret and understand multiple programming languages with LLM support (such as C, C++, Rust, or Go).
  • Experience leveraging LLMs (Claude Code, Roo Code, Cursor) for accelerating iterative development and generating test cases.
  • English proficiency at a B2 level or higher.

Preferred Qualifications

  • Previous experience with agent evaluation platforms and MCP CLI.

Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.

Benefits

  • Project-based freelance collaboration via the Mindrift platform (powered by Toloka AI).
  • Fully remote and flexible participation—choose your working hours and commitment (20-30 hours per week).
  • Compensation based on task performance, up to $80/hour*.

About Toloka AI

Mindrift connects talented specialists with innovative, project-based AI opportunities in leading tech companies. We focus on testing, evaluating, and enhancing AI systems, providing a dynamic environment for skilled professionals to contribute to cutting-edge projects.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.