About the job
Please submit your CV in English and specify your English proficiency level.
Mindrift is dedicated to connecting talented specialists with project-based artificial intelligence opportunities for top tech companies, focusing on the testing, evaluation, and enhancement of AI systems. This role is project-based and does not offer permanent employment.
About the Role
We are looking for an experienced Senior Python Developer who possesses extensive functional testing expertise, advanced skills in Linux and Docker, and the capability to read and analyze code across various programming languages (such as C, Rust, Go) with the assistance of large language models (LLMs). You should also be proficient in utilizing tools like Roo Code or Claude Code to expedite iterative development processes.
Key Responsibilities
- Design and implement black box functional tests for large codebases in multiple source languages.
- Establish and maintain Docker environments to ensure fully reproducible builds and test executions across various platforms.
- Monitor code coverage and configure automated scoring criteria to align with industry benchmark standards.
- Utilize LLMs (Roo Code, Claude) to streamline development cycles, automate routine tasks, and enhance overall code quality.
Requirements
- A minimum of 5 years of experience as a Software Engineer, primarily focused on Python.
- Extensive experience with pytest (including fixtures, session-scoped, and timeouts) and the design of black-box functional tests for command-line interface (CLI) tools.
- Advanced Docker skills (creating reproducible Dockerfiles, managing user contexts, and ensuring secure workspaces).
- Strong proficiency in Linux and Bash scripting, with comfort in debugging within containers.
- Familiarity with modern Python tooling (uv, pyproject.toml, packaging).
- Ability to read and comprehend various coding languages with the aid of LLMs (such as C, C++, Rust, or Go).
- Experience in using LLMs (Claude Code, Roo Code, Cursor) to enhance iterative development and test-case creation.
- English language proficiency at B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What We Offer
- Freelance project-based collaboration via the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation — you choose your availability and contribution level (approximately 20-30 hours per week).
- Compensation based on task completion, up to $21/hour* depending on performance and volume.
- Opportunity to work on innovative AI projects for leading technology companies.

