About the job
Please submit your CV in English and indicate your level of English proficiency.
Mindrift connects talented specialists with project-based AI opportunities for leading tech companies, focusing on testing, evaluating, and enhancing AI systems. Please note that participation is project-based rather than permanent employment.
About the Role
We are looking for a highly skilled Senior Python Developer who possesses extensive functional testing experience and exceptional Linux and Docker abilities. The ideal candidate should be adept at reading and interpreting code across multiple programming languages (such as C, Rust, and Go) with the aid of LLMs, and should be capable of translating requirements for migration tasks. Familiarity with tools like Roo Code or Claude Code to streamline iterative development is essential.
Key Responsibilities
- Develop and implement functional black-box tests for extensive codebases in diverse source languages.
- Establish and manage Docker environments to guarantee 100% reproducible builds and test executions across varied platforms.
- Oversee code coverage and configure automated scoring criteria to align with industry benchmark standards.
- Use LLM-based tools (e.g., Roo Code, Claude Code) to expedite development cycles, automate repetitive tasks, and enhance overall code quality.
Qualifications
- A minimum of 5 years of experience as a Software Engineer, primarily focused on Python.
- Extensive experience with pytest (including fixtures, session-scoped fixtures, and timeouts) and with designing black-box functional tests for CLI tools.
- Expert-level skills in Docker (including reproducible Dockerfiles, user contexts, and secure workspaces).
- Strong proficiency in Linux and Bash scripting, along with comfort in debugging within containers.
- Proficiency with modern Python tooling (such as uv, pyproject.toml, packaging).
- The ability to read and comprehend code in multiple programming languages (C, C++, Rust, or Go) with the assistance of LLMs.
- Experience using LLM-based coding tools (Claude Code, Roo Code, Cursor) to facilitate iterative development and generate test cases.
- English language proficiency at B2 level or higher.
Preferred Qualifications
- Prior experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What We Offer
- Freelance project-based collaboration through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation—choose your schedule (20-30 hours per week).
- Task-based compensation, up to $30/hour*, based on performance and volume.
- Opportunity to make a meaningful contribution to innovative AI projects for leading tech companies.

