About the job
Please submit your CV in English and indicate your level of English proficiency.
Mindrift is a platform that connects talented specialists with project-based AI opportunities in collaboration with leading tech companies, emphasizing the testing, evaluation, and enhancement of AI systems. Note that participation is project-based and does not lead to permanent employment.
About the Role
We are seeking a Senior Python Developer with extensive experience in functional testing. The ideal candidate will have strong Linux and Docker skills, the ability to read and interpret code in other programming languages (such as C, Rust, and Go) with the assistance of LLMs, and the capability to translate requirements for migration tasks. Familiarity with tools like Roo Code or Claude Code to enhance iterative development is also essential.
Key Responsibilities
- Develop functional black-box tests for extensive codebases in multiple source languages.
- Set up and manage Docker environments to guarantee 100% reproducible builds and test executions across various platforms.
- Oversee code coverage and configure automated scoring metrics to align with industry benchmark standards.
- Use LLM-based tools (Roo Code, Claude) to accelerate development cycles, automate repetitive tasks, and enhance overall code quality.
Requirements
- 5+ years of experience as a Software Engineer, predominantly in Python.
- In-depth knowledge of pytest (including fixtures, session scope, and timeouts) and experience in designing black-box functional tests for CLI tools.
- Expert-level Docker skills (reproducible Dockerfiles, user contexts, secure workspaces).
- Strong skills in Linux and Bash scripting, with comfort in debugging within containers.
- Familiarity with modern Python tooling (uv, pyproject.toml, packaging).
- Ability to read and comprehend code in various programming languages (e.g., C, C++, Rust, Go) with the help of LLMs.
- Experience employing LLMs (Claude Code, Roo Code, Cursor) to speed up iterative development and generate test cases.
- English proficiency at B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What We Offer
- Freelance project-based collaboration via the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation: choose your own schedule and workload (20-30 hours per week).
- Task-based compensation, potentially up to $12/hour depending on performance and volume.
- Opportunity to engage in innovative AI projects for leading tech companies.

