About the job
Please submit your CV in English and indicate your level of English proficiency.
Mindrift connects talented specialists with dynamic, project-based AI opportunities at leading tech companies, focusing on the assessment, evaluation, and enhancement of AI systems. This engagement is project-based and does not constitute permanent employment.
Role Overview
We are seeking a highly skilled Senior Python Developer with extensive functional testing experience. Ideal candidates will possess robust skills in Linux and Docker, and the capability to read and interpret code in various programming languages, such as C, Rust, and Go, with guidance from LLMs. You should be adept at translating requirements for migration tasks and utilizing tools like Roo Code or Claude Code to expedite iterative development processes.
Key Responsibilities
- Design and implement functional black-box tests for extensive codebases across diverse programming languages.
- Develop and maintain Docker environments to guarantee 100% reproducible builds and testing across various platforms.
- Track code coverage and establish automated scoring criteria that meet industry benchmark standards.
- Utilize LLM-based tools (Roo Code, Claude Code) to streamline development cycles, automate repetitive tasks, and enhance overall code quality.
Qualifications
- Minimum of 5 years of experience as a Software Engineer, primarily focusing on Python.
- In-depth expertise in pytest (including fixtures, session scope, and timeouts) and in developing black-box functional tests for CLI tools.
- Proficiency in Docker (creating reproducible Dockerfiles, managing user contexts, and ensuring secure workspaces).
- Strong skills in Linux and Bash scripting, with comfort in debugging within containers.
- Familiarity with modern Python tooling, including uv, pyproject.toml, and packaging.
- Ability to comprehend and interpret multiple programming languages (C, C++, Rust, Go) with LLM assistance.
- Experience with LLM-based tools (Claude Code, Roo Code, Cursor) for accelerating iterative development and generating test cases.
- Proficiency in English at a B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Technologies and Tools: Python (pytest, uv, Pillow), Docker, Bash, Git submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLM-based tools (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What We Offer:
- Freelance, project-based collaboration via the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation: choose your own working hours (20-30 hours per week).
- Task-based compensation, up to $30/hour* based on performance and workload.
- Opportunity to engage in innovative AI projects for top technology companies.

