About the job
Please submit your CV in English and specify your English proficiency level.
Toloka AI, via the Mindrift platform, offers freelance opportunities for specialists working on AI projects. This Senior Python Systems Developer role centers on functional testing and is open to candidates based in Greece. The position is fully remote and structured as a freelance collaboration, not a permanent job. Work focuses on testing, evaluating, and improving AI systems for major technology clients.
What you will do
- Design and implement functional black-box tests for large, multi-language codebases.
- Set up and manage Docker environments to ensure reproducible builds and consistent test execution across platforms.
- Track code coverage and configure automated scoring to align with industry benchmarks.
- Use LLM tools such as Roo Code and Claude Code to automate repetitive tasks and improve code quality.
Requirements
- At least 5 years as a Software Engineer, with strong Python experience.
- Deep knowledge of pytest, including fixtures, session-scoped fixtures, and test timeouts.
- Experience designing black-box functional tests for CLI tools.
- Advanced Docker skills: reproducible Dockerfiles, user contexts, and secure workspaces.
- Strong Linux and Bash scripting skills, including debugging in containers.
- Familiarity with modern Python tooling (uv, pyproject.toml, packaging).
- Ability to read and interpret code in C, C++, Rust, or Go, with support from LLMs.
- Hands-on experience using LLMs (Claude Code, Roo Code, Cursor) for development and test-case generation.
- English proficiency at B2 level or above.
Preferred qualifications
- Experience with agent evaluation platforms and MCP CLI.
Tools and technologies
Python (pytest, uv, Pillow), Docker, Bash, Git submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Collaboration details and compensation
- Freelance, project-based work through Mindrift (powered by Toloka AI).
- Fully remote and flexible: you set your own schedule, with an expected workload of 20-30 hours per week.
- Compensation depends on project and expertise. For this project, AI trainers can earn up to $30 per hour.
