Senior Python Systems Developer for Functional Testing

Toloka AI
Remote, Greece
Remote contract, up to $30/hr

Experience Level

Senior

Qualifications

5+ years of experience in software development, particularly in Python, with a focus on functional testing and proficiency in Docker and Linux. Strong debugging skills in containerized environments and familiarity with multiple programming languages. Experience with LLMs for development acceleration is a plus.

About the job

Please submit your CV in English and specify your English proficiency level.

Toloka AI, via the Mindrift platform, offers freelance opportunities for specialists working on AI projects. This Senior Python Systems Developer role centers on functional testing and is open to candidates based in Greece. The position is fully remote and structured as a freelance collaboration, not a permanent job. Work focuses on testing, evaluating, and improving AI systems for major technology clients.

What you will do

  • Design and implement functional black-box tests for large, multi-language codebases.
  • Set up and manage Docker environments to ensure reproducible builds and consistent test execution across platforms.
  • Track code coverage and configure automated scoring to align with industry benchmarks.
  • Use LLM tools such as Roo Code and Claude Code to automate repetitive tasks and improve code quality.

Requirements

  • At least 5 years as a Software Engineer, with strong Python experience.
  • Deep knowledge of pytest, including fixtures, session-scoped fixtures, and timeouts.
  • Experience designing black-box functional tests for CLI tools.
  • Advanced Docker skills: reproducible Dockerfiles, user contexts, and secure workspaces.
  • Strong Linux and Bash scripting skills, including debugging in containers.
  • Familiarity with modern Python tooling (uv, pyproject.toml, packaging).
  • Ability to read and interpret code in C, C++, Rust, or Go, with support from LLMs.
  • Hands-on experience using LLMs (Claude Code, Roo Code, Cursor) for development and test-case generation.
  • English proficiency at B2 level or above.

Preferred qualifications

  • Experience with agent evaluation platforms and MCP CLI.

Tools and technologies

Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.

Collaboration details and compensation

  • Freelance, project-based work through Mindrift (powered by Toloka AI).
  • Fully remote and flexible: set your own schedule and hours (20-30 hours per week).
  • Compensation depends on project and expertise. For this project, AI trainers can earn up to $30 per hour.

About Toloka AI

Mindrift specializes in connecting talented professionals with innovative project-based AI opportunities, enabling them to contribute to the development and improvement of cutting-edge AI technologies for top tech companies.
