About the job
Please submit your CV in English and indicate your English proficiency level.
Toloka AI, working with Mindrift, offers project-based freelance roles for professionals who want to help test, evaluate, and improve artificial intelligence systems for leading technology companies. These are contract assignments, not permanent positions.
Role overview
The Freelance Data Science Engineer (Python & SQL) works remotely from Virginia, United States, and takes on a variety of AI-related projects. Assignments change from project to project, but the core work centers on designing and validating computational data science challenges that reflect real-world analytical problems across industries like telecommunications, finance, government, e-commerce, and healthcare.
- Develop data science problems that require Python programming, using libraries such as Pandas, Numpy, Scipy, Sklearn, Statsmodels, Matplotlib, and Seaborn.
- Ensure tasks are complex enough to need significant computation and cannot be solved manually in a short time.
- Create scenarios involving advanced data processing, statistical analysis, feature engineering, predictive modeling, and business insight generation.
- Design deterministic problems with reproducible results, using fixed random seeds if randomness is necessary.
- Base assignments on real business challenges like customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency.
- Build end-to-end tasks that cover the data science workflow: data ingestion, cleaning, exploration, modeling, validation, and deployment considerations.
- Integrate big data scenarios that require scalable computation strategies.
- Validate solutions using Python, standard data science libraries, and statistical methods.
- Document each problem clearly, including realistic business contexts and verified solutions.
Requirements
- Minimum 5 years of hands-on data science experience with proven business results.
- Portfolio of completed projects or publications that highlight practical problem-solving skills.
- Advanced Python programming for data science, especially with Pandas, Numpy, Scipy, and Scikit-learn.
- Strong background in statistical analysis and machine learning, including algorithms and real-world applications.
- Proficiency in SQL and database operations for data analysis.
- Experience with Generative AI (LLMs, RAG, prompt engineering, vector databases).
- Understanding of MLOps and model deployment processes.
- Familiarity with tools such as TensorFlow, PyTorch, and LangChain.
- Excellent written English skills at C1 level or higher.
How to join
- Apply
- Pass qualifications
- Join a project
- Complete tasks
- Receive compensation

