About the job
Recursion Pharmaceuticals brings together technology and biology to advance drug discovery. The company’s Recursion Operating System combines machine learning, large-scale experiments, and a proprietary dataset that spans biological, chemical, and patient-related data. Each week, the team conducts millions of wet lab experiments and uses a supercomputing environment to find new connections in biology and chemistry.
What you will do
- Develop, scale, and manage the core data platform. This includes building, operating, and optimizing systems that support querying and exploration across Recursion’s vast datasets, such as a chemistry library with billions of compounds, petabytes of microscopy images, and diverse assay results.
- Help make complex datasets accessible for scientific teams. Work closely with biologists, chemists, and data scientists to ensure data from different biological models and treatments can be explored and queried, even for new and evolving research questions.
- Mentor and support colleagues by sharing technical expertise and experience, helping others grow and increasing impact across teams.
The team
This role joins the Data Lake team, responsible for building and maintaining Recursion’s Data Lake and Data Lakehouse. The group manages both relational and object storage, ensuring all data is discoverable, queryable, and connected. As Recursion grows, the team integrates new data types under the guiding principle that all data flows to the Data Lake.
Location
London, England; Milton Park, England

