Data Scientist, Evals

PerplexityLondon

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

QualificationsPhD or MS in a technical discipline or equivalent experience4+ years of experience in data science or machine learningStrong proficiency in Python and SQL, capable of writing production-quality codeExperience with modern cloud data stacks, specifically AWS and DatabricksComfortable with agentic coding workflows and AI-assisted development toolsPreferred Qualifications1+ years of experience with LLMs at scale, particularly LLM-as-a-judgeExperience with consumer-facing web products or apps with real user trafficStrong research background, applying methods to real-world ML problemsExperience defining evaluation metrics and building ground truth datasets

About the job

Responsibilities

Design and manage automated evaluation pipelines that measure answer quality across Perplexity's products, ensuring adherence to high standards of accuracy and usefulness.
Create tailored evaluation datasets and methodologies to assess the influence of tool calls, particularly in web search retrieval, on the quality of final answers.
Develop VLM-based solutions to programmatically analyze the visual rendering of final answers across various platforms and devices.
Consistently evaluate public benchmarks and academic assessments for their relevance to Perplexity's offerings, adapting and integrating them into our ongoing performance evaluations.
Collaborate within a small, high-impact team where your evaluation metrics will directly influence product enhancements, working closely with technical leadership to measure and elevate Answer Quality.

Qualifications

PhD or MS in a technical discipline or equivalent practical experience.
A minimum of 4 years of experience in data science or machine learning.
Proficient in Python and SQL, with the ability to write production-quality code.
Experience with modern cloud data stacks, particularly AWS and Databricks.
Familiarity with agentic coding workflows and utilizing AI-assisted development tools for efficient iteration.

Preferred Qualifications

At least 1 year of experience working with LLMs at scale, especially in LLM-as-a-judge configurations.
Experience developing customer-facing web products or consumer applications with significant user traffic.
A robust research background, demonstrating the application of research methodologies to real-world machine learning challenges.
Experience in defining evaluation metrics, such as factual consistency, hallucination rate, and retrieval precision, along with creating ground truth datasets.

About Perplexity

Perplexity is a forward-thinking technology company that powers millions of daily interactions through a unique LLM-first search engine. Our mission focuses on delivering accurate, high-quality answers by leveraging both innovative models and specialized data sources, setting us apart in the competitive landscape.

Similar jobs

1 - 20 of 1,017 Jobs

Search for Graduate Data Scientist

1,017 results

Select all on this page (20)

Apply

Graduate Data Scientist

Shift Technology

Full-time|Hybrid|United Kingdom - London

Shift Technology stands as the premier AI platform dedicated to revolutionizing the insurance sector. By integrating generative, agentic, and predictive AI, Shift enhances underwriting, claims management, fraud detection, and risk assessment, all while fostering operational efficiency and delivering outstanding customer experiences. Our proven solutions have earned the trust of the world's foremost insurers, allowing us to provide AI-driven insights at scale and with impactful results.Our workplace culture thrives on innovation, trust, and a shared mission to redefine the insurance landscape through our SaaS offerings. With a team representing over 50 diverse nationalities, we are collectively shaping the future of insurance.As a recent graduate, you’re eager to embark on your first full-time Data Science career.In this role, you will engage with a wide range of challenges, contributing to the design and enhancement of our products focused on fraud detection, anti-money laundering, and claims automation. You’ll collaborate with a team rich in technical proficiency and professional knowledge, encompassing data science, data engineering, coding, business acumen, and client interaction. The role entails working with various data types, including structured data, unstructured text, documents, and images.This position is ideal if you’re looking for a permanent role and can commute to our London office 2-3 days a week. Shift Technology is the perfect launchpad for your career journey!

Mar 17, 2026

Apply

Graduate Data Scientist in Analytics at Checkout.com | London

Checkout.com

Full-time|On-site|London

About Checkout.com Checkout.com supports the online payment experience for major brands such as eBay, ASOS, Klarna, Uber Eats, and Sony. The company’s mission centers on making digital payments simple and reliable for both businesses and consumers. Headquartered in London, Checkout.com operates from 19 offices across six continents. The team values innovation, high performance, and ongoing growth. Joining Checkout.com means becoming part of a fintech organization focused on continuous improvement. Role Overview: Graduate Data Scientist in Analytics The Graduate Data Scientist in Analytics will help generate insights into product performance and build a single, trusted source for key business metrics. This position works closely with product managers and data scientists to inform product decisions and development. The role involves end-to-end responsibility for data products, from initial concept through to implementation and ongoing operation. There is also scope to build new analytics tools and shape how data is used across the business. Main Responsibilities Design and build scalable, reusable data models for the data warehouse using dbt and Snowflake. Develop Looker structures (explores, views, and more) to enable self-service analytics for teams across the company. Work with data analysts and business partners to gather requirements and deliver data sets ready for analysis and reporting. Continuously improve, transform, test, deploy, and document data sources and models. Support and help define data warehouse governance: data quality, testing, documentation, coding standards, and peer reviews. Seek out ways to improve analytics engineering workflows and platforms. Location This position is based in London.

Apr 16, 2026

Apply

Staff Data Scientist

Marshmallow

Full-time|On-site|London

Are you a passionate data scientist looking to make a significant impact in a dynamic environment? Join Marshmallow as a Staff Data Scientist and leverage your expertise to drive innovative data solutions. In this role, you will work closely with cross-functional teams to analyze complex datasets, develop predictive models, and provide actionable insights that enhance our product offerings.

Mar 25, 2026

Apply

Data Scientist

Almedia

Full-time|On-site|London

Join Almedia as a Data Scientist, where you will play a crucial role in transforming data into actionable insights. Collaborate with cross-functional teams to analyze complex datasets, develop predictive models, and support data-driven decision-making.

Mar 2, 2026

Apply

Data Scientist

Axle Careers

Full-time|On-site|London

Join our dynamic team at Axle Careers as a Data Scientist, where you will leverage your analytical skills to extract meaningful insights from complex datasets. You will collaborate with cross-functional teams to develop innovative data-driven solutions that enhance business performance and drive strategic decision-making.

Mar 25, 2026

Apply

Data Scientist, Subscriptions

Spotify Ltd.

Full-time|On-site|London

Join Spotify's innovative team as a Data Scientist in our Subscriptions division! We are looking for a passionate individual who can leverage data to drive impactful decisions and enhance our subscription offerings. You will collaborate with cross-functional teams to analyze data trends, develop predictive models, and contribute to our mission of delivering exceptional user experiences.

Mar 27, 2026

Apply

Data Scientist - Fintech

degree6

Full-time|On-site|London

Join degree6 as a Data Scientist specializing in the fintech sector, where you will leverage your analytical skills to drive innovation and insights. You will be pivotal in developing data-driven solutions that enhance financial services and contribute to the evolution of our products.

May 3, 2019

Apply

Data Scientist - Fintech

degree6

Full-time|On-site|London

Join degree6 as a Data Scientist in the fast-paced world of fintech. In this pivotal role, you will leverage your analytical expertise to drive data-driven decisions, enhance product features, and create innovative solutions that solve real-world financial challenges. You will work in a collaborative environment, engaging with cross-functional teams to deliver impactful insights and contribute to the advancement of cutting-edge financial technologies.

May 3, 2019

Apply

Lead Data Scientist

faculty

Full-time|On-site|London

We are seeking a highly skilled and innovative Lead Data Scientist to join our dynamic team at Faculty. In this role, you will leverage your expertise in advanced analytics and machine learning to drive impactful data-driven solutions. You will lead a team of talented data scientists, guiding them in the development of predictive models and analytical frameworks that support our mission to optimize outcomes across various sectors.

Mar 3, 2026

Apply

Senior Data Scientist

GoCardless

Full-time|On-site|London, UK

About GoCardlessGoCardless is a pioneering global bank payment platform that empowers over 100,000 businesses, ranging from innovative startups to established brands, to seamlessly collect and send payments through direct debit, real-time payments, and open banking.With an impressive annual processing volume exceeding US$130bn across more than 30 countries, we facilitate both recurring and one-off payments, alleviating the burdens of follow-ups, stress, and high fees. Our AI-driven solutions enhance payment success rates and minimize fraud risks. Additionally, our open banking integration with over 2,500 banks enables our customers to make faster and more informed financial decisions.Headquartered in the UK, with offices in London and Leeds, we also have a presence in Australia, France, Ireland, Latvia, Portugal, and the United States.At GoCardless, we prioritize support and are dedicated to making our hiring process inclusive and accessible. If you require any adjustments or additional support, please reach out to your Talent Partner — we are here to assist you!Remember, we do not expect you to meet every single requirement. If you are enthusiastic about this role, we encourage you to apply!The RoleData is at the heart of our mission. We utilize bank account information to deliver intelligent payment solutions, enhancing payment success rates and preventing payer fraud.As a Senior Data Scientist in our Payment Intelligence team, you will collaborate with Software Engineers, Product Managers, and Designers to transform innovative concepts into reality. You will be responsible for the complete lifecycle of our algorithms, from the initial idea to the production-ready code that drives our global payment network.Our tech stack is centered around Google Cloud Platform and Vertex AI, creating a high-performance environment for innovation. Our Data Scientists work at the intersection of Python, SQL, and BigQuery to develop and deploy high-performance models at scale.Key ResponsibilitiesLead the comprehensive delivery of models at scale, from initial discovery and feature engineering to production, A/B testing, and ongoing monitoring.Collaborate with cross-functional teams to design and implement advanced data-driven solutions.

Jan 9, 2026

Apply

Lead Data Scientist - Trust and Safety

Wise

Full-time|On-site|London

Role overview Wise seeks a Lead Data Scientist for the Trust and Safety team in London. The main focus is protecting the platform and its users by applying advanced analytics to financial security challenges. This position involves tackling real-world risks and helping to keep Wise’s services safe for everyone. What you will do Analyze large and complex data sets to identify trends and detect suspicious activity Develop and improve models that reduce risk and prevent fraud Collaborate with stakeholders to guide strategic decisions through data-driven insights Shape new methods to strengthen platform security and build user trust Impact The work in this role directly supports Wise’s goal of making money transfers seamless and secure. Efforts here influence how Wise protects its customers and upholds the integrity of its services.

Apr 28, 2026

Apply

Senior Data Scientist at Intercom | London

Intercom

Full-time|On-site|London, England

Join our dynamic team at Intercom as a Senior Data Scientist, where you will leverage your analytical expertise to drive data-driven decisions and contribute to our innovative product development. In this role, you will collaborate with cross-functional teams to build models that enhance user experiences and optimize our services.

Mar 26, 2026

Apply

Senior Data Scientist

DeepL SE

Full-time|On-site|London

Join DeepL as a Senior Data ScientistDeepL is a pioneering AI product and research organization dedicated to crafting secure, intelligent solutions to tackle complex business challenges. With a clientele of over 200,000 businesses and millions of users in 228 global markets, our Language AI platform is relied upon for its human-like translations, enhanced writing capabilities, and real-time voice translation.Founded in 2017 by CEO Jarek Kutylowski, DeepL now boasts a talented team of over 1,000 individuals and backing from esteemed investors like Benchmark, IVP, and Index Ventures. Our mission is to lead the global AI technology space by developing products that enhance communication, foster connections, and make a significant impact on society. We seek passionate professionals ready to innovate and grow their careers in a dynamic and purpose-driven environment.What Makes DeepL UniqueAt DeepL, we blend cutting-edge AI technology with meaningful work in a culture that champions individual growth. Our team of innovators, researchers, and creators shares a vision to unlock human potential by making work more efficient, smarter, and interconnected.Our employees frequently express positive sentiments about their experiences at DeepL, attributing it to our impactful technology and the values of trust, curiosity, and care that permeate our culture.Joining DeepL means becoming part of a dedicated team committed to innovation, growth, and employee well-being. Explore more about life at DeepL on our LinkedIn, Instagram, and Blog.

Feb 5, 2026

Apply

Data Scientist at Trainline | London

Trainline

Full-time|On-site|London

Join our dynamic team at Trainline as a Data Scientist, where you will leverage data to drive insights and influence strategic decisions. You will work with cross-functional teams to develop models that enhance our customer experience and optimize our operations. If you have a passion for data and want to make a real impact in the travel industry, this is the opportunity for you!

Mar 13, 2026

Apply

Lead Data Scientist

Zego

Full-time|On-site|London, England, United Kingdom

About ZegoAt Zego, we believe that traditional motor insurance can be a barrier for exemplary drivers. It's often too complex, costly, and fails to accurately reflect real driving behavior.Since our inception in 2016, we've dedicated ourselves to transforming the insurance landscape. Our goal is to provide the most affordable insurance options for responsible drivers.Our diverse clientele, ranging from van drivers and gig economy workers to everyday car owners, is at the core of our operations.Having successfully sold tens of millions of policies and secured over $200 million in funding, we are just getting started on this exciting journey.Your RoleWe are in search of a Lead Data Scientist to join our Core Pricing team, reporting directly to the Head of Data Science. You will play a critical role in shaping and executing a high-performance data-science operating model, swiftly transforming concepts into production-ready pricing and risk models.Key ResponsibilitiesLead the development and enhancement of next-generation risk models, managing all aspects from innovative feature discovery and sophisticated modeling to dependable pipeline deployment and strategic R&D initiatives.Implement a robust Data Science workflow: ensuring clear guidelines, templates, and documentation throughout the project lifecycle.Accelerate the idea-to-production timeline: from initial data acquisition to live rate adjustments.Promote a culture of collaborative experimentation: integrating Pricing, Data Science, Machine Learning Engineering, and Systems Engineering.Team leadership and mentorship: oversee a team of 2 Data Scientists, guiding their professional growth and development.Establish coding standards and best practices, mapping common processes and creating playbooks.Assist in selecting tools utilized throughout the Data Science project lifecycle.

Mar 4, 2026

Apply

Lead Data Scientist

Hudl

Full-time|On-site|London, United Kingdom

Are you passionate about data and eager to drive innovation? Join Hudl as a Lead Data Scientist and take the reins in shaping data-driven strategies. In this role, you will leverage advanced analytics to enhance performance across various teams and projects, while mentoring junior data scientists to foster their growth.

Mar 12, 2026

Apply

Junior Data Scientist - Investigations

Ravelin

Full-time|On-site|London, England, United Kingdom

Join Ravelin, the forefront of fraud detection technology! We leverage cutting-edge machine learning and network analysis to tackle significant challenges in online transaction security. Our mission is to ensure that online transactions are safe, allowing our clients to confidently serve their customers.We believe in fostering a vibrant workplace culture characterized by empathy, ambition, unity, and integrity. At Ravelin, we emphasize work-life balance and maintain a flat hierarchy across the company. By becoming part of our team, you will quickly gain insights into the latest technologies while collaborating with some of the brightest minds in the industry. Check out our Glassdoor reviews!If you resonate with our values and enthusiasm for preventing fraud, we encourage you to explore our blog to see how you can contribute to safeguarding the world's largest online businesses.The RoleWe are excited to welcome a Junior Data Scientist to our dynamic team of client-facing data scientists and support analysts. In this pivotal role, you will delve into client data to uncover patterns and trends related to fraud. By applying exploratory analysis, you will create meaningful narratives from data, probing not just “what” and “how” but also “why.” Your interactions with clients will be crucial in helping them understand their fraud landscape and refining our focus on targeted solutions.ResponsibilitiesEngage directly with clients to deliver insights from fraud analytics and machine learning models.Examine large datasets to pinpoint fraud patterns, trends, and emerging threats, enhancing client outcomes.Collaborate with clients to address specific fraud challenges, developing effective and innovative solutions.Compile reports and present analytical findings to clients as needed.Identify new model features and enhance model efficacy.Optimize graph network performance through network analysis.Gain practical experience with our cloud infrastructure and utilize available tools to enhance client data and performance.Develop internal tools in Python to improve our analytical abilities.

Mar 30, 2026

Apply

Data Scientist at Faculty | London

Faculty

Full-time|On-site|London

Join our dynamic team at Faculty as a Data Scientist, where your analytical prowess will help drive impactful decisions across various sectors. In this role, you'll leverage advanced statistical techniques and machine learning algorithms to extract insights from complex datasets, delivering actionable recommendations to stakeholders. Collaborate with cross-functional teams to innovate and optimize data-driven solutions that enhance our clients' operational efficiency.

Mar 11, 2026

Apply

Data Scientist at Abound | London

Abound

Full-time|On-site|London

Join Abound, a forward-thinking company dedicated to leveraging data to drive innovative solutions. As a Data Scientist, you will play a crucial role in analyzing complex datasets, developing predictive models, and transforming data into actionable insights. Collaborate with cross-functional teams to enhance our products and services, ensuring that data-driven decisions are at the forefront of our strategy.

Mar 9, 2026

Apply

Data Scientist, Evals

Perplexity

Full-time|On-site|London

Join Perplexity, a cutting-edge company serving millions of users each day with high-quality answers powered by an LLM-first search engine and specialized data sources. We strive to leverage the latest models as they become available, navigating the complexities of the intelligence frontier where traditional benchmarks may fall short. In this pivotal role, you will be responsible for creating specialized evaluations aimed at enhancing answer quality across Perplexity, specifically focusing on search-based LLM responses and other user-favored scenarios.ResponsibilitiesDesign and manage automated evaluation pipelines that measure answer quality across Perplexity's products, ensuring adherence to high standards of accuracy and usefulness.Create tailored evaluation datasets and methodologies to assess the influence of tool calls, particularly in web search retrieval, on the quality of final answers.Develop VLM-based solutions to programmatically analyze the visual rendering of final answers across various platforms and devices.Consistently evaluate public benchmarks and academic assessments for their relevance to Perplexity's offerings, adapting and integrating them into our ongoing performance evaluations.Collaborate within a small, high-impact team where your evaluation metrics will directly influence product enhancements, working closely with technical leadership to measure and elevate Answer Quality.QualificationsPhD or MS in a technical discipline or equivalent practical experience.A minimum of 4 years of experience in data science or machine learning.Proficient in Python and SQL, with the ability to write production-quality code.Experience with modern cloud data stacks, particularly AWS and Databricks.Familiarity with agentic coding workflows and utilizing AI-assisted development tools for efficient iteration.Preferred QualificationsAt least 1 year of experience working with LLMs at scale, especially in LLM-as-a-judge configurations.Experience developing customer-facing web products or consumer applications with significant user traffic.A robust research background, demonstrating the application of research methodologies to real-world machine learning challenges.Experience in defining evaluation metrics, such as factual consistency, hallucination rate, and retrieval precision, along with creating ground truth datasets.

Feb 13, 2026

Create account — see all 1,017 results