Senior or Staff Data Scientist at Windfall | San Francisco
Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Senior
Qualifications
About Windfall Data
At Windfall, we believe in the transformative power of data. Our commitment to excellence drives us to provide our partners with the insights they need to make informed decisions. Join us and be part of a team that is reshaping the future of people data.
Similar jobs
Search for Data Scientist Machine Learning At Middesk San Francisco
10,649 results
About MiddeskAt Middesk, we simplify the way businesses connect and collaborate. Since our inception in 2018, we have revolutionized business identity verification by replacing tedious, manual processes with instant access to comprehensive, up-to-date information. Our innovative platform empowers companies across various sectors to confidently verify business identities, expedite customer onboarding, and mitigate risks throughout the customer lifecycle.Originating from Y Combinator and backed by notable investors such as Sequoia Capital and Accel Partners, Middesk has recently earned a spot on the Forbes Fintech 50 List and has been recognized as a leader in business verification by digital identity strategy firm Liminal.The RoleWe are on a mission to develop AI-driven applications that enhance customer workflows, particularly in the realm of business onboarding. Leveraging our proprietary identity data and extensive domain expertise, we are uniquely positioned to broaden our suite of AI-powered solutions that fuel sustainable growth.We are seeking a hands-on applied Machine Learning expert to establish the technical foundation for these initiatives. The ideal candidate will have experience deploying external-facing models in the risk and fraud sectors and will be familiar with the complexities of imbalanced data, limited labels, and evolving behaviors. This is a highly technical and influential role that will shape our ML design, development, and scaling efforts at Middesk.What You'll Do:Develop risk and fraud ML applications: Deliver production ML models in fraud detection, trust and safety, Know Your Business (KYB), and compliance, with a measurable influence on customer workflows.Address challenging data issues: Work on classification tasks characterized by extreme class imbalance, sparse signals, and “cold start” label challenges.Innovate in feature engineering and labeling: Implement graph-based methodologies, weak supervision, LLMs, and AI agents to enhance signal extraction and automate the labeling process.Establish foundational ML infrastructure: Collaborate with the platform engineering team to design feature services, model training pipelines, model serving standards, and orchestration to scale multiple ML applications.What We’re Looking For:A minimum of 7 years of applied ML experience, with demonstrable impact in risk, fraud, trust and safety, compliance, or related high-stakes domains.A proven record of deploying ML models from research to production in client-facing products.Expertise in classification challenges, such as imbalanced labels, sparse signals, and cold start issues.
Join middesk as a Software Engineer specializing in our Data Platform, where you will play a pivotal role in building robust systems that empower our data-driven initiatives. You will collaborate with cross-functional teams to design, implement, and optimize data solutions that enhance our products and services.
Middesk
Join Middesk as a Machine Learning Engineer and contribute to cutting-edge projects that leverage machine learning to drive business insights. You will collaborate with a dedicated team of data scientists and engineers, developing algorithms and models that enhance our product offerings and improve user experience.
About MiddeskMiddesk is revolutionizing the way businesses collaborate by providing seamless business identity verification. Since our inception in 2018, we have replaced cumbersome manual processes with instant access to accurate and current data. Our platform empowers companies across various sectors to confidently verify business identities, accelerate customer onboarding, and mitigate risks throughout the customer journey.As a proud graduate of Y Combinator and backed by esteemed investors such as Sequoia Capital and Accel Partners, Middesk has been recognized in the Forbes Fintech 50 List and acknowledged as a leading authority in business verification by digital identity strategy firm, Liminal.About Middesk Engineering:At Middesk Engineering, we prioritize
About MiddeskAt Middesk, we simplify the way businesses collaborate by streamlining business identity verification. Since our inception in 2018, we have been dedicated to transforming cumbersome, manual processes into efficient access to comprehensive, up-to-date data. Our innovative platform empowers companies across various sectors to confidently verify business identities, expedite customer onboarding, and minimize risk throughout the customer lifecycle.Originating from Y Combinator and backed by prestigious investors like Sequoia Capital and Accel Partners, Middesk has garnered recognition as an industry leader in business verification, being named to the Forbes Fintech 50 List and acknowledged by Liminal, a leading digital identity strategy firm.The Role:As a Senior Product Designer at Middesk, you will be instrumental in aiding businesses to maintain robust compliance standards and navigate risks with assurance. Your role involves crafting intuitive user experiences that equip customers to make informed compliance and risk decisions in highly regulated environments. Collaborating closely with cross-functional Product and Engineering teams, you will oversee the entire design process from initial discovery to final delivery, showcasing your expertise in design craftsmanship, systems thinking, and an inquisitive approach at every stage.We embrace a hybrid work model, anticipating 2 days per week in our San Francisco office. Ideal candidates should reside within a commutable distance, as we value the benefits of in-person collaboration and fostering strong team dynamics while accommodating flexibility whenever feasible.
Jobs for Humanity
Join our dynamic team at Jobs for Humanity as a Machine Learning Data Scientist, where you will harness the power of data to drive innovative solutions for underserved communities. Your expertise will play a crucial role in developing algorithms and models that enhance accessibility and improve lives.As a key member of our team, you will collaborate with cross-functional teams to identify opportunities for leveraging data to create impactful products. If you are passionate about using your data science skills for a greater good, we want to hear from you!
SoFi Technologies, Inc.
Join SoFi, a leading personal finance company, as a Data Scientist where you will leverage data to drive strategic decisions and enhance user experiences. In this role, you will analyze complex datasets, build predictive models, and collaborate with cross-functional teams to propel our innovative solutions forward.
About SesameAt Sesame, we envision a future where computers possess lifelike qualities, enabling them to see, hear, and interact with us in natural and human-like ways. We are dedicated to creating innovative voice agents that seamlessly integrate into our daily lives. Our talented team comprises founders from Oculus and Ubiquity6, along with seasoned professionals from Meta, Google, and Apple, boasting extensive expertise in both hardware and software development. Join us as we redefine the boundaries of technology and create a world where computers come to life.About the RoleAs a Machine Learning Scientist at Sesame, you will play a pivotal role in advancing our product goals through innovative research. We seek a detail-oriented individual with a strong background in Natural Language Processing (NLP), Speech Recognition, and/or Computer Vision, particularly with a focus on deep learning methodologies. You will stay abreast of the latest research and leverage your creativity and intuition to devise novel solutions tailored to our unique applications.Responsibilities:Contribute to the design and enhancement of machine learning models across diverse modalities.Engage with the complete ML stack, including model architecture, data curation, evaluation metrics, and training and inference infrastructure, while conducting research and experimentation.Identify and adopt promising techniques from existing literature, while innovating new methods as necessary to meet our distinct objectives.Required Qualifications:Demonstrated ability to work independently in environments characterized by high ambiguity.Published research in NLP, Speech Recognition, or Computer Vision focused on large-scale deep learning projects.Proficient understanding of cutting-edge advancements in artificial intelligence.Bachelor’s degree or higher in Computer Science or a related field.Preferred Qualifications:Master’s or PhD degree is preferred.Experience working on product development.Familiarity with startup environments.Sesame is committed to fostering a workplace that values, respects, and empowers everyone. We welcome applicants from diverse backgrounds, embracing all aspects of identity, race, gender, orientation, and ability. We also provide reasonable accommodations for individuals with disabilities — please contact careers@sesame.com for assistance.
Join our dynamic team at Laurel as a Senior Machine Learning Data Scientist specializing in Analytics. In this pivotal role, you will leverage your expertise in machine learning and data analysis to drive innovative solutions that enhance our decision-making processes. You will collaborate with cross-functional teams to design and implement predictive models, analyze complex datasets, and translate insights into actionable strategies.Your contributions will be key in shaping the future of our analytics framework, enabling us to better serve our clients and stakeholders. If you’re passionate about data science and eager to make a significant impact, we want to hear from you!
Join Metriport as a Data Scientist and be at the forefront of data-driven decision making! In this role, you will leverage advanced analytics and machine learning techniques to derive insights from complex datasets, enhancing our product offerings and driving strategic initiatives.
Join Hilbert, a pioneering data science-driven growth engine that empowers B2C teams with predictive insights into user behaviors, revenue drivers, and sustainable growth strategies. Our innovative approach compresses lengthy decision-making processes into mere minutes.Trusted by Fortune 10 enterprises and beloved brands like FreshDirect, Blank Street, and Levain Bakery, Hilbert is the backbone of their growth strategies. We are also collaborating with leading AI companies to push the boundaries of what’s possible.We are seeking a talented Data Scientist who possesses a deep understanding of B2C business challenges, develops actionable models using real-world data, and delivers impactful analyses that facilitate significant growth outcomes — all with the initiative and urgency typical of a founder.This is not a role where you simply receive tasks; you will take ownership of problems from start to finish — from problem framing and modeling to measuring impact — for enterprise clients where the stakes are high and feedback is rapid. If you understand the nuances of churn analysis for different sectors, can create effective recommendation systems from sparse data, and can clearly communicate your causal analysis to clients, we want to meet you.ROLE OVERVIEWYou will closely collaborate with the founding team, engineering, product, and go-to-market teams to enhance the data science systems integral to Hilbert. Daily responsibilities include building models, conducting experiments, analyzing data, and producing analyses that influence key decisions. Our focus is B2C, and the challenges we tackle — such as demand forecasting, customer lifecycle management, personalization, and activation — require an individual who can translate business contexts into effective modeling choices. You will thrive in a high-autonomy, high-ambiguity environment where data is often messy, incomplete, or scarce.Key Responsibilities:Develop ML models that enhance core product features: recommendation systems, search relevance, customer segmentation, demand forecasting, and activation optimization.Contribute to configurable, multi-tenant model architectures that adapt to various customer contexts and business needs, avoiding the need for custom solutions for each case.Build effective models using available data — leveraging limited, noisy, or sparse datasets while determining the appropriate level of complexity.Design and implement rigorous A/B tests and recognize when causal inference methods are necessary.
Sciforium is an innovative AI infrastructure company specializing in the development of state-of-the-art multimodal AI models, alongside a proprietary high-efficiency serving platform. With substantial financial backing and direct collaboration with AMD, including hands-on support from AMD engineers, our rapidly growing team is dedicated to building the comprehensive stack that powers cutting-edge AI models and real-time applications.Role OverviewWe are on the lookout for a highly skilled and visionary Data Scientist to spearhead the strategy and creation of vast datasets essential for our foundational models. In the realm of Large Language Models (LLMs), we recognize that data is the key competitive advantage. This role will encompass the entire data lifecycle—from extensive web-scale crawling to the meticulous creation of human-aligned datasets that dictate model behavior.The ideal candidate will embrace data as both a large-scale engineering challenge and a complex analytical puzzle. Your responsibilities will extend beyond simply delivering data; you will design taxonomies, filtering heuristics, and post-training pipelines to ensure our models excel in reasoning, safety, and multimodal comprehension.Key ResponsibilitiesFoundation Dataset Strategy: Oversee the comprehensive creation of pre-training datasets for LLMs, defining the optimal mix of web data, code, literature, and technical documents to enhance downstream model performance.Petabyte-Scale Curation: Innovate and implement advanced pipelines for data cleaning, deduplication (exact and fuzzy), and high-quality signal extraction from vast amounts of unstructured data.Post-Training & Alignment Data: Direct the creation of high-quality post-training datasets, including Supervised Fine-Tuning (SFT) instructions, multi-turn dialogues, and preference modeling data (RLHF/DPO).Multimodal Expansion: Lead the acquisition and processing of vision and video data, addressing the challenges of multimodal alignment, video compression, and temporal data consistency.High-Performance Engineering: Create high-throughput data processing scripts utilizing Python, employing multiprocessing and multithreading to manage large-scale ingestion and transformation without performance bottlenecks.Data Profiling & Analysis: Perform in-depth statistical analysis on training datasets to uncover biases, knowledge gaps, and quality regressions, ensuring a mathematically balanced model diet.Synthetic Data Generation: (Added Value) Develop pipelines to generate high-quality synthetic datasets that enhance model training and capabilities.
Join our dynamic team at tempo-xyz as a Data Scientist, where you'll harness the power of data to drive impactful insights and strategic decision-making. You'll collaborate with cross-functional teams to analyze complex datasets, develop predictive models, and communicate findings to stakeholders. This is an excellent opportunity for innovative thinkers eager to make a difference through data-driven solutions.
Join Altos Labs as a Machine Learning Scientist or Senior Machine Learning Scientist and be at the forefront of innovative research in the virtual cell space. Our team is dedicated to advancing the field of machine learning and its applications in biological research.
Join Hilberts as a Machine Learning Engineer / Data Scientist in our San Francisco office, where you will leverage cutting-edge technology to drive enterprise-level solutions. You will work collaboratively with cross-functional teams to design, develop, and implement machine learning models that enhance our data-driven decision-making processes.
About TacitTacit is an innovative, early-stage deep tech startup located in San Francisco, pioneering advanced hardware solutions that revolutionize human-computer interaction. With prestigious backing from General Catalyst, Khosla Ventures, and Greylock Partners, our founding team comprises experts from Stanford, BrainGate, Oculus, and Tesla. While we are still in stealth mode, our team is passionately addressing avant-garde engineering challenges to create groundbreaking products.About the RoleAs a Data Scientist Intern, you will join our Machine Learning and Research team, focusing on the development of state-of-the-art tools and methodologies for data processing and analysis. Your responsibilities will include designing, executing, and analyzing experiments using our unique systems, interpreting results to guide future data collection, hardware design, and enhancing our decoding algorithms. This role will involve hands-on experience with our custom hardware signals and examining performance scaling across diverse user groups.What You'll DoDesign, implement, and test algorithms for extracting features from bio-signals to enhance machine learning models.Rapidly iterate on experiments to evaluate new hardware prototypes and improve data collection methods.Develop visualization tools and establish metrics for data quality assessment.Collaborate with hardware and electrical engineers to optimize the sensing stack.Work alongside the ML team on data augmentation, multimodal training, and synthetic data experiments to enhance generalization.RequirementsCurrently enrolled in or recently graduated from a PhD program in neuroscience, biomedical engineering, computer science, applied mathematics, or a related discipline.Excellent communication and writing skills.Proven ability to collaborate across multidisciplinary teams and adapt quickly to new domains.Proficiency in Python; experience with PyTorch is a plus.Familiarity with dimensionality reduction techniques and time series data analysis.Strong independent work ethic, flexibility, and resourcefulness.Thrives in a fast-paced startup environment and is eager to contribute independently.Preferred QualificationsExperience with machine learning frameworks and data science tools.
Join Windfall, a pioneering people data and AI company, where data science is at the heart of our mission. We are dedicated to transforming how organizations understand and leverage people data by equipping leading commercial and nonprofit partners with precise, actionable predictive models.We are seeking a product-focused Senior or Staff Data Scientist to be part of our Modeling & AI team. In this pivotal role, you will directly influence customer success by developing custom propensity models that tackle our clients' most pressing challenges. Collaborating closely with customer data and Windfall's extensive data resources, you will craft innovative solutions and create comprehensive modeling pipelines that convert data into tangible value.This position offers a unique chance to witness your contributions in action, working hand-in-hand with our clients to develop a range of models, from sophisticated marketing and lead scoring to donation likelihood models that enhance the missions of the world's leading nonprofit organizations.
Join Scribd Inc., a leading digital library and subscription service, as a Data Scientist II. In this role, you will leverage data analytics to drive insights and support data-driven decision making across the organization. Collaborate with cross-functional teams to implement machine learning models and enhance our user experience.
Join Us as a Founding Data Scientist and Machine Learning EngineerAmplify Your ImpactYou have achieved remarkable milestones in your career—delivering impactful models, influencing key metrics, and showcasing the transformative potential of data science and machine learning. You have positively affected products that touch millions of lives.Now, envision the possibility of enhancing the entire app ecosystem by extending your influence across numerous products and companies, making every app in users’ pockets smarter, more engaging, and indispensable.Your expertise can empower product teams to innovate faster, captivate users, and drive revenue growth, thanks to the intelligence you develop once and deploy universally.We share this ambition; we have successfully achieved it multiple times at leading organizations like Uber, Apple, Meta, Google, and Chime. Our contributions have generated tens of billions of dollars in impacts for essential products relied on by billions, and we are poised to elevate our influence further.If this resonates with the journey you seek, we invite you to continue reading.Our MissionDashboards recount the past; teams require insights for their next move. Palladio AI serves as the intelligence layer between raw data and decisive action, illuminating product opportunities that translate into genuine growth levers and guiding actions so product teams can iterate with confidence and speed rather than wade through noise.Your RoleYou will be part of a team crafting foundational systems in behavioral modeling, causal inference, forecasting, agentic platforms, and beyond. Your contributions will extend these domains: developing ML and AI models to identify and highlight product opportunities, deploying learning loops that enhance with each release. In essence, you will convert fundamental data science principles into a scalable product across various industries.Beyond technical challenges, you will create a platform that aids real people in making informed decisions, transforming data into clarity and clarity into actionable progress.Your ProfilePassion for Craft and Excellence. You dive into complex datasets, prototype swiftly, and refine until insights shine.Impact-Driven Mindset. 6+ years of experience in production ML/DS; you harmonize scientific rigor with a practical approach—“it ships today, iteration follows.”
About MiddeskMiddesk simplifies collaboration for businesses by transforming identity verification processes. Since our inception in 2018, we have replaced outdated, manual methods with an efficient platform that provides seamless access to comprehensive and current data. Our services enable companies across various sectors to confidently verify business identities, accelerate customer onboarding, and mitigate risks throughout the customer lifecycle.As a proud Y Combinator graduate, Middesk is supported by notable investors such as Sequoia Capital and Accel Partners. We have recently been featured on Forbes’ Fintech 50 List and recognized as a leader in business verification by the digital identity strategy firm, Liminal.About the Middesk Engineering Team:At Middesk, we prioritize delivering value to our customers through a concept we call 'Velocity'. This term embodies our commitment to achieving meaningful outcomes rather than merely focusing on code delivery speed. We believe that exceptional products arise from a blend of technical expertise and a profound understanding of our customers’ needs. Our engineering team is composed of humble, self-driven individuals who are dedicated to addressing even the most complex challenges faced by our clients. At Middesk Engineering, our mission is to put customers first.Your Role:We are seeking a talented Infrastructure Engineer to join our DevSecOps team. Your mission will be to empower engineering teams by providing secure, cost-effective, and scalable platform capabilities that enhance software delivery, improve developer experience, and ensure compliance with industry standards. You will be responsible for developing the tools and infrastructure necessary to scale our development and production systems. Your contributions will directly impact the entire Software Development Lifecycle and overall developer experience (DevEx). The systems you will support include Kubernetes, cloud infrastructure, observability, and local development environments.Our work environment is hybrid, requiring a presence in our San Francisco or New York City offices for 2 days each week. Candidates must reside within a reasonable commuting distance as we value in-person collaboration while also supporting flexible work arrangements.Key Responsibilities:Architect, build, and scale cloud infrastructure and orchestration systems (e.g., Kubernetes, Terraform, CI/CD).Take ownership of and enhance developer experience (DevEx) tools and workflows, spanning from local development to deployment.Develop observability systems that offer insights into performance, reliability, and usage metrics.
Sign in to browse more jobs
Create account — see all 10,649 results

