Data Scientist Internship At Tacit San Francisco jobs in San Francisco – Browse 10,589 openings on RoboApply Jobs

Data Scientist Internship At Tacit San Francisco jobs in San Francisco

Open roles matching “Data Scientist Internship At Tacit San Francisco” with location signals for San Francisco. 10,589 active listings on RoboApply Jobs.

10,589 jobs found

1 - 20 of 10,589 Jobs
Apply
companyTacit logo
Internship|On-site|San Francisco

About TacitTacit is an innovative, early-stage deep tech startup located in San Francisco, pioneering advanced hardware solutions that revolutionize human-computer interaction. With prestigious backing from General Catalyst, Khosla Ventures, and Greylock Partners, our founding team comprises experts from Stanford, BrainGate, Oculus, and Tesla. While we are still in stealth mode, our team is passionately addressing avant-garde engineering challenges to create groundbreaking products.About the RoleAs a Data Scientist Intern, you will join our Machine Learning and Research team, focusing on the development of state-of-the-art tools and methodologies for data processing and analysis. Your responsibilities will include designing, executing, and analyzing experiments using our unique systems, interpreting results to guide future data collection, hardware design, and enhancing our decoding algorithms. This role will involve hands-on experience with our custom hardware signals and examining performance scaling across diverse user groups.What You'll DoDesign, implement, and test algorithms for extracting features from bio-signals to enhance machine learning models.Rapidly iterate on experiments to evaluate new hardware prototypes and improve data collection methods.Develop visualization tools and establish metrics for data quality assessment.Collaborate with hardware and electrical engineers to optimize the sensing stack.Work alongside the ML team on data augmentation, multimodal training, and synthetic data experiments to enhance generalization.RequirementsCurrently enrolled in or recently graduated from a PhD program in neuroscience, biomedical engineering, computer science, applied mathematics, or a related discipline.Excellent communication and writing skills.Proven ability to collaborate across multidisciplinary teams and adapt quickly to new domains.Proficiency in Python; experience with PyTorch is a plus.Familiarity with dimensionality reduction techniques and time series data analysis.Strong independent work ethic, flexibility, and resourcefulness.Thrives in a fast-paced startup environment and is eager to contribute independently.Preferred QualificationsExperience with machine learning frameworks and data science tools.

Apr 9, 2026
Apply
companyTacit logo
Full-time|$120K/yr - $180K/yr|On-site|San Francisco

About Tacit Tacit is an early-stage deep tech startup based in San Francisco, focused on hardware that changes how people interact with computers. Backed by General Catalyst, Khosla Ventures, and Greylock Partners, the team brings experience from Stanford, BrainGate, Oculus, and Tesla. The company is committed to solving complex engineering problems to deliver new products that push the boundaries of human-computer interaction. Role Overview The Full-Stack Software Engineer will help turn ambitious ideas into real products, working across Tacit's software stack. This role involves taking features from initial concept through to deployment and shaping user experiences that support the company’s mission. What You Will Do Develop the internal product stack to improve workflows and prototypes across devices and companion clients. Build and expand internal applications for demos and data collection, supporting idea testing and user experience validation. Prototype new features for human-computer interaction, iterate with test users, and refine until the experience feels seamless. Design and run product experiments, including A/B tests and feedback loops, to quickly gather insights and guide product direction. Streamline data collection and participant experiences, including setup flows, session reliability, metadata capture, monitoring, and labeling. Develop demo features that showcase real-time sensing and inference in engaging, reliable ways. Find and implement integrations that improve team workflows and efficiency. Create and improve automated testing and release systems, such as CI/CD pipelines, smoke tests, and regression checks, to support frequent updates. Work closely with machine learning, hardware, and industrial design teams to define requirements, deliver results, troubleshoot, optimize, and ensure reliability. Qualifications Strong skills in Python (FastAPI, Pydantic; experience with pandas or numpy) and modern TypeScript/React. Experience building production backends (APIs, data models, reliability, observability) and delivering features for end users. History of shipping multiple products from start to finish.

Apr 15, 2026
Apply
companyTacit logo
Internship|$40/hr - $60/hr|On-site|San Francisco

About TacitTacit is an innovative deep tech startup located in San Francisco, dedicated to transforming human-computer interaction through groundbreaking hardware solutions. Our venture is supported by prominent investors such as General Catalyst, Khosla Ventures, and Greylock Partners, with a founding team comprising experts from Stanford, BrainGate, Oculus, and Tesla. We are on a mission to tackle pioneering engineering challenges to bring visionary products to life.About the RoleWe are looking for a passionate PhD student to join us as an Antenna Engineering Intern. In this role, you will collaborate closely with our talented engineers and researchers to design, simulate, analyze, and validate advanced antenna systems for our wireless sensing hardware. This internship offers invaluable hands-on experience throughout the entire product lifecycle, emphasizing design optimization and performance reliability. You will have the unique opportunity to work on impactful problems, contributing directly to 0-to-1 product development in a vibrant startup environment.Key ResponsibilitiesDesign and evaluate antennas for specific wireless channel link budgets, considering various solutions and trade-offs between key performance indicators (KPIs).Conduct computational electromagnetic simulations and develop new models utilizing full-wave electromagnetic software tools.Analyze and interpret simulation results, providing insights and recommendations for design enhancements and performance optimization.Create mock-ups for proof-of-concept prototypes and test vehicles, build testing fixtures and platforms, and develop solutions to automate antenna system development and data analysis workflows.Support the PCB layout design process and DFM reviews, ensuring readiness for mass production with vendors and external manufacturing partners.Collaborate with cross-functional teams to integrate antennas into complete device form factors.Conduct antenna characterization to ensure integrated antennas meet specifications in free space and on-body, from prototype to mass production.Assist in sensor calibration to minimize device-to-device variability and maintain reliable performance under various operating conditions and user scenarios.Optimize hardware systems for low-noise and low-interference, high-resolution signal detection.

Mar 26, 2026
Apply
companyTacit logo
Intern|On-site|San Francisco

About TacitTacit is a pioneering deep tech startup located in San Francisco, focused on creating groundbreaking hardware that transforms human-computer interaction. Supported by notable investors such as General Catalyst, Khosla Ventures, and Greylock Partners, our founding team comprises experts from Stanford, BrainGate, Oculus, and Tesla. While we are unable to disclose extensive details at this moment, our team is passionately addressing advanced engineering challenges to develop revolutionary products.About the RoleWe are on the lookout for a driven Electrical Engineering Intern to assist in the development and scaling of test infrastructure for our innovative sensing technology. This internship will provide you with hands-on experience across hardware, firmware, and automation systems, emphasizing the enhancement of test coverage and product reliability. You will have the opportunity to engage in collaborative learning and actively participate in product development within a vibrant startup atmosphere.What You'll DoAutomate lab equipment and validation workflowsDesign and construct custom PCBs for test jigsContribute to embedded firmware to support test hooks and diagnosticsDevelop a continuous integration (CI) test rack to automatically validate firmware releases and sensor performanceAssist in sensor characterization and calibration to minimize device-to-device variabilityDocument and effectively communicate engineering findings and recommendations.Ideal Candidates May HaveProficiency in Python, PCB Design (Altium), Embedded C/C++A strong independent work ethic, adaptability, and resourcefulness.Excellent communication and teamwork abilities.Comfort in a fast-paced startup environment, with a passion for independent building.CompensationHourly rate determined by your individual skills and experience.

Mar 20, 2026
Apply
companyTacit logo
Intern|On-site|San Francisco

About TacitTacit is an innovative deep tech startup located in the heart of San Francisco, pioneering hardware solutions that redefine human-computer interaction. Supported by prominent investors such as General Catalyst, Khosla Ventures, and Greylock Partners, our founding team comprises experts from esteemed organizations like Stanford, BrainGate, Oculus, and Tesla. While our groundbreaking products are still under wraps, our team is dedicated to tackling formidable engineering challenges to revolutionize the tech landscape.About the RoleWe are looking for an enthusiastic Software Engineering Intern to contribute to the development of the next generation of human-computer interfaces. This internship offers invaluable hands-on experience in both front-end and back-end development, focusing on creating user-centric applications and aiding internal research processes. You will collaborate closely with a talented team of engineers, designers, and researchers to bring innovative interaction paradigms to fruition.Key ResponsibilitiesDesign and implement applications for web, mobile, and embedded clients that interface with our hardware.Build backend services for real-time interaction and seamless data streaming.Develop internal tools for data labeling, visualization, and debugging purposes.Rapidly prototype new user experiences utilizing novel input methods.Collaborate with full-time engineers to deploy stable, high-performance software solutions.Document engineering insights and articulate design decisions effectively.Preferred QualificationsProficiency in Python, TypeScript, React, Linux, and command-line development.Demonstrated ability to work independently, with a strong sense of flexibility and resourcefulness.Excellent communication and teamwork skills.Ability to thrive in a fast-paced startup environment, with a keen interest in independent project development.CompensationHourly compensation will be determined based on your individual skills and experience.

Mar 20, 2026
Apply
companyTacit logo
Full-time|$180K/yr - $200K/yr|On-site|San Francisco

About TacitTacit is an innovative, early-stage deep tech startup located in San Francisco, dedicated to revolutionizing human-computer interaction through cutting-edge hardware development. Our team, which includes experts from Stanford, BrainGate, Oculus, and Tesla, is backed by prominent investors like General Catalyst, Khosla Ventures, and Greylock Partners. We are committed to addressing complex engineering challenges to create groundbreaking products that will redefine the tech landscape.About the RoleWe are in search of a Patent Agent to spearhead and execute our patent prosecution strategy. This position is well-suited for an individual with substantial experience at a prestigious law firm who is eager to operate with greater independence while closely collaborating with our technical teams to safeguard and expand our sophisticated intellectual property portfolio.What You'll DoLead patent prosecution activities from invention disclosure to issuance.Draft, file, and prosecute U.S. and international patent applications with minimal oversight.Consult with internal stakeholders regarding patentability, freedom-to-operate, and portfolio strategies.Work directly with engineers, researchers, and leadership to develop inventions.Manage external counsel as necessary and review their work to ensure quality and strategic alignment.Contribute to the formulation of a long-term IP strategy that aligns with the company's product and research roadmap.Desired QualificationsA minimum of 4 years of experience as a Patent Agent in a top-tier law firm.A solid background in electrical engineering or a closely related technical discipline.An advanced technical degree (Master’s or PhD strongly preferred).Registered to practice before the USPTO.Proven ability to manage patent prosecution autonomously.Exceptional communication skills, capable of translating complex technical concepts into robust patent claims.CompensationThe compensation range for this role is between $180,000 and $200,000 per year.BenefitsCompetitive equity compensation package.Comprehensive health, vision, and dental insurance.A collaborative work environment with a company size of 20-30 people.Visa sponsorship available.Unlimited paid time off (PTO).

Mar 20, 2026
Apply
companyTacit logo
Full-time|$160K/yr - $180K/yr|On-site|San Francisco

About Tacit Tacit is a deep tech startup based in San Francisco, focused on advancing human-computer interaction through new hardware. The company is backed by investors including General Catalyst, Khosla Ventures, and Greylock Partners. The founding team brings experience from Stanford, BrainGate, Oculus, and Tesla. While much of the work is confidential, the team tackles ambitious engineering problems to deliver novel products. Role Overview The Computational Neuroscientist will help develop new approaches for data analysis, improve signal extraction, and strengthen Tacit's technology infrastructure. The role involves running and interpreting experiments on proprietary systems, drawing insights to guide future data collection, and refining decoding algorithms. Work includes direct interaction with signals from custom hardware and studying how performance scales across different user groups. This is a full-time, on-site position in San Francisco (5 days per week). What You Will Do Design, implement, and evaluate algorithms to extract features from biosignals for machine learning use. Iterate quickly on experiments to test new hardware prototypes and improve data collection methods. Develop analytical tools for visualizing and assessing data quality. Work closely with hardware and electrical engineers to improve the sensing stack. Requirements PhD in neuroscience, biomedical engineering, machine learning, computer science, or a related field (or equivalent industry experience). Skilled in Python and PyTorch. Experience collaborating across diverse teams and adapting to new technical domains. Background in dimensionality reduction and time series data analysis. Self-driven, adaptable, and resourceful approach to work. Strong communication skills and a collaborative mindset. Comfort working in a startup setting and able to contribute independently. Preferred Qualifications Experience with real-time human-machine interaction technologies, such as automatic speech recognition or closed-loop systems.

Apr 16, 2026
Apply
companyTacit logo
Full-time|On-site|San Francisco

Join Us at TacitAt Tacit, we are a pioneering deep tech startup located in the heart of San Francisco, dedicated to revolutionizing human-computer interaction through innovative hardware solutions. With the backing of esteemed investors such as General Catalyst, Khosla Ventures, and Greylock Partners, our founding team boasts expertise from industry leaders like Stanford, BrainGate, Oculus, and Tesla. Though we are bound by confidentiality regarding specifics, we are passionately engaged in solving advanced engineering challenges to bring groundbreaking products to market.Role OverviewWe are on the lookout for enthusiastic individuals to contribute to the development of our state-of-the-art technology. At Tacit, we emphasize hands-on experience, collaborative learning, and direct engagement in product innovation within an energetic startup atmosphere.Ideal Candidate ProfileA self-starter with a flexible mindset and the ability to resourcefully navigate challenges.Strong communication and teamwork skills.Thrives in a fast-paced startup environment, eager to work independently and contribute to the team.Compensation DetailsWe offer a competitive salary that reflects the role and experience, accompanied by a generous equity package for full-time employees.BenefitsCompetitive equity optionsComprehensive health, vision, and dental coverageA close-knit team of 20-30 professionalsVisa sponsorship availableUnlimited paid time off (PTO)3% 401(k) matching

Mar 20, 2026
Apply
companyTacit logo
Full-time|$150K/yr - $200K/yr|On-site|San Francisco

About TacitTacit is an innovative, early-stage deep tech startup located in San Francisco, focused on developing groundbreaking hardware to redefine human-computer interactions. With support from prominent investors such as General Catalyst, Khosla Ventures, and Greylock Partners, our founding team brings expertise from renowned institutions including Stanford, BrainGate, Oculus, and Tesla. While we are not ready to disclose our projects fully, we are committed to solving complex engineering challenges to launch revolutionary products.Position OverviewWe are seeking a talented Senior Electrical Engineer to lead the architecture, design, and deployment of next-generation neurotechnology hardware products. As a pivotal member of our expanding hardware team, you will take ownership of significant components of our electrical systems from initial design stages to mass production, playing a crucial role in the realization of sophisticated consumer electronics.Key ResponsibilitiesElectrical System Design & DevelopmentLead the electrical architecture and board-level design for mixed-signal consumer hardware systems from concept to production.Oversee schematic design and guide the development process through EVT, DVT, PVT, and mass production stages.Define system partitioning, interfaces, and component selection across computing, sensors, connectivity, and power subsystems.Conduct rigorous design reviews and establish robust electrical design and manufacturing standards.Collaborate closely with mechanical, firmware, and software teams to enhance performance, size, thermal behavior, and reliability.Effectively navigate a startup environment where engineers manage substantial parts of the hardware stack.High-Speed and High-Density Hardware DesignDesign high-density, multi-layer PCBAs employing HDI techniques, fine-pitch BGAs, and controlled-impedance routing.Implement high-speed digital interfaces such as USB, Quad-SPI, MIPI, CSI/DSI, DDR, or similar buses.Create analog and mixed-signal circuits, including amplifiers, filters, sensor interfaces, and ADC/DAC connections.Optimize PCB layout for signal integrity, power integrity, and low-noise performance.Power Electronics & Battery SystemsDesign and implement power electronics and battery management systems for our devices.Ensure compliance with safety and regulatory standards while optimizing performance and efficiency.

Mar 24, 2026
Apply
companySemgrep Inc. logo
Full-time|$125K/yr - $147K/yr|On-site|San Francisco Office

About SemgrepSemgrep stands at the forefront of code security, empowering developers to innovate seamlessly. Our platform enables teams to identify, flag, and resolve genuine security issues before deployment, supported by an adaptive security system that evolves with their development process. Semgrep not only safeguards code in real-time but also offers essential guardrails that facilitate rapid development without compromising security. Trusted by top organizations like Snowflake, Dropbox, and Figma, we are recognized by Gartner for excellence in Application Security Testing. To learn more about our mission, visit semgrep.dev.Founded in San Francisco and backed by prominent investors including Menlo Ventures and Sequoia Capital, Semgrep is dedicated to continuous improvement, ensuring that our AI-driven system minimizes false positives and prioritizes actionable vulnerabilities. Experience the future of secure coding with us.

Feb 19, 2026
Apply
companySoFi Technologies, Inc. logo
Full-time|On-site|CA - San Francisco

Join SoFi, a leading personal finance company, as a Data Scientist where you will leverage data to drive strategic decisions and enhance user experiences. In this role, you will analyze complex datasets, build predictive models, and collaborate with cross-functional teams to propel our innovative solutions forward.

Mar 27, 2026
Apply
companyMercor logo
Full-time|On-site|San Francisco

Join Mercor as a Data ScientistAt Mercor, we stand at the forefront of labor markets and artificial intelligence research. We collaborate with top-tier AI laboratories and businesses to infuse the human intelligence crucial for the evolution of AI.Our expansive talent network empowers frontier AI models, mirroring the way educators impart knowledge: sharing insights and experiences that transcend mere coding. Currently, our network boasts over 30,000 experts generating more than $2 million daily.We are pioneering a new work paradigm where specialized expertise drives AI progress. To realize this vision, we seek a dynamic, fast-paced, and dedicated team. You will collaborate with leading researchers, operators, and AI companies, playing a pivotal role in the systems that are reshaping society.As a profitable Series C company, Mercor is valued at $10 billion and operates from our new headquarters in San Francisco with an in-office work schedule five days a week.Your RoleIn your first year, you will implement analyses and experiments that enhance key product metrics, including match quality, time-to-hire, candidate experience, and revenue. Your responsibilities will include:Establishing north-star and feature-specific metrics for our ranking systems, interview analytics, and payout frameworks.Designing and executing A/B tests and quasi-experiments, translating results into product decisions within the same week.Creating source-of-truth dashboards and streamlined data models to enable teams to self-serve answers.Collaborating with engineers to instrument events, enhancing data quality and latency from ingestion to insights.Rapidly prototyping models (from baseline models to gradient boosting) to optimize matching and scoring.Assisting in the evaluation of LLM-powered agents through the design of rubrics, human-in-the-loop studies, and guardrail mechanisms.What Makes You a Great FitYou possess strong foundational skills in statistics, SQL, and Python, alongside projects you are eager to showcase. You adapt swiftly, frame inquiries, test hypotheses, and deliver results within a day, valuing clarity in communication as much as statistical significance. A keen interest in LLM evaluation, retrieval, and ranking is a plus; you will learn alongside professionals from renowned firms such as Jane Street, Citadel, Databricks, and Stripe.

Aug 30, 2025
Apply
companySuperhuman logo
Full-time|$225K/yr - $275K/yr|Hybrid|San Francisco, CA

Superhuman offers an engaging hybrid working model for this role. This flexible approach provides team members with a balance of focused time and in-person collaboration, fostering trust, innovation, and a vibrant team culture. Team members for this position must reside in the San Francisco Bay Area. About SuperhumanSuperhuman, now part of Grammarly, is an AI productivity platform dedicated to unlocking extraordinary potential in everyone. Our suite of apps and agents seamlessly integrates AI into over 1 million applications and websites, including Grammarly’s writing assistance, Coda’s collaborative workspaces, Mail’s inbox management, and Go, the proactive AI assistant that comprehends context and provides automatic assistance. Founded in 2009, Superhuman empowers more than 40 million individuals, 50,000 organizations, and 3,000 educational institutions worldwide to eliminate busywork and concentrate on what truly matters. Discover more at superhuman.com and explore our values here.The OpportunityTo achieve our ambitious goals, we are seeking skilled Data Scientists to join our Product and Growth Data Science teams. At Superhuman, data teams are regarded as trusted experts who reveal new insights to shape marketing, product, and growth strategies that drive significant outcomes across the organization. We have access to large datasets and are looking for individuals with exceptional technical and analytical abilities who can dissect complex business challenges and offer solutions that deliver high visibility and impactful results for the company.Growth Data Scientists collaborate closely with product, engineering, growth, and/or lifecycle marketing teams. They are tasked with designing and executing product experiments, as well as performing intricate analyses to guide product strategy through advanced analytics and machine learning. The ideal candidate will possess a proven record of delivering significant analytical projects within the Growth domain and working alongside cross-functional colleagues to influence decision-making.Product Data Scientists are responsible for assessing the quality of the Superhuman products, ensuring that our offerings are not only innovative but also user-friendly and effective.

Mar 17, 2026
Apply
companyMixpanel logo
Full-time|Hybrid|San Francisco, US (Hybrid)

Join Mixpanel as a Data Scientist and contribute to our mission of helping businesses make data-driven decisions. In this role, you will analyze complex datasets, develop predictive models, and provide actionable insights to enhance our product offerings.

Apr 2, 2026
Apply
companyMindlance logo
Full-time|On-site|San Francisco

Join Mindlance as a Data Scientist, where you will leverage data to drive insights and support decision-making processes. You will be responsible for analyzing complex datasets, developing predictive models, and collaborating with cross-functional teams to enhance business strategies.

Nov 14, 2016
Apply
companyMetriport logo
Full-time|Remote|San Francisco

Join Metriport as a Data Scientist and be at the forefront of data-driven decision making! In this role, you will leverage advanced analytics and machine learning techniques to derive insights from complex datasets, enhancing our product offerings and driving strategic initiatives.

Mar 20, 2026
Apply
companyAircall logo
Full-time|On-site|San Francisco Office

Join Aircall, the leading integrated customer communications and intelligence platform that empowers growing businesses worldwide. With a trusted network of over 20,000 companies, Aircall seamlessly integrates voice and digital channels into one powerful platform, featuring one-click connections with top CRMs and over 100 essential business tools. Our AI-driven insights and automation capabilities enable sales and support teams to optimize their time, discover new opportunities, and deliver outstanding customer experiences. With a diverse global team of over 600 in cities such as Paris, New York, San Francisco, and more, Aircall is revolutionizing the way businesses engage with their customers, fostering deeper relationships and driving measurable success.Our Work Culture: At Aircall, we prioritize customer satisfaction, continuous learning, and extraordinary results. We encourage open collaboration, ownership, and swift, informed decision-making. If you thrive in a dynamic, team-oriented environment where curiosity, trust, and impact are key, you will fit right in.About Our TeamThe Data team at Aircall is integral to our decision-making process, driving innovation and growth through advanced data solutions, tools, and actionable insights.Role OverviewAs a Senior Data Scientist, you will play a crucial role in providing insights to our product and business teams, collaborating closely with product and engineering leaders. You will support key initiatives such as pricing strategies, packaging, multi-product approaches, and self-service solutions. Additionally, you will work cross-functionally with Engineering, Sales, Finance, Marketing, and Customer Relations to translate data requirements into impactful solutions. Your contributions will also include establishing best practices in analytics and mentoring team members to uphold high standards in data governance and insight generation.

Jul 19, 2024
Apply
company
Full-time|Hybrid|San Francisco

Join Grindr as a Staff Data Scientist in a hybrid role based out of our San Francisco, Los Angeles, or Chicago offices, requiring in-office presence on Tuesdays and Thursdays.Why This Role is Unique?Grindr (NYSE: GRND) is the world’s largest LGBTQ+ social application, boasting over 14 million monthly users globally. We are not just a platform but a vital part of the LGBTQ+ community and a cornerstone of gay culture.As a Staff Data Scientist, you will collaborate closely with product managers, designers, and engineers to create insightful metrics that drive product development. You will design and implement innovative experiments, present data-driven insights for decision-making, and explore new growth strategies through comprehensive analysis. This role allows you to work on deployed models that enhance the user experience for millions, while becoming an informal ambassador for the Data Science team, educating others on effective data utilization.You will be part of a dynamic data organization at Grindr that integrates data scientists, data engineers, and ML/AI engineers into a united and collaborative team. This is a unique opportunity to learn, share knowledge, and make a significant impact alongside industry leaders.Your ResponsibilitiesExtract actionable insights from complex, open-ended queries.Design and assess experiments to evaluate the impact of product changes.Analyze product data to identify root causes behind metric fluctuations.Effectively communicate findings to cross-functional stakeholders to inform product strategies.Develop tools to scale and automate analyses, enhancing company productivity.Mentor and guide team members, recommending best practices.Apply an engineering mindset to reduce complexity while maximizing utility and maintainability.Contribute to the development of future ML solutions to enhance recommendations, detect spam, and better serve our users.

May 13, 2025
Apply
companySciforium logo
Full-time|On-site|San Francisco

Sciforium is an innovative AI infrastructure company specializing in the development of state-of-the-art multimodal AI models, alongside a proprietary high-efficiency serving platform. With substantial financial backing and direct collaboration with AMD, including hands-on support from AMD engineers, our rapidly growing team is dedicated to building the comprehensive stack that powers cutting-edge AI models and real-time applications.Role OverviewWe are on the lookout for a highly skilled and visionary Data Scientist to spearhead the strategy and creation of vast datasets essential for our foundational models. In the realm of Large Language Models (LLMs), we recognize that data is the key competitive advantage. This role will encompass the entire data lifecycle—from extensive web-scale crawling to the meticulous creation of human-aligned datasets that dictate model behavior.The ideal candidate will embrace data as both a large-scale engineering challenge and a complex analytical puzzle. Your responsibilities will extend beyond simply delivering data; you will design taxonomies, filtering heuristics, and post-training pipelines to ensure our models excel in reasoning, safety, and multimodal comprehension.Key ResponsibilitiesFoundation Dataset Strategy: Oversee the comprehensive creation of pre-training datasets for LLMs, defining the optimal mix of web data, code, literature, and technical documents to enhance downstream model performance.Petabyte-Scale Curation: Innovate and implement advanced pipelines for data cleaning, deduplication (exact and fuzzy), and high-quality signal extraction from vast amounts of unstructured data.Post-Training & Alignment Data: Direct the creation of high-quality post-training datasets, including Supervised Fine-Tuning (SFT) instructions, multi-turn dialogues, and preference modeling data (RLHF/DPO).Multimodal Expansion: Lead the acquisition and processing of vision and video data, addressing the challenges of multimodal alignment, video compression, and temporal data consistency.High-Performance Engineering: Create high-throughput data processing scripts utilizing Python, employing multiprocessing and multithreading to manage large-scale ingestion and transformation without performance bottlenecks.Data Profiling & Analysis: Perform in-depth statistical analysis on training datasets to uncover biases, knowledge gaps, and quality regressions, ensuring a mathematically balanced model diet.Synthetic Data Generation: (Added Value) Develop pipelines to generate high-quality synthetic datasets that enhance model training and capabilities.

Jan 7, 2026
Apply
company
Full-time|$195K/yr - $275K/yr|Hybrid|Hub - San Francisco

At Superhuman, we embrace a vibrant hybrid working model that combines focused work time with essential in-person collaboration, cultivating trust, innovation, and a robust team culture.Our Data Scientists must be located in San Francisco, Seattle, or New York City.About SuperhumanSuperhuman is an AI productivity platform aiming to unlock the superhuman potential inherent in everyone. With a suite of applications and agents, Superhuman integrates seamlessly with over 1 million applications and websites, delivering advanced tools including Grammarly's writing assistance, Coda's collaborative workspaces, Mail's inbox management, and Go, the proactive AI assistant that responds to context and offers assistance automatically. Since our founding in 2009, we have empowered over 40 million individuals, 50,000 organizations, and 3,000 educational institutions globally to eliminate mundane tasks and focus on what truly matters. Discover more at superhuman.com and explore our values here.The OpportunityTo fulfill our ambitious objectives, we are seeking accomplished Data Scientists to join our Product and Growth Data Science teams. Data teams at Superhuman are trusted experts who reveal new insights to guide marketing, product, and growth strategies that yield significant outcomes for the company. We manage extensive datasets and are looking for individuals with exceptional technical and analytical capabilities to dissect complex business challenges and provide impactful solutions.Growth Data Scientists work closely with product, engineering, growth, and lifecycle marketing teams. They are responsible for designing and executing product experiments and performing intricate analyses to shape product strategies through advanced analytics and machine learning. The ideal candidate will demonstrate a proven history of delivering significant analytical projects in the Growth sector and collaborating effectively with cross-functional peers to influence decision-making.Product Data Scientists focus on assessing the quality of Superhuman products across the board, evaluating everything from the usefulness and accuracy of text suggestions to the engagement of new AI features.

Sep 29, 2025

Sign in to browse more jobs

Create account — see all 10,589 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.