AI Financial Modeling Extern jobs in San Francisco – Browse 4,728 openings on RoboApply Jobs

AI Financial Modeling Extern jobs in San Francisco

Open roles matching “AI Financial Modeling Extern” with location signals for San Francisco. 4,728 active listings on RoboApply Jobs.

F2 AI
Internship|$50/hr|Hybrid|San Francisco

AI Financial Modeling Extern, F2 AI
Location: San Francisco, CA / In-Person or Remote
Commitment: 5+ hours per week | 4 - 12+ weeks
Compensation: $50/hr

About F2 AI
At F2 AI, we are revolutionizing private market investments. Our cutting-edge AI technology streamlines the process of analyzing complex, unstructured deal materials, transforming them into actionable, investment-grade insights in mere minutes. By empowering private credit funds, commercial banks, and private equity firms, we enable faster and more confident capital deployment. Supported by top-tier investors such as NFX and Y Combinator, we are committed to expanding our exceptional product and engineering teams, shaping the future of vertical AI for finance.

Role Overview
We are on the lookout for 1–2 exceptional externs with a strong foundation in Investment Banking or Private Equity to contribute to the development of AI-driven financial modeling on the F2 platform.

In this role, you will collaborate closely with our Engineering, Product, and Design (EPD) teams in the San Francisco office to translate institutional-level financial modeling standards into automated, intelligent workflows. This hands-on experience will allow you to shape the future of AI in financial modeling.

Key Responsibilities
- Educate F2 agents on best practices for financial modeling.
- Create and standardize financial modeling templates optimized for AI execution using a first principles approach.
- Establish formatting, structure, and best practices that align with institutional modeling standards.
- Conduct rigorous quality assurance on AI-generated outputs to guarantee precision that meets investor expectations.
- Test edge cases and assist in identifying potential failures in automated modeling workflows.

Ideal Candidate Profile
- Possess prior experience in Investment Banking, Private Credit, or Private Equity with extensive exposure to financial modeling.
- Demonstrated ability to build and audit complex 3-statement, LBO, or credit models from the ground up.
- Strong understanding of model hygiene, structure, and institutional formatting standards.
- Critical thinker who enjoys analyzing model logic and stress-testing systems.
- Passionate about leveraging AI to enhance financial workflows.

Feb 20, 2026
Slash Financial
Full-time|On-site|San Francisco Office

Slash Financial develops business banking infrastructure tailored for modern companies. The team blends the dependability of traditional banking with features that help businesses operate more efficiently and stay ahead in their markets. Since its start in 2021, Slash Financial has grown quickly, now processing over $10 billion in annual transactions across various industries. Backed by a recent $100M Series C led by Ribbit Capital and supported by Khosla Ventures, Goodwater Capital, NEA, and Y Combinator, the company is focused on expanding its product offerings and scaling operations. The company values in-person teamwork and maintains a hands-on culture at its San Francisco office.

Role overview
As an Applied AI Engineer at Slash Financial, you will join Slash AI Labs, a team dedicated to integrating AI throughout the company’s financial platform. This early-stage role shapes the direction of Slash’s AI capabilities, focusing on building production-ready AI systems within a rapidly evolving fintech environment.

What you will do
- Develop and release full-stack AI features, including prompt engineering, agent orchestration, React interfaces, and API integrations.
- Enhance the agent runtime by improving orchestration, tool execution, MCP servers, graph-driven conversation states, and context compaction.
- Design agent experiences for web, Slack, and API-based user interactions.
- Create internal AI tools such as model evaluation frameworks, prompt management systems, and utilities for LLM observability.
- Work directly with Anthropic and OpenAI APIs, the Vercel AI SDK, and Slash’s custom orchestration layer to address tool-calling and multi-step reasoning challenges.
- Scale and optimize AI infrastructure, including inference routing, model tiering, prompt caching, and streaming on AWS/EKS.

Requirements
- Experience deploying full-stack AI products in production.
- Strong understanding of AI technologies and frameworks.
- Background in orchestration and management of AI models.
- Ability to design applications that function across multiple platforms and interfaces.
- Strong problem-solving skills and comfort working in a changing environment.

Location
This position is based onsite at the San Francisco office.

Apr 22, 2026
Tavus
Full-time|On-site|San Francisco

About Tavus
Tavus is at the forefront of innovation in human computing. Our mission is to develop AI Humans: an advanced interface that bridges the gap between individuals and machines, eliminating the friction found in current technologies. Our state-of-the-art human simulation models empower machines to see, hear, respond, and even exhibit realistic appearances, facilitating genuine, face-to-face interactions. AI Humans integrate the emotional insight of humans with the scalability and dependability of machines, making them reliable agents accessible 24/7, in any language, on our terms.

Imagine having access to an affordable therapist, a personal trainer that fits your schedule, or a team of medical assistants dedicated to providing personalized care for every patient. With Tavus, individuals, enterprises, and developers have the tools to create AI Humans that connect, comprehend, and act with empathy on a large scale.

We are a Series A company supported by esteemed investors such as Sequoia Capital, Y Combinator, and Scale Venture Partners.

Join us in shaping a future where machines and humans genuinely understand one another.

The Position
We are seeking an AI Researcher to join our core AI team and advance the frontiers of multimodal conversational intelligence. If you excel in dynamic environments, enjoy transforming abstract concepts into functional code, and derive motivation from pushing the boundaries of possibility, this role is designed for you.

Your Responsibilities
- Engage in research focusing on Foundational Multimodal Models specifically in the realm of Conversational Avatars (such as Neural Avatars and Talking-Heads).
- Develop models for video, audio, and language sequences utilizing Autoregressive and Predictive Architectures (e.g., V-JEPA) and/or Diffusion methodologies, with a focus on temporal and sequential data rather than static images.
- Collaborate closely with the Applied ML team to implement your research into production systems.
- Remain at the forefront of multimodal learning and assist us in defining what “cutting edge” will mean in the future.

Ideal Candidate Profile
- PhD (or nearing completion) in a relevant field, or equivalent practical research experience.
- Experience in multimodal machine learning, particularly focused on conversational interfaces.

Oct 8, 2025
Descript
Full-time|$171K/yr|On-site|San Francisco, CA

At Descript, we envision a world where video editing is an essential tool for every communicator. Gone are the days of needing multiple monitors and advanced degrees to craft engaging video content. Our platform allows you to edit videos as easily as working with documents and slides, increasingly through the power of AI. We are at the forefront of redefining how videos are recorded and generated, making it more user-friendly and accessible.

We are in search of a dedicated Product Manager to shape the future of AI-driven video editing. You will collaborate closely with a dynamic and collaborative team of skilled PMs, AI researchers, engineers, designers, and marketing professionals. This role offers a unique opportunity to work hands-on with state-of-the-art AI technology and contribute to a product that resonates with users and accelerates in growth.

As a Product Manager, you will lead the AI Research and Enablement roadmap at Descript. This pivotal role lies at the intersection of advanced AI research, robust ML infrastructure, and strategic product development. Your mission will be to ensure our AI capabilities are unparalleled while empowering our product teams to deliver AI-enhanced features that captivate our users.

Feb 4, 2026
Tavus
Full-time|On-site|San Francisco (London/Europe - OK)

Tavus – Multimodal AI Model Optimization Research Engineer

At Tavus, we are pioneering the human aspect of AI technology. Our objective is to make human-AI interactions as seamless and natural as in-person conversations, allowing for a human touch in areas that were once considered unscalable.

We accomplish this through groundbreaking research in multimodal AI, focusing on human-to-human communication modeling (encompassing language, audio, and video) and the development of audio-visual avatar behaviors. Our innovative models drive applications ranging from text-to-video AI avatars to real-time conversational video experiences across sectors such as healthcare, recruitment, sales, and education.

By empowering AI to perceive, listen, and engage with an authentic human-like presence, we are laying the groundwork for the next generation of AI workers, assistants, and companions.

As a Series B company, we are supported by renowned investors, including Sequoia, Y Combinator, and Scale VC. Join us as we shape the future of human-AI interaction.

The Role
We are seeking an accomplished Research Scientist/Engineer with expertise in model optimization to be a vital part of our core AI team.

The ideal candidate thrives in dynamic startup environments, is adept at setting priorities independently, and is open to making calculated decisions. We are moving swiftly and need individuals who can help navigate our path forward.

Your Mission
- Transform state-of-the-art research models into fast, efficient, and production-ready systems through techniques such as sparsification, distillation, and quantization.
- Oversee the optimization lifecycle for critical models: establish metrics, conduct experiments, and evaluate trade-offs among latency, cost, and quality.
- Collaborate closely with researchers and engineers to convert innovative concepts into deployable solutions.

Requirements
- Extensive experience in deep learning with PyTorch.
- Practical experience in model optimization and compression, including knowledge distillation, pruning/sparsification, quantization, and mixed precision.
- Familiarity with efficient architectures such as low-rank adapters.
- Strong grasp of inference performance and GPU/accelerator fundamentals.
- Proficient in Python coding and adherence to best practices in research engineering.
- Experience with large models and datasets in cloud environments.
- Capability to read ML literature, reproduce results, and modify ideas accordingly.

Apr 3, 2026
Postman
Full-time|$218.5K/yr - $350K/yr|On-site|San Francisco, California, United States

Who Are We?
Postman is the premier API platform, empowering over 45 million developers and 500,000 organizations, including 98% of the Fortune 500. We are dedicated to fostering an API-first world by simplifying the entire API lifecycle and enhancing collaboration, enabling users to create superior APIs swiftly.

Headquartered in San Francisco, we have offices in major cities like Boston, New York, Austin, Tokyo, London, and Bangalore, where our journey began. As a privately held entity, we have garnered support from notable investors including Battery Ventures, BOND, Coatue, CRV, Insight Partners, and Nexus Venture Partners. Discover more at postman.com or connect with us on X via @getpostman.

P.S: We highly recommend exploring The "API-First World" graphic novel to gain insights into our vision at Postman.

The Opportunity
As an Applied Scientist focusing on Small Language Models and AI Training, you will spearhead research and development initiatives aimed at crafting efficient, high-performance language models for real-world applications. Collaborating closely with research, engineering, and product teams, you will enhance model training techniques, optimize architectures, and scale AI solutions. Your contributions will play a crucial role in creating AI systems that are safe, interpretable, and impactful across various applications.

Mar 19, 2026
Spellbrush
Full-time|On-site|San Francisco or Tokyo

Overview: Join us at Spellbrush as we innovate in the world of gaming! We are developing an immersive 3D first-person adventure game where an AI companion drives the gameplay experience. Our goal is to create a game that integrates large language models (LLMs) in a way that enhances storytelling and player engagement, moving beyond simple chat interactions.

About Spellbrush
At Spellbrush, we are dedicated to crafting exceptional anime games. As the leading generative AI studio, we are the creative force behind niji・journey. Our mission is clear: to harness AI in bringing vibrant characters to life and to redefine narrative-driven gaming experiences.

Our Project
We have developed an innovative in-house LLM storytelling system that seamlessly integrates AI with narrative and gameplay, offering players a depth of interaction that transcends traditional gaming.

Role Overview
As an integral member of our small but highly skilled team, you will have the opportunity to shape the future of gaming. Collaborate with leading minds in the industry, including the creator of Warudo and a veteran from Google DeepMind behind Project Astra. Your role will afford you substantial creative and research freedom to pioneer LLM-driven storytelling.

Sep 2, 2025
Tavus
Full-time|On-site|San Francisco

Join Our Innovative Team
Tavus is a forward-thinking research laboratory at the forefront of human-computer interaction. We are dedicated to creating AI Humans, a revolutionary interface that bridges the gap between individuals and machines, eliminating the friction commonly encountered in today's technology. Our advanced human simulation models empower machines to see, hear, respond, and even exhibit lifelike appearances, facilitating genuine, face-to-face interactions. By merging human emotional intelligence with machine efficiency, our AI Humans serve as reliable, empathetic agents available around the clock, in any language, tailored to user needs.

Picture a therapist within reach for everyone, a personal trainer that fits seamlessly into your schedule, or a comprehensive team of medical assistants providing focused attention to every patient. With Tavus, individuals and organizations can develop AI Humans to foster connection, understanding, and responsiveness on a grand scale.

As a Series A startup, we are backed by prestigious investors such as Sequoia Capital, Y Combinator, and Scale Venture Partners. Join us in crafting a future where humans and machines communicate effortlessly.

Your Role
We seek an AI Researcher to join our dynamic AI team and explore the frontiers of large language modeling within conversational AI. If you excel in fast-paced startup settings, enjoy experimenting with innovative ideas, and are eager to see your contributions realized in production, you will thrive here.

Your Mission
- Conduct in-depth research on large language modeling and its adaptation for Conversational Avatars (e.g., Neural Avatars, Talking-Heads).
- Create methodologies to effectively model both verbal and non-verbal communication, dynamically adjusting avatar behaviors in real-time.
- Experiment with fine-tuning, adaptation, and conditioning techniques to enhance the expressiveness, control, and task specificity of LLMs.
- Collaborate with the Applied ML team to transition research from prototype to full-scale deployment.
- Stay informed on the latest advancements and contribute to defining the next breakthroughs.

Oct 16, 2025
Crusoe
Full-time|$172.4K/yr - $209K/yr|On-site|San Francisco, CA - US

At Crusoe, we are on a mission to accelerate the convergence of energy and intelligence. We are building a powerful engine that enables individuals to innovate boldly with AI, all while upholding principles of scalability, speed, and sustainability.

Join us in spearheading the AI revolution through sustainable technology. At Crusoe, you will be at the forefront of meaningful innovation, making a significant impact while collaborating with a team dedicated to shaping the future of responsible, transformative cloud infrastructure.

About the Role:
As a Senior Software Engineer on the Model Lifecycle team, you will play a pivotal role in developing a managed platform that supports the entire application development lifecycle, with an emphasis on harnessing the power of Machine Learning models, particularly Large Language Models (LLMs).

Your Responsibilities:
- Design and maintain systems for fine-tuning large foundational models (SFT, PEFT, LoRA, adapters), ensuring multi-node orchestration, checkpointing, failure recovery, and cost-effective scaling.
- Create and manage end-to-end training pipelines for Large Language Models.
- Implement components for distillation and reinforcement learning pipelines, focusing on preference optimization, policy optimization, and reward modeling.
- Develop and sustain the core agent execution infrastructure.
- Implement features for dataset, model, and experiment management, emphasizing versioning, lineage, evaluation, and reproducible fine-tuning.

Collaboration and Impact:
- Collaborate closely with Senior Engineers, Principal Engineers, and various product and platform teams to implement systems abstractions and APIs.
- Engage in technical discussions surrounding training runtimes, scheduling, storage, and overall model lifecycle management.
- Bring 4-5+ years of industry experience, demonstrating a strong track record of successfully leading a diverse portfolio of initiatives.
- Participate in and contribute to the open-source LLM ecosystem.

This position involves taking significant ownership of core system components.

Your Qualifications:
Engineering Fundamentals:
- Bachelor's degree in Computer Science, Engineering, or a related discipline.
- Proven experience in software engineering with a focus on AI models and machine learning.

Feb 9, 2026
Achira
Full-time|On-site|San Francisco Office

Join Achira in shaping the future of deep learning with cutting-edge generative, representational, and simulation models for molecules and materials. Our mission is to create foundational models that render the atomistic universe understandable, predictable, and designable.

Why Choose Achira?
- Be part of an elite, cross-disciplinary team comprising ML researchers, physicists, chemists, and engineers who are redefining atomistic simulation through expansive foundation models.
- Advance the integration of deep learning with the principles of nature, merging generative AI, probabilistic reasoning, and molecular physics.
- Engage in projects at an unparalleled scale, tackling extensive datasets, computational challenges, and ambitious goals.
- Take full ownership of your research journey, from ideation and architecture to training, evaluation, and deployment.
- Flourish in a dynamic culture that values rigor, speed, creativity, and impact over bureaucracy.

Position Overview
As a Generative AI Researcher at Achira, you will contribute to the development of foundation simulation models: large-scale systems designed to learn the structure, dynamics, and energetics of the atomistic realm. These models will unite deep representation learning, generative modeling, and sophisticated simulation techniques.

Your responsibilities will include:
- Crafting and training state-of-the-art deep generative models, including diffusion, autoregressive, flow-based, and latent-variable architectures focused on molecules, materials, and atomic systems.
- Creating expressive representations of molecular and atomistic structures and dynamics utilizing equivariant graph neural networks, geometric transformers, and latent encoders that respect physical symmetries and constraints.
- Innovating advanced sampling and simulation techniques that blend probabilistic inference, deep learning, and reinforcement learning to facilitate efficient exploration and simulation of learned energy landscapes.
- Developing models that comprehend, generate, and simulate the physical world, merging reasoning, simulation, and predictive capabilities.
- Working collaboratively with physicists and chemists to validate models against ab initio, molecular dynamics, and experimental datasets.
- Rapidly prototyping, benchmarking, and iterating, converting research concepts into reusable, scalable model components across Achira’s foundation model suite.

Oct 24, 2025
Spellbrush
Full-time|On-site|San Francisco or Tokyo

Overview: At Spellbrush, we are revolutionizing the gaming experience by crafting a 3D first-person adventure game where an AI companion plays a pivotal role. Imagine MiSide enhanced with large language models, seamlessly integrated into gameplay rather than merely serving as a role-play chatbot.

About Us
At Spellbrush, we are dedicated to creating exceptional anime games, and we proudly stand as a global leader in generative AI. Our flagship project is niji・journey.

Our mission is straightforward: to use AI to animate characters and redefine narrative-driven gaming.

What We’re Creating
We have engineered an innovative in-house LLM storytelling system that merges AI, narrative, and gameplay, transcending the limitations of conventional chat-only encounters.

This results in an AI companion that collaborates with players in solving puzzles, retains memories across different worlds, and alters the progression of each chapter.

About The Role
Join our elite team to redefine video game experiences. Collaborate with leading minds in the industry, including the creator of Warudo and Cytoid, as well as a Google DeepMind veteran behind Project Astra, and top-tier AI researchers.

As an integral early member of this team, you will enjoy significant artistic and research autonomy in shaping what could be the next era of LLM-driven storytelling.

Sep 2, 2025
Heidi
Full-time|On-site|San Francisco

About Us
Welcome to Heidi! Recognized as "the AI startup growing faster than Canva" by the Financial Review, we've made significant strides in just 18 months, supporting over 73 million patient visits globally. Our transformation from broad healthcare AI to the world’s leading AI Care Partner has allowed us to facilitate over 2 million patient interactions weekly in 116 countries and across more than 110 languages. Our innovative medical scribe service has become a favorite among clinicians, making documentation seamless and efficient.

At the core of our mission is a commitment to enhancing the human connection in healthcare. By empowering clinicians to delegate documentation tasks to Heidi, we not only improve patient engagement but also alleviate bottlenecks in health systems, allowing healthcare providers to focus on what they do best.

Now, we need your expertise!

Role Overview
We are in search of a proactive Product Manager to lead our AI models platform. This role is crucial in ensuring that our product teams can operate effectively and efficiently.

You will manage the models platform, overseeing evaluation pipelines, fine-tuning infrastructure, model routing, and safety systems. Collaborating closely with engineers, researchers, product managers, and clinical safety teams, you will anticipate needs and drive the development of user-facing products that depend on your team’s innovations.

This position is available in either our Sydney or Melbourne office.

We value potential over pedigree; whether you're an experienced professional or a passionate newcomer, we want to hear from you. If you’re an engineer eager to transition into product management, this could be the perfect opportunity.

Mar 16, 2026
Crusoe
Full-time|Remote|San Francisco, CA - US

Crusoe seeks a Senior Director of AI Model Lifecycle to lead strategy and daily operations for managing AI models in San Francisco, CA. This position plays a central role in shaping how AI models are created, deployed, maintained, and eventually retired. The focus remains on ensuring technical efforts align closely with business priorities.

Key responsibilities
- Define and guide the overall strategy for managing the full lifecycle of AI models
- Supervise the development, deployment, and ongoing maintenance of AI models
- Align AI model initiatives with Crusoe’s broader business objectives
- Identify and implement improvements that increase the impact of AI-driven projects

Location
This position is based in San Francisco, CA.

Apr 24, 2026
Earnest
Full-time|$189.5K/yr - $236.9K/yr|Remote|San Francisco, CA (Remote)

Earnest is dedicated to empowering ambitious individuals to make informed financial decisions and create the lives they aspire to lead.

Our team, known as Earnies, is passionate about providing borrowers with smarter borrowing solutions that offer a clearer path toward financial empowerment. If you share our enthusiasm for this mission, we invite you to explore the details below and join us in building something exceptional.

The Senior Model Risk Manager will report directly to the Head of Credit Risk.

In this role, you will:
- Take ownership of and enhance Earnest’s Model Risk Management framework, ensuring that our credit, loss forecasting, fraud, marketing, and finance models are robust, transparent, and scalable.
- Conduct independent end-to-end model validations, from conceptual soundness and data quality to performance monitoring and implementation review, providing constructive feedback to modeling teams.
- Collaborate closely with Data Science and Risk leaders early in the model design process to refine assumptions, enhance methodologies, and uplift modeling standards throughout the organization.
- Supervise model performance monitoring and proactively identify emerging risks, performance drift, or control deficiencies, ensuring timely and effective remediation.
- Produce clear, decision-ready validation reports and effectively communicate technical findings to drive impactful business outcomes and sound risk management decisions.
- Act as a trusted advisor on model governance, enabling Earnest to operate swiftly while maintaining the necessary discipline and controls of a leading lending platform.

Mar 11, 2026
Okta, Inc.
Full-time|$229K/yr - $315K/yr|On-site|San Francisco, California

Okta secures identity for both AI and people, building trusted, neutral infrastructure to help organizations navigate the evolving AI landscape. The company values innovators and problem solvers who act with urgency and precision.

Role overview
The Director of External Communications for Product and Corporate Initiatives leads Okta’s external communications strategy in the United States, with a focus on positioning Okta as a leader in AI security. This senior leader develops and implements a data-driven communications framework, emphasizing measurable results and decisive action in a competitive industry. The director crafts stories that highlight Okta’s product innovations and corporate initiatives, connecting the brand to key industry trends across both traditional and emerging media platforms.

This position reports to the Senior Director of Global Communications. The director manages a team of communications professionals and works closely with executive leadership, as well as product, security, and marketing teams. Together, they create narratives and manage communications projects aimed at engaging audiences such as CIOs, CISOs, developers, and investors.

What you will do
- Advance Okta’s external communications strategy for the U.S. market
- Build and execute a modern, data-driven communications framework
- Highlight product innovation and corporate initiatives through compelling storytelling
- Engage with both traditional and emerging media platforms
- Lead and mentor a team of communications professionals
- Collaborate with executive leadership, product, security, and marketing teams
- Manage high-stakes communications projects targeting CIOs, CISOs, developers, and investors

Location
San Francisco, California

Apr 21, 2026
Zyphra
Full-time|On-site|San Francisco

Zyphra is an innovative artificial intelligence company located in the heart of San Francisco, California.

The Opportunity:
Join our dynamic team as a Research Engineer - Audio & Speech Models, where you will play a pivotal role in advancing Zyphra’s Audio Team. You will be instrumental in developing cutting-edge open-source text-to-speech and audio models. Your contributions will span the full spectrum of the model training process, from data collection and processing to the design of innovative architectures and training approaches.

Your Responsibilities:
- Conduct large-scale audio training operations
- Optimize the performance of our training infrastructure
- Collect, process, and evaluate audio datasets
- Implement architectural and methodological improvements through rigorous testing

What We Seek:
- A strong research mindset with the ability to navigate projects from ideation to implementation and documentation.
- Proficiency in rapid prototyping and implementation, allowing for swift experimentation.
- Effective collaboration skills in a fast-paced research environment.
- A quick learner who is eager to embrace and implement new concepts.
- Excellent communication abilities, enabling you to contribute to both research and engineering tasks at scale.

Preferred Qualifications:
- Expertise in training audio models, such as text-to-speech, ASR, speech-to-speech, or emotion recognition.
- Experience with training audio autoencoders.
- Solid understanding of signal processing, particularly in audio.
- Familiarity with diffusion models, consistency models, or GANs.
- Experience with large-scale (multi-node) GPU training environments.
- Strong understanding of experimental methodologies for conducting rigorous tests and ablations.
- Interest in large-scale, parallel data processing pipelines.
- Competence in PyTorch and Python programming.
- Experience contributing to large, established codebases with rapid adaptation.

Aug 28, 2025
Perplexity
Full-time|On-site|San Francisco

About the Role
Perplexity is seeking a talented Model Behavior Architect to join our innovative AI team in San Francisco. In this role, you will be instrumental in developing and evaluating AI products that enhance user experiences across various domains. Collaborating closely with both research and product teams, you will design strategies for prompt and context engineering that ensure high-quality interactions.

This position uniquely blends creativity and analytical skills. You will gain a profound understanding of our answer engine by rigorously testing model capabilities and working with our AI infrastructure, including system prompts, tool prompts, skills, and evaluations, to create an exceptional product experience for our users.

As the go-to expert on prompting, model quality, and behavioral consistency, you will be pivotal in the deployment of new product features and model releases.

Key Responsibilities
- Context Engineering: Create, test, and refine context strategies and system prompts that influence answer engine behavior across various products, features, and use cases.
- Evaluation Systems: Develop automated and semi-automated evaluation pipelines to assess model quality, detect regressions, and scale across product surfaces.
- Model Launch Support: Collaborate with research and engineering teams to validate model behavior prior to and during rollouts, ensuring seamless transitions without any degradation.
- Research & Analysis: Identify inconsistencies and potential failure modes in model outputs through meticulously designed research initiatives for both internal and production-facing systems.
- Cross-functional Collaboration: Work closely with design, product, and research teams to translate product objectives into specific model behavior requirements.
- Knowledge Sharing: Assist engineers across teams in developing a strong understanding of prompt design, context engineering, and evaluation best practices.
- Staying Current: Keep abreast of the latest alignment, evaluation, and prompting techniques from both industry and academia, and integrate the best ideas into the team.

Jan 15, 2026
Apply
Cartesia
Full-time|On-site|*HQ - San Francisco, CA

Join Cartesia as a Model Architecture Researcher

At Cartesia, our vision is to revolutionize AI by creating interactive intelligence that is seamlessly integrated into your daily life. Unlike current models, our goal is to develop systems capable of processing extensive streams of audio, video, and text (1 billion text tokens, 10 billion audio tokens, and 1 trillion video tokens) directly on devices.

As pioneers in innovative model architectures, our founding team, which originated from the Stanford AI Lab, has developed State Space Models (SSMs), a groundbreaking foundation for training efficient, large-scale models. Our diverse team merges deep expertise in model innovation with a design-focused engineering approach, allowing us to create and deploy state-of-the-art models and applications.

Backed by leading investors such as Index Ventures, Lightspeed Venture Partners, and many others, including industry veterans and advisors, we are poised to shape the future of AI.

Your Contribution

In this role, you will drive forward-thinking research in neural network architecture, focusing on alternative models like state space models, efficient transformers, and hybrid architectures.
Create innovative architectures that enhance model performance, inference speed, and adaptability in various environments, from cloud infrastructures to on-device implementations.
Develop advanced capabilities for models, including statefulness, long-range memory, and novel conditioning mechanisms to boost expressiveness and generalization.
Analyze architectural decisions and their effects on model characteristics such as scalability, robustness, latency, and energy consumption.
Create frameworks and tools to assess architectural advancements, benchmarking their performance in both research and production contexts.
Collaborate with interdisciplinary teams to translate architectural insights into scalable systems that deliver real-world impact.

Your Qualifications

Extensive experience in architecture design with a focus on advanced models such as state space models, transformers, and RNN/CNN variants.
In-depth understanding of the interplay between architectural designs and system constraints, particularly in cloud and on-device deployments.
Strong proficiency in the design and evaluation of neural network architectures.

Dec 12, 2024
Apply
World Labs
Full-time|$250K/yr - $325K/yr|On-site|San Francisco

About World Labs

At World Labs, we create foundational world models capable of perceiving, generating, reasoning, and interacting with the 3D environment. Our mission is to unlock the full potential of AI through spatial intelligence, transforming perception into action, reasoning into insight, and imagination into creation. We believe that spatial intelligence will revolutionize storytelling, creativity, design, simulation, and immersive experiences across both virtual and physical realms. Our world-class team is driven by curiosity and passion, boasting diverse backgrounds in technology, from AI research and systems engineering to product design. This synergy fosters a tight feedback loop between our cutting-edge research and user-empowering products.

Role Overview

We are seeking an innovative Research Scientist specializing in generative modeling, especially diffusion models, to join our modeling team. This position is ideal for individuals with extensive expertise in applying diffusion models to images, videos, or 3D assets and scenes. While not mandatory, experience in any of the following areas will be considered a significant advantage:

Large-scale model training
Research in 3D computer vision

In this role, you will work closely with researchers, engineers, and product teams to translate advanced 3D modeling and machine learning techniques into practical applications, ensuring our technology stays at the forefront of visual innovation. This position entails substantial hands-on research and engineering work, taking projects from conception to production deployment.

Key Responsibilities

Design, implement, and train large-scale diffusion models for generating 3D worlds.
Develop and experiment with large-scale diffusion models to introduce novel control signals, align with target aesthetic preferences, or optimize for efficient inference.
Collaborate closely with research and product teams to comprehend and translate product requirements into actionable technical roadmaps.
Contribute actively to all phases of model development, including data curation, experimentation, evaluation, and deployment.
Continuously investigate and integrate the latest research in diffusion and generative AI.
Serve as a key technical resource within the team, mentoring peers and promoting best practices in generative modeling and machine learning engineering.

Feb 18, 2026
Apply
Meter
Full-time|$160K/yr - $230K/yr|On-site|San Francisco

About Meter

At Meter, we believe that networking is at the heart of technological advancement. We have innovatively unified the entire networking stack and are now on a mission to make it autonomous.

Our team is developing a cutting-edge neural network-driven system designed to analyze raw computer networks, enabling us to address all networking challenges. As outlined on Meter.ai, we are creating models within a closed-loop system that utilizes real-time telemetry, logs, and network events to autonomously troubleshoot issues, enhance performance, and resolve challenges.

To achieve this, we require not only exceptional models but also robust infrastructure that ensures our models have clean, versioned, and low-latency access to the necessary data throughout training, evaluation, and deployment phases.

Why this Role is Essential

Each Meter network deployed in the field serves as a valuable data source for our Models team. However, without meticulous infrastructure design, this data risks becoming fragmented, outdated, or inconsistent. In this role, you will ensure that such pitfalls are avoided. You will be responsible for the core data interface that drives our model development, experimentation, evaluation, and real-time inference.

This position is fundamental and offers a significant impact. Your contributions will shape the speed at which we can train new models, the reliability of their evaluations, and their seamless operation across hundreds of real-world networks. You will collaborate closely with modelers to deliver systems that are elegant, scalable, and robust.

Your Responsibilities

Design and implement the Models API: a unified interface for accessing training, evaluation, and deployment data across raw, transformed, and feature-engineered layers.
Ensure backward compatibility and feature versioning across continually evolving schemas.
Develop scalable pipelines to ingest, transform, and serve petabytes of data across Kafka, Postgres, and ClickHouse.
Create CI/CD workflows that evolve the API in tandem with changes to the underlying data schema.
Facilitate fine-grained querying of historical and real-time data for any network, at any point in time.
Help establish and promote the principle of 'smart data, dumb functions': maximizing operations in the data layer to minimize downstream code complexity.
Collaborate with modelers to co-design training frameworks that optimize performance.

Jul 26, 2025
