Experience Level
Manager
Qualifications
Proven experience in engineering management, particularly within AI or data-centric environments. Strong background in software development and architectural design. Excellent leadership and team-building skills, with a focus on mentoring engineers. Ability to communicate effectively with technical and non-technical stakeholders. Experience with Agile methodologies and project management tools. A passion for innovation and continuous improvement.
About the job
Decagon is seeking an Engineering Manager to lead its AI & Data Infrastructure team in San Francisco. This role centers on guiding engineers as they develop AI solutions and robust data frameworks to advance Decagon’s technology roadmap.
Role overview
The Engineering Manager will oversee a team dedicated to AI and data infrastructure initiatives. The position involves hands-on leadership, ensuring projects move forward and align with company objectives.
What you will do
Lead and mentor engineers working on AI and data infrastructure projects
Drive project execution to enhance product capabilities
Foster a collaborative and supportive team environment
Oversee strategic planning and allocate resources for the team
Manage team performance and encourage professional growth
Requirements
Experience leading technical teams in AI and data infrastructure
Strong leadership and clear communication abilities
Skill in strategic planning and resource management
Dedication to building technology solutions that make a difference
This position offers the chance to shape Decagon’s products and technology direction through AI and data-driven work.
About Decagon
At Decagon, we are at the forefront of technological innovation, specializing in AI and data-driven solutions. Our team is dedicated to pushing the boundaries of what is possible, creating products that not only meet the needs of our clients but also set new industry standards. We value creativity, collaboration, and a commitment to excellence.
Similar jobs
Full-time|$200K/yr - $275K/yr|On-site|San Francisco, CA
At Peregrine Technologies, a company backed by top-tier Silicon Valley investors, we empower public safety organizations, state and local governments, federal agencies, and private-sector entities to tackle societal challenges with unparalleled speed and precision. Our cutting-edge AI-enabled platform transforms fragmented and isolated data into actionable operational intelligence, delivering crucial insights that enhance decision-making processes and improve outcomes across various scenarios. Currently, we proudly serve hundreds of clients across over 30 states and two countries, impacting more than 125 million lives, and we are poised for further growth as we expand into the enterprise sector and international markets.
Team
We believe that empathy is key to enhancing our solutions. Our engineering team prioritizes understanding how users interact with our products, which guides us in finding the best solutions. You'll have the chance to collaborate closely with our onsite team to explore the diverse use cases that Peregrine addresses.
We value both ownership and teamwork. In this role, you will be responsible for significant features while working alongside fellow engineers to bring them to fruition. We hold humility and empathy in high regard as essential traits for crafting effective solutions, and you will engage directly with our deployment team and users to iterate on problem-solving. Creativity and resilience are vital as we pursue our vision.
Role
We are in search of a Staff Data Infrastructure Engineer to join our dynamic team. In this role, you will take full ownership of the data layer that is foundational to all of Peregrine's operations. You will design and construct the systems responsible for ingesting, storing, and serving vast amounts of real-time operational data, empowering our clients to make critical decisions quickly and confidently.
This senior individual contributor position is ideal for someone who excels at tackling complex technical challenges and possesses the experience and judgement necessary to influence key infrastructure decisions. You will engage with a variety of intricate challenges, including:
Designing and managing a high-throughput, real-time data integration platform across diverse customer environments.
Architecting a scalable open table format layer to ensure reliable data storage at petabyte scale.
Building and optimizing distributed data processing pipelines using Apache Spark and related streaming technologies.
Enhancing performance, reliability, and cost efficiency across the entire data infrastructure stack.
Collaborating with platform and product engineering teams to define data contracts, schemas, and integration pathways.
Position Overview
Join OpenEvidence as a Data Infrastructure Software Engineer, where you will engineer comprehensive systems that drive essential product and research operations. Your focus will be on optimizing performance, ensuring scalability, and enhancing accuracy, while enjoying the autonomy to manage the infrastructure that assists healthcare professionals in navigating complex clinical decisions in real-time.
We value exceptional creators who thrive in versatile roles. Our engineers engage across various products and projects, taking ownership wherever they can make the most significant impact.
About OpenEvidence
OpenEvidence is the leading medical AI platform globally, utilized by over 40% of clinicians in the U.S. in just over a year through organic product-led growth. As a $12 billion company, our engineering team comprises 30 talented individuals from MIT, Harvard, and Stanford. We believe that groundbreaking products are born from a small group of exceptional builders, driven by focused goals and empowered to take ownership and act swiftly. We are expanding our team to capitalize on an unparalleled opportunity to set the standard for medical AI platforms.
If you are a top-tier engineer or scientist eager to push the boundaries and achieve tangible outcomes that affect millions of lives, we want to connect with you.
Our Culture
We expect our work to be performed at an elite level. The journey from concept to execution and scaling is akin to a professional sport, where excellence is non-negotiable. We believe that the creation of innovative technologies is only achievable through complete ownership. Significant achievements happen when individuals take the initiative to see them through.
Your Profile
This role is not for those seeking a 9-to-5 job or merely looking to write papers. If you are ready to dive into the trenches, tackle challenges head-on, and create something from scratch that could impact millions and drive substantial revenue, you might be the perfect fit.
We seek brilliant builders who are intelligent, ambitious, resourceful, self-reliant, detail-oriented, driven, hardworking, and humble. Does this sound rare? It is, as we have only found 30 of them so far, and we are eager to discover more.
About Us:
At novita-ai, we are a rapidly growing global provider of AI cloud infrastructure, leading the charge in the artificial intelligence revolution. Our innovative platform equips developers and enterprises with powerful, scalable, and user-friendly solutions such as Model APIs, GPU Instances, and Serverless Computing. As organizations around the globe strive to integrate AI into their offerings, we serve as the essential engine that fuels their innovative efforts.
Join our world-class team and contribute to our expanding customer base. This unique opportunity allows you to be part of a dynamic company in a hyper-growth market, where your technical skills will directly impact customer success and drive our business forward.
The Role:
As a Solutions Engineer, you will act as the primary technical leader and trusted advisor for our clients throughout their journey. You will collaborate closely with the sales team to bridge the gap between complex customer challenges and our sophisticated technical solutions. Your mission is to build technical credibility, demonstrate the capabilities of our platform, and design tailored solutions that empower our clients to achieve their AI-related business objectives.
What You'll Do:
Technical Discovery & Solution Design: Collaborate with Account Executives to gain a deep understanding of customer needs, technical requirements, and business goals. Develop elegant and effective solutions utilizing our AI infrastructure stack (Model APIs, GPU Instances, Serverless).
Product Demonstration & Proof of Concept (POC): Conduct engaging, customized product demonstrations and interactive workshops. Plan, manage, and execute successful POCs, showcasing the value and performance of our platform within the client’s environment.
Technical Evangelism & Trusted Advisory: Communicate the value proposition of our platform to diverse audiences, including both technical and non-technical stakeholders, from engineers to C-level executives. Establish yourself as the go-to expert for customers on best practices in AI infrastructure.
Sales Enablement & Market Feedback Loop: Create and maintain technical sales materials, including whitepapers, best practice guides, and demo scripts. Serve as the voice of the customer, relaying valuable feedback from the field to our Product and Engineering teams to influence our product roadmap.
Onboarding & Implementation Guidance: Facilitate a seamless post-sales transition by providing initial onboarding support and architectural guidance, setting customers up for sustained success.
About Us
At Roboflow, our mission is to empower developers to make the world programmable through advanced artificial intelligence solutions. We believe that vision is a fundamental way we comprehend our environment, and soon, this understanding will be reflected in the software we utilize.
We are dedicated to creating tools, fostering community, and providing resources that simplify the development and deployment of computer vision models. With over 1 million developers, including teams from half of the Fortune 100, leveraging Roboflow's open-source and hosted machine learning tools, we are on a mission to enhance various industries, from accelerating cancer research through cell counting to improving construction site safety, digitizing floor plans, preserving coral reef ecosystems, guiding drone operations, and much more.
Our compact team is driven by a culture of collaboration, where we believe that our users' success is our success. One of our team members aptly described us as a company of
Full-time|$350K/yr - $475K/yr|On-site|San Francisco
At Thinking Machines Lab, our vision is to enhance human potential by advancing collaborative general intelligence. We are dedicated to creating a future where individuals have the resources and knowledge to harness AI for their specific objectives and aspirations.
Our team comprises scientists, engineers, and innovators who have developed some of the most popular AI products, including ChatGPT and Character.ai, as well as influential open-weight models like Mistral, along with highly regarded open-source projects such as PyTorch, OpenAI Gym, Fairseq, and Segment Anything.
About the Role
We are seeking a talented engineer to enhance our data infrastructure. You will become part of a dynamic, high-impact team tasked with designing and scaling the foundational infrastructure for distributed training pipelines, multimodal data catalogs, and sophisticated processing systems that manage petabytes of data.
Our infrastructure is pivotal; it serves as the foundation for every groundbreaking achievement. You will collaborate directly with researchers to expedite experiments, develop novel datasets, optimize infrastructure efficiency, and derive essential insights from our data repositories.
If you are passionate about distributed systems, large-scale data mining, and open-source tools such as Spark, Kafka, Beam, Ray, and Delta Lake, and enjoy building innovative solutions from scratch, we encourage you to apply.
Note: This is an evergreen role that we keep open continuously for expressions of interest. We receive a high volume of applications, and while there may not always be an immediate position that aligns perfectly with your skills and experience, we encourage you to apply. We regularly review applications and reach out as new opportunities arise. You are welcome to reapply after gaining more experience, but please refrain from applying more than once every six months. We may also post for specific roles for particular projects or team needs, and in those cases, you are welcome to apply directly in addition to this evergreen role.
About Our Team
At OpenAI, our Data Platform team is at the heart of our innovative approaches to data management, powering essential product, research, and analytics workflows. We manage some of the largest Spark compute fleets in production, architect data lakes and metadata systems on Iceberg and Delta, and envision exabyte-scale architectures. Our high-throughput streaming platforms utilize Kafka and Flink, while our orchestration is powered by Airflow. We also support machine learning feature engineering tools such as Chronon. Our mission is to provide secure, reliable, and efficient data access at scale, thereby enhancing intelligent, AI-assisted data workflows.
Join us in building and maintaining these core platforms that are foundational to OpenAI's products, research, and analytics capabilities.
We are not just scaling infrastructure; we are transforming the way people engage with data. Our vision includes intelligent interfaces and AI-powered workflows that make data interactions faster, more reliable, and intuitive.
About the Position
In this role, you will focus on constructing and managing data infrastructure that supports extensive compute fleets and storage systems optimized for high performance and scalability. You will be instrumental in designing, developing, and operating the next generation of data infrastructure at OpenAI. Your responsibilities will encompass scaling and securing big data compute and storage platforms, building and maintaining high-throughput streaming systems, ensuring low-latency data ingestion, and facilitating secure, governed data access for machine learning and analytics. You will also prioritize reliability and performance at extreme scales.
You will have complete ownership of the full lifecycle: from architecture to implementation, production operations, and on-call responsibilities.
You should be experienced with platforms such as Spark, Kafka, Flink, Airflow, Trino, or Iceberg. Familiarity with infrastructure tools like Terraform, along with expertise in debugging large-scale distributed systems, is essential. A passion for addressing data infrastructure challenges in the AI domain is a must.
This role is based in San Francisco, CA. We offer a hybrid work model requiring 3 days in the office each week and provide relocation assistance for new hires.
Responsibilities:
Design, build, and maintain data infrastructure systems including distributed compute, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, ensuring they are scalable, reliable, and secure.
Ensure our data platform can scale significantly while maintaining reliability and efficiency.
Enhance company productivity by empowering your fellow engineers and teammates through innovative data solutions.
Full-time|$228.6K/yr - $314.3K/yr|On-site|San Francisco, California
Databricks is seeking an experienced Senior Manager of Infrastructure Data Science to redefine our infrastructure landscape through innovative data science solutions. In this pivotal role, you will confront complex challenges encompassing capacity planning, performance optimization, reliability engineering, infrastructure efficiency, and enhancing customer experience. You will lead a talented team of data scientists and collaborate closely with engineering leaders to provide actionable data-driven insights and solutions.
At Databricks, our passion lies in empowering data teams to tackle the world's most pressing challenges, from cybersecurity threat detection to advancements in cancer treatment. We achieve this by developing and managing a cutting-edge data and AI infrastructure platform, enabling our customers to concentrate on the high-impact challenges central to their missions.
Founded in 2013 by the original creators of Apache Spark, Databricks has rapidly evolved from a small office in Berkeley, CA to a global powerhouse with over 7,000 employees. Thousands of organizations, from startups to Fortune 100 companies, rely on Databricks for their mission-critical workloads, making us one of the fastest-growing SaaS companies worldwide.
Our engineering teams are dedicated to creating highly technical products that address real-world needs. We continuously push the limits of data and AI technology while maintaining the resilience, security, and scalability essential for customer success on our platform.
Join our innovative team at alljoined as a Data Infrastructure Engineer where you will play a pivotal role in shaping our data architecture and ensuring the reliability and efficiency of our data systems. You will collaborate with cross-functional teams to design and implement scalable data solutions that empower data-driven decision-making.
Innovating the Future of Software
As we approach 2026, the software industry is facing an unprecedented challenge: the 'infinite software crisis.' At Sazabi, we are dedicated to redefining how engineering teams support, maintain, and operate the rapid growth in application development.
Introducing Sazabi: The AI-Native Observability Platform for Agile Engineering Teams.
Our platform empowers teams by providing a centralized solution to inquire about their production systems in natural language, visualize system activities automatically, and diagnose issues ten times faster.
Say goodbye to tedious instrumentation, dashboard setups, and alert tuning: just straightforward answers.
We are proud to be backed by pioneers from leading AI organizations, including Vercel, Graphite, Daytona, Browserbase, LangChain, Mastra, Replit, and others.
Join Our Mission
At Hyperbolic Labs, we are dedicated to democratizing artificial intelligence by eliminating barriers to computing power through our Open-Access AI Cloud. We aggregate global computing resources to provide an innovative GPU marketplace and AI inference service, making AI affordable and accessible for everyone. As pioneers at the crossroads of AI and open-source technology, we envision a future where AI innovation is driven by imagination, not resource limitations. We invite forward-thinking individuals who share our vision of making AI universally accessible, secure, and cost-effective to join us in crafting a platform that empowers innovators to realize their groundbreaking AI projects.
As we gear up for expansion following our Series A funding, our team, led by co-founders with PhDs in AI, Mathematics, and Computer Science, is set to transform the landscape of computing.
The Role
We are on the lookout for a Senior Infrastructure Engineer to drive the development and scaling of Hyperbolic's GPU Cloud Marketplace. In this pivotal role, you will create a multi-tenancy provisioning and virtualization solution that transforms raw GPUs from diverse global suppliers into a programmable, orchestrated resource pool serving thousands of AI developers and researchers. You will work at the forefront of cloud infrastructure, building the core orchestration layer that allows our platform to deliver cost savings of up to 75% compared to traditional cloud providers.
About Rox
At Rox, we are pioneering the development of an AI-native revenue operating system that transforms how enterprises interact with technology. Unlike traditional software designed for human dashboard operators, Rox is engineered for agents managing complex systems.
We eliminate static workflows, enabling continuous decision-making processes powered by real-time insights from across the enterprise landscape. Our agents are equipped to analyze signals, reason through them, and autonomously execute actions.
To support this innovation, we are constructing a robust infrastructure that integrates:
Distributed data platforms
Real-time decision-making systems
Agent execution frameworks
Low-latency context retrieval
Backed by prominent investors like Sequoia, GV, and General Catalyst, we are assembling a talented team of engineers eager to tackle deep technical challenges that have a tangible impact on the world.
About the Foundations Team
The Foundations team is responsible for developing the core infrastructure that powers Rox agents.
Our work focuses on:
Real-time context ingestion
Agent execution and orchestration
Ensuring reliability for long-term AI tasks
Low-latency decision-making across distributed systems
If you have experience with:
Streaming compute platforms
Distributed query engines
Real-time OLAP systems
Matching engines
Large-scale data infrastructure
Many challenges you encounter here will resonate with your past work but will be applied to a novel category of software. At Rox, agents continuously:
Retrieve context
Make decisions
Trigger actions
Update state
The Foundations team builds the infrastructure that ensures these feedback loops are reliable, swift, and observable.
The Role
We are on the lookout for a Foundations Engineer (Deep Infrastructure) to design and oversee the systems that power Rox's agent runtime.
Join Our Team as an AI Infrastructure Engineer
At Spellbrush, the premier generative AI studio behind niji・journey, we are in search of a talented AI Infrastructure Engineer to help us develop and enhance our end-to-end machine learning infrastructure, facilitating the operation of our models across a variety of platforms.
Key Responsibilities:
Design, implement, and maintain next-generation inference architecture to optimize the performance of our models across mobile, web, and other platforms.
Collaborate with a dynamic team focused on creating cutting-edge image generation models that serve over 16 million users globally.
Ideal Candidate Profile:
Experience with Large Distributed Systems: You possess a strong background in working with modern technologies such as Kubernetes (K8S), Kafka, NATS, Redis, among others. Your hands-on experience spans both on-premises and multi-cloud environments, and you understand the intricacies and potential pitfalls of each system.
Expertise in GPU Workloads: Your understanding of GPU processing for handling substantial workloads sets you apart. Having experience in deploying or optimizing GPU workloads end-to-end is a significant advantage.
Passion for Anime Aesthetics: As avid anime enthusiasts, we value team members who share our passion for the anime aesthetic, contributing to a creative movement that engages millions.
Team Player in Fast-Paced Environments: You thrive in small, agile teams and are eager to work alongside some of the world's top AI researchers, contributing to the best image models globally. We believe in the power of in-person collaboration, with opportunities at our offices in Tokyo (downtown Akihabara) or San Francisco. Visa sponsorships are available.
Join the Revolution at Retell AI
Retell AI is pioneering the future of call centers through innovative voice AI, driven by first principles thinking.
In just 18 months since our inception, we have empowered thousands of businesses with our AI voice agents, transforming how sales, support, and logistics calls are managed, previously requiring extensive human teams. Supported by prestigious investors such as Y Combinator and Alt Capital, we've rapidly scaled from $5M ARR to an impressive $36M ARR with a compact yet dynamic team of 20.
Our ambition for 2026 is to create a revolutionary customer experience platform, where entire contact centers are powered by AI. Moving beyond basic automation, we aim to develop intelligent AI “workers” that serve as frontline agents, QA analysts, and managers, continuously enhancing customer interactions without the need for constant human oversight.
As we expand, we are seeking passionate engineers who are eager to solve challenging technical problems, act swiftly, and make a significant impact in one of the fastest-growing voice AI startups. Let’s shape the future together.
About Our Innovative Team
Join the Workload team at OpenAI, where we are at the forefront of designing and managing the cutting-edge infrastructure that drives the training and inference of large language models (LLMs) at an unprecedented scale. Our systems are engineered to harmonize the complex processes of model training and serving, abstracting performance, parallelism, and execution across extensive GPU and accelerator networks. This robust foundation allows researchers to concentrate on elevating model capabilities, while we take care of the scalability, efficiency, and reliability needed to bring these advanced models to life.
Your Role and Responsibilities
We are seeking a talented engineer to design and implement the dataset infrastructure that will fuel OpenAI’s next-generation training stack. Your primary focus will be on creating standardized dataset interfaces, scaling pipelines across thousands of GPUs, and proactively identifying and addressing performance bottlenecks. Collaboration with multimodal researchers and infrastructure teams will be key to ensuring that our datasets are unified, efficient, and user-friendly.
Key Responsibilities Include:
Design and maintain standardized dataset APIs, including those for multimodal (MM) data that exceeds memory capacity.
Develop proactive testing and validation pipelines for dataset loading at GPU scale.
Work collaboratively to integrate datasets into training and inference pipelines, ensuring seamless user experiences.
Document and maintain dataset interfaces to ensure they are discoverable, consistent, and easily adoptable by other teams.
Establish validation systems to assure datasets remain reproducible and unchanged once standardized.
Identify and troubleshoot performance bottlenecks in distributed dataset loading, such as stragglers impacting global training speed.
Create visualization and inspection tools to highlight errors, bugs, or bottlenecks in datasets.
Ideal Candidate Profile
Possess strong engineering fundamentals and experience in distributed systems, data pipelines, or infrastructure.
Have a proven track record in building APIs, modular code, and scalable abstractions, with a user-centric approach to design.
Be adept at debugging performance issues across large-scale machine fleets.
Demonstrate a passion for advancing data infrastructure to enhance research capabilities.
Full-time|On-site|San Francisco, California, United States
At Yutori, we are transforming the way individuals engage with the digital realm by developing AI agents capable of efficiently performing everyday online tasks. Our approach is to create a comprehensive, agent-first ecosystem, encompassing everything from training proprietary models to designing innovative generative product interfaces.
To further this mission, we are seeking a skilled AI Engineer to join our pioneering team. Ideal candidates should possess strong technical expertise and a passion for crafting superhuman AI agents that can navigate the web autonomously.
Our founders, Devi Parikh, Abhishek Das, and Dhruv Batra, bring a wealth of experience in AI research and product development, particularly in generative, multimodal, and embodied AI, honed during their time at Meta. Our team merges AI proficiency with a design-oriented approach to advance Yutori’s objectives.
Yutori is proudly supported by a distinguished group of visionary investors, including Elad Gil, Sarah Guo, Jeff Dean, Fei-Fei Li, Amjad Masad, Guillermo Rauch, Akshay Kothari, Soleio, Oliver Cameron, Julien Chaumond, Logan Kilpatrick, Bryan McCann, Vladlen Koltun, Jamie Cuffe, Michele Catasta, and many others.
Full-time|$204K/yr - $255K/yr|On-site|United States
Founded in 2007, Airbnb has transformed from a small startup welcoming three guests into a global community of over 5 million hosts and more than 2 billion guest arrivals in virtually every country. Our platform offers distinctive stays and experiences, fostering authentic connections between guests and local communities.
Join Our Team
As a member of the Workflow Orchestration team within Airbnb’s Data Infrastructure organization, you'll play a vital role in managing the orchestration layer that coordinates, executes, and monitors sophisticated data workflows across both batch and streaming domains. Our objective is to equip data engineers, ML teams, analytics, and operational applications with scalable, reliable, and observable orchestration platforms, ensuring that essential business workflows operate seamlessly.
Your Impact
In the role of Engineering Manager, you will:
Lead and develop a high-performing team tasked with designing, implementing, and maintaining distributed orchestration infrastructure that handles tens of thousands of data workflows daily.
Establish the long-term technical vision and roadmap for orchestration, aligning it with Airbnb’s broader Data Infrastructure and Analytics platform strategy.
Promote the adoption of best-in-class workflow paradigms and tools across data engineering, reliability, and ML teams, ensuring consistency, performance, and operational excellence.
Collaborate closely with cross-functional teams in Data Platform, Compute, Storage, Analytics, and ML Infrastructure to enhance orchestration capabilities within the larger data ecosystem.
Mentor and coach engineers to foster strong technical judgment, clarity of thought, and a sense of ownership within the team.
A Day in Your Life
Work alongside senior engineering leaders to shape multi-year strategies for workflow orchestration and execution platforms.
Stay involved in architectural decisions, review designs, and help resolve technical challenges.
Collaborate with product and engineering leaders across Data Infrastructure to prioritize investments that balance reliability, developer experience, and cost.
Ensure your team adheres to strong delivery discipline, optimizing workflows and practices.
Full-time|$216.2K/yr - $270.3K/yr|On-site|San Francisco, CA; New York, NY
Join our dynamic Machine Learning Infrastructure team as a Senior AI Infrastructure Engineer, where you will play a pivotal role in designing and constructing platforms that ensure the scalable, reliable, and efficient serving of Large Language Models (LLMs). Our innovative platform supports a range of cutting-edge research and production systems, catering to both internal and external applications across diverse environments.
The ideal candidate will possess a solid foundation in machine learning principles coupled with extensive experience in backend system architecture. You will thrive in a collaborative environment that bridges research and engineering, working diligently to provide seamless experiences for our customers and accelerating innovation across the organization.
Full-time|$138K/yr - $259.4K/yr|On-site|San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC
Scale AI is on the lookout for an exceptionally talented and driven Software Engineer, Frontier AI Infrastructure to become an integral part of our innovative Public Sector Engineering team. In this role, you will take charge of the model inference layer, enabling cutting-edge AI models, troubleshooting the latest AI tools, managing networking tasks, addressing latency issues, and monitoring pricing and usage metrics for AI models. You will spearhead technical discussions with cloud vendors and clients to fulfill critical contracts and resolve platform challenges. Additionally, you will collaborate closely with Product teams to anticipate feature requirements, transitioning from reactive 'infra-only debugging' to proactive integration testing.
Your Responsibilities Include:
Designing and implementing secure, scalable backend systems tailored for Public Sector clients, utilizing Scale's advanced cloud-native AI infrastructure.
Owning services or systems while defining long-term health objectives and enhancing the health of related components.
Redesigning the architecture to operate in compliant or restrictive environments, which entails creating swappable components (authentication, storage, logging) to adhere to government and security regulations without compromising product integrity.
Collaborating with Product teams to develop integration tests that identify issues early, shifting focus from 'infra-only debugging' to preventing upstream failures.
Actively participating in customer engagements, liaising with stakeholders to comprehend requirements and deliver innovative solutions.
Contributing to the platform roadmap and product strategy for Scale AI's Public Sector division, playing a vital role in shaping the future trajectory of our offerings.
The Bot Company
At The Bot Company, we are on a mission to create an innovative robot that enhances everyday life in homes everywhere.
Located in the heart of San Francisco, our compact team comprises talented engineers, designers, and operators hailing from esteemed organizations such as Tesla, Cruise, OpenAI, Google, and Pixar. With a track record of delivering exceptional products to hundreds of millions of users, we understand the intricacies involved in crafting remarkable experiences.
Our intentionally lean structure fosters swift decision-making while eliminating unnecessary bureaucracy. Each team member operates as an individual contributor, endowed with substantial autonomy, ownership, and accountability. We thrive on a culture of rapid iteration and efficient execution, working collaboratively across the technology stack.
Nov 21, 2025