Experience Level
Senior
Qualifications
Key Responsibilities
Design and implement a reliable, low-maintenance, and high-uptime distributed inference system. Lead initiatives in observability and on-call practices. Optimize for minimal latency and jitter, focusing on achieving exceptional P99 performance. Collaborate with ML, software, and mobile engineers to establish a safe and efficient rollout process to production. Develop and maintain data pipelines for user-facing features and internal tools.
Required Qualifications
Proficient in Python and JavaScript. Demonstrated experience in deploying consumer-facing interactive real-time services. At least 2 years of experience managing a live production system within a small team, servicing a large user base. Minimum 2 years of experience working in large infrastructure teams (Google, Meta, etc.). Proven technical leadership in developing robust processes with teams. Strong commitment to maintaining a healthy live service.
About the job
About Sandbar
At Sandbar, located in the vibrant heart of New York City, we are dedicated to enhancing human capability in an increasingly agentic world. Our innovative team has developed cutting-edge software, machine learning, and hardware products, collaborating with industry leaders including Meta, CTRL-labs, Google, Apple, Fitbit, Peloton, and Equinox.
Our flagship product, Stream, represents the future of human-agent interaction—a sophisticated private voice ring and conversational computer that has garnered attention from WSJ, Bloomberg, and Wired, with shipping set to commence in Summer '26.
We invite you to join us in pioneering a transformative interface for self-augmentation.
Your Role
We are on the lookout for a talented and experienced infrastructure engineer to play a critical role in the development of Stream. You will be instrumental in creating a highly scalable, real-time production service that incorporates multiple stages of machine learning and data processing. Collaboration with mobile and firmware engineers will be essential as you lay the groundwork for groundbreaking human-computer interactions.
About Sandbar
Sandbar is an innovative interface company based in New York City, focused on empowering individuals to think, act, and move freely in an agentic world. Our team has extensive experience building software, machine learning, and hardware products for leading brands, and we are excited about the future of human-agent interactions with our upcoming product, Stream.
Full-time|$140K/yr - $260K/yr|On-site|New York City
At Profound, we are dedicated to empowering businesses to comprehend and manage their AI presence effectively. As an Infrastructure Software Engineer, you will play a pivotal role in building and scaling the systems that facilitate our rapid expansion. Your primary focus will be on ensuring that our infrastructure is not only highly available but also cost-efficient and capable of managing significant traffic and compute demands. You will collaborate closely with engineers across product, research, and operations teams to design scalable architectures, streamline deployment processes, and enhance system observability.
Your Responsibilities
Design and maintain core infrastructure across diverse cloud environments.
Develop infrastructure-as-code workflows to automate deployment and scaling processes.
Enhance monitoring, logging, and alerting systems to ensure system reliability.
Oversee CI/CD pipelines to facilitate seamless deployments.
Assist in disaster recovery planning to guarantee high system availability.
Collaborate with product and research teams to design architectures that can scale with increasing workload demands.
Identify and resolve performance bottlenecks in compute, storage, and networking.
Implement security best practices and compliance frameworks across the infrastructure.
Full-time|$240K/yr - $270K/yr|On-site|New York, United States
At Genius Sports, we combine cutting-edge technology with premier live data to revolutionize the sports experience for fans around the globe. Our mission is to create more immersive, interactive, and personalized experiences than ever before. Discover more about us at geniussports.com.
The Role - Staff Engineer - Infrastructure Platform
We are on the lookout for an exceptional Staff Engineer to spearhead critical projects within our core infrastructure platform. Genius Sports is currently integrating its diverse tech teams and acquisitions under a cohesive technical strategy, and our infrastructure platform is the foundation of this transformation. Our primary objective is to empower engineering teams to efficiently build, deploy, and manage Genius Sports’ extensive product catalog in a consistent manner. In this role, you will collaborate with fellow InfraPlat leaders to define and execute the technical vision and implementation across an array of projects. These initiatives encompass multi-account and region Kubernetes clusters, MLOps, standardized deployment processes, and a centralized authentication platform. You will also engage with stakeholders from product engineering teams to assess requests, identify common challenges, and prioritize initiatives.
Full-time|$200K/yr - $250K/yr|On-site|New York, NY
Join Fluidstack: Pioneering the Future of Intelligence
At Fluidstack, we're transforming the landscape of artificial intelligence infrastructure. Collaborating with leading AI research labs, government entities, and major corporations (including Mistral, Poolside, Black Forest Labs, and Meta), we are dedicated to delivering computing solutions at unprecedented speeds. Our mission is to expedite the realization of Artificial General Intelligence (AGI), and we are seeking passionate individuals who thrive on purpose and excellence.
We take immense pride in the systems we develop and the trust we build with our clients. If you are ready to roll up your sleeves and contribute to shaping the future of intelligence, we invite you to join our innovative team.
Position Overview
Fluidstack, a prominent player in the cloud services arena, is on the lookout for a Software Engineer specializing in Infrastructure Platform Development. In this role, you will be instrumental in constructing the foundational platforms that support our global infrastructure and data center operations. Your focus will be on developing robust internal tools across various domains, including Configuration Management Database (CMDB), asset management, Data Center Infrastructure Management (DCIM), monitoring, observability, security, and operational automation.
Collaborating with cross-functional teams, you will craft scalable and user-friendly solutions that enhance our ability to provide top-tier infrastructure services.
Key Responsibilities
Infrastructure Platform Development
Design and implement a next-generation CMDB system to serve as the definitive source of truth for infrastructure assets, network architecture, and configuration data.
Develop DCIM platforms for managing rack operations, server/GPU deployments, operating system installations, quality assurance, and white-screen activities.
Create comprehensive asset lifecycle management systems encompassing receiving, racking, inventory, break-fix, and decommissioning workflows.
Build monitoring and observability platforms that integrate telemetry from Building Management Systems (BMS), Environmental Power Monitoring Systems (EPMS), and IT devices, featuring intelligent alerting and incident management capabilities.
Develop self-service portals and automation tools for new region initialization, post-deployment operations, and fleet-scale management.
Operational Excellence & Automation
Minimize manual tasks through workflow automation and self-service tools that empower our operations and engineering teams.
Create workflow orchestration systems to streamline complex multi-step processes that encompass incident, problem, and change management.
About Amperos Health
Amperos Health stands at the forefront of revolutionizing revenue cycle management (RCM) for healthcare clinics, enabling them to optimize revenue collection efficiently and swiftly. Established in 2023 and supported by notable investors such as Uncork, Neo, Nebular.vc, and strategic angels from industry leaders like OpenAI, Stripe, and Twilio, our mission is to transform the relationship between healthcare providers and payers. We envision an AI-driven workforce that alleviates administrative workload and accelerates the financial return for healthcare professionals.
About the Role
As our inaugural Infrastructure Engineer, you will spearhead the development of our infrastructure at Amperos. You will manage DevOps, enhance developer experience, ensure compliance, and oversee observability and monitoring within our AWS environment. This role presents exciting challenges associated with AI infrastructure, and you will have the chance to mold and lead your team as we expand.
Key Responsibilities
Establish and maintain the infrastructure to support Amperos' growth trajectory.
Enhance and optimize AWS infrastructure, boosting development efficiency and creating modular engineering frameworks.
Develop systems that expedite the deployment of AI features and enhance observability of large language models (LLMs).
Minimize AWS expenditures while improving visibility into infrastructure costs.
Lead initiatives related to security and technical compliance, including VPNs, firewalls, and network configurations.
Candidate Profile
A minimum of 5 years of experience in leading infrastructure teams within top-tier organizations.
Ability to identify key ROI infrastructure challenges from the outset and develop a strategic roadmap for enhancements.
Proficient in leading engineering discussions and articulating the impact of infrastructure projects.
Adaptable and willing to take on diverse roles as needed; no task is too small.
Demonstrates high agency, with the capability to produce results with minimal guidance.
Excellent verbal and written communication skills; proactively shares vital information with the team.
Passionate about leveraging technology to address challenges in an underserved industry.
Join a Transformative Force in Technology
At Palantir, we create groundbreaking software that revolutionizes data-driven decision-making and operational efficiency. Our platforms enable partners to combat critical challenges, from developing life-saving medical treatments to predicting supply chain disruptions and locating missing persons.
Position Overview
As a Backend Software Engineer at Palantir, you will play an essential role in shaping how organizations leverage data. You will be engaged in all stages of the product lifecycle, from ideation and design to prototyping and final deployment. Collaborating with both technical and non-technical team members, you will gain insights into customer challenges and develop innovative solutions. We promote cross-team collaboration, allowing you to expand your knowledge across various technologies and product facets. You will work autonomously and make impactful decisions within a supportive community that fosters your growth into a technical leader.
Our Product Development teams are composed of small, focused groups of Software Engineers, each dedicated to distinct aspects of our offerings. The infrastructure teams are tasked with fortifying the foundational layers of our software stack, emphasizing database technologies, distributed systems, large-scale data architectures, security, and application infrastructure. In this role, you will write robust code that supports Palantir Foundry and Gotham, ensuring high performance, security, and scalability for products utilized by key public and private sector institutions. You will be instrumental in building the core capabilities that empower research scientists, aerospace engineers, intelligence analysts, and economic forecasters worldwide.
We seek engineers who are driven by a passion for solving real-world challenges, enabling both developers and end-users to maximize their productivity.
If you are eager to create reliable, high-performing, and scalable systems, and to design resilient APIs, this opportunity allows you to make a substantial impact on our products and the communities we serve.
Full-time|$145K/yr - $200K/yr|On-site|New York Office
Join Thread AI as an Infrastructure Software Engineer
At Thread AI, we are pioneering the development of an AI-native workflow orchestration engine. We are in search of passionate and skilled professionals eager to be a part of our expanding team. Our mission is to simplify infrastructure for enterprises and public sector organizations aiming to harness the full potential of artificial intelligence.
Located in the heart of New York, our diverse team comprises seasoned experts in AI, product management, and engineering, all dedicated to crafting and implementing intricate workflows and infrastructure solutions.
Our Engineering Culture
We pride ourselves on having a compact yet highly committed technical team that encompasses engineering, research, design, product, and operations. We believe that a small, empowered team with a flat organizational structure fosters rapid innovation and superior product development compared to larger, hierarchical entities.
Role Overview
We are looking for a talented software engineer with a robust background in infrastructure to contribute significantly to the scalability and reliability of our platform. The ideal candidate will support complex network topologies and should possess a deep curiosity and creativity, excelling in problem-solving within a fast-paced, ownership-driven environment.
About Kiddom
Kiddom is an innovative educational platform dedicated to enhancing student equity and growth. By combining high-quality instructional resources with interactive digital learning, Kiddom enables schools and districts to take control of their curriculum. This results in personalized learning experiences that cater to the distinct needs and aspirations of local communities. The platform is enriched with insightful data for teachers and leaders, fostering continuous improvement in instructional strategies, school programming, and professional development.
As a member of the InfraOps team, you will play a crucial role in supporting Kiddom's engineering efforts by developing a scalable and sustainable infrastructure that aligns with our company objectives. This is a unique chance to join a small, skilled team during a pivotal moment as we transition from Series C to D funding. You will have the autonomy to influence your work significantly, embracing a variety of tasks that reflect your expertise and enthusiasm.
Key Responsibilities:
Promote and cultivate a robust DevOps culture at Kiddom by collaborating with teams to establish best practices and guide both new and existing services.
Implement Infrastructure as Code (IaC) to ensure confidence in automated, repeatable processes.
Full-time|$150K/yr - $165K/yr|Hybrid|New York City
Astronomer develops Astro, a unified DataOps platform powered by Apache Airflow®. Over 800 enterprises rely on Astronomer to help their teams apply software, analytics, and AI to real-world challenges. More details are available at www.astronomer.io.
Role overview
The Airflow Infrastructure team at Astronomer scales Apache Airflow for organizations with complex data pipelines. This group works within Research and Development, focusing on cloud-native infrastructure and open-source integration. As a Software Engineer on this team, the main responsibility is building the infrastructure layer that connects open-source Airflow with scalable, enterprise-grade cloud systems. The work directly impacts how companies orchestrate and manage data pipelines, emphasizing speed, reliability, and ease of use. This position is based in New York City and follows a hybrid work schedule.
What you will do
Write and maintain backend services using clean, well-tested code. Collaborate with engineers, product managers, support, and leadership to align technical direction with business goals. Participate in code reviews and provide constructive feedback to peers. Enhance the performance, reliability, and scalability of backend systems. Prototype and suggest new ideas to improve user experience. Document systems and processes in a clear, accessible manner. Join on-call rotations to troubleshoot and resolve incidents as they occur.
Requirements
3-5 years of experience with Python, Golang, and Kubernetes. Strong understanding of distributed systems and microservices architecture. Experience working with cloud platforms and CI/CD pipelines. Demonstrated problem-solving skills and a collaborative mindset.
Join Palantir Technologies as a Senior Backend Software Engineer specializing in Infrastructure. In this pivotal role, you will work at the intersection of software engineering and infrastructure systems, contributing to the design and implementation of scalable backend solutions that empower organizations to make data-driven decisions.
Full-time|$180K/yr - $247.5K/yr|Remote|New York City or Remote
Join the Revolution at Check
At Check, we transform the way people get paid, simplifying payment processes for payroll businesses. As pioneers of embedded payroll, we collaborate with our partners to redefine payroll systems, enabling businesses to launch, grow, and flourish.
Check is more than just an API infrastructure; we serve as a launchpad for payroll businesses.
Our Team
Payroll systems are in need of innovation. Join a passionate team dedicated to solving these challenges! At Check, your problem-solving skills, critical thinking, and determination will drive impactful changes across our projects. We view challenges as opportunities and encourage collaboration that leverages the unique strengths of every team member.
If you are ready to dive in and reshape payroll, let’s work together to simplify complexities and create a brighter future for businesses of all sizes.
The Role
Engineering is the backbone of Check. We envision payroll as part of modern financial software, which necessitates robust systems that our operators and partners can rely on. Every solution we develop is built on reliable, scalable, and secure systems that ensure timely payments.
We are in search of a Staff Software Engineer who possesses strong software design expertise coupled with hands-on infrastructure experience. In this position, you will enhance the core systems that enable payroll operations, focusing on scalability, production efficiency, and empowering engineers with reliable tools for software deployment.
You'll collaborate across product and platform teams to advance our cloud infrastructure, enhance system deployment and monitoring, and simplify the architecture underpinning embedded payroll.
Your challenges will often bridge the domains of infrastructure, product, and operations.
This role is perfect for individuals who have managed complex systems in dynamic environments and take pride in creating resilient, understandable infrastructure that is essential for business operations.
Join Watershed as a Cloud Infrastructure Software Engineer
At Watershed, we are revolutionizing the way enterprises manage their sustainability efforts. Our platform is trusted by industry leaders such as Airbnb, Carlyle Group, FedEx, Visa, and Dr. Martens to effectively handle climate and ESG data, produce audit-ready metrics for various reporting requirements, and drive meaningful decarbonization initiatives. We are seeking passionate team members dedicated to product innovation and eager to contribute to a mission-driven startup culture.
With offices in San Francisco, New York, Denver, London, Paris, Berlin, Sydney, Mexico City, and a growing remote workforce across the US and Europe, we invite you to be part of our expanding team!
Your Role
As part of the Cloud Infrastructure team at Watershed, you will play a pivotal role in developing the foundational systems that our products rely on. You will also be instrumental in creating tools that our engineering teams use to deploy, test, scale, and monitor their applications effectively. This is an exciting opportunity to support a rapidly evolving codebase and build systems that enhance the experience for all engineers at our company.
We are looking for candidates with experience in managing cloud infrastructure, navigating various scalability challenges, and optimizing engineering productivity.
Familiarity with Google Cloud Platform is a plus, but not mandatory.
We value traits such as customer obsession, hard work, inclusivity, and optimism in our team members.
Are you a good fit?
3-5+ years of engineering experience.
Experience working on Infrastructure or Platform teams, or significant involvement in projects with infrastructure components (e.g., multi-region architecture, CI/CD, infrastructure as code, Kubernetes, release management, observability, cloud security).
A passion for building the right tools to match our company's growth stages.
A collaborative spirit that fosters great outcomes for customers.
This position is based in our New York office.
Join imprint as a Senior Software Engineer focusing on Infrastructure, where you will play a pivotal role in designing and implementing scalable systems that support our innovative services. You will collaborate with cross-functional teams to enhance our infrastructure and ensure seamless integration of applications. This role is perfect for a passionate engineer eager to tackle complex challenges and contribute to cutting-edge solutions.
Join a Revolutionary Company
At Palantir, we are at the forefront of developing cutting-edge software that transforms data into actionable insights. Our platforms empower partners to achieve incredible feats, from developing life-saving medications to addressing critical supply chain challenges, and even assisting in locating missing children.
Your Role
As a Software Engineer on our Infrastructure team focused on the Foundry platform, you will play a vital role in shaping how organizations leverage data. You will engage in the entire product lifecycle, from ideation and design to prototyping and deployment. Collaborating with both technical and non-technical team members, you will gain a deep understanding of customer challenges and contribute to innovative solutions. We foster cross-team collaboration, enabling you to broaden your skill set and gain insights into various technologies across our product landscape. As a valued member of our team, you will have the autonomy to make decisions while being supported by a community that encourages your professional growth, allowing you to flourish as a technical contributor and engineering leader.
Our Product Development division consists of small, focused teams, each dedicated to specific product aspects. Within the infrastructure team, you will work on the foundational layers of our software stack, concentrating on database technologies, distributed systems, large-scale data management, security, and application infrastructure. Your contributions will ensure the robustness, security, and scalability of the Palantir Foundry and Gotham platforms, powering essential solutions utilized by research scientists, aerospace engineers, intelligence analysts, and economic forecasters worldwide.
We are seeking engineers who are passionate about tackling real-world challenges and enabling developers and end-users to perform at their best.
If you're driven to create reliable, efficient, and scalable systems, and to design robust APIs and functionalities, this role offers you the chance to make a meaningful impact on our products and the users who depend on them.
Frontline Experience
Foundry Software Engineers may have the unique opportunity to participate in Frontline, an exclusive program that immerses you with customers. This short-term assignment allows you to work directly with users, gaining invaluable insights into product usage and customer challenges, and enabling you to address complex and ambiguous problems effectively.
Full-time|$140K/yr - $200K/yr|Remote|Rochester, NY, USA
Speechify aims to remove barriers to learning by transforming text into audio. Over 50 million people use Speechify’s text-to-speech tools to listen to PDFs, books, Google Docs, news, and websites. The product suite covers iOS, Android, Mac, Chrome, and web platforms. Google recognized Speechify as Chrome Extension of the Year, and Apple awarded it the 2025 Design Award for Inclusivity. The company operates fully remotely with a team of nearly 200. Team members include frontend and backend engineers, AI researchers, and professionals from Amazon, Microsoft, Google, Stanford, and founders of successful startups.
Role overview
Speechify is hiring a Software Engineer for the Data Infrastructure & Acquisition team in the AI department. This role centers on managing and improving data collection processes that support model training. The team builds large-scale, high-quality datasets for AI research and development, focusing on both scale and cost efficiency.
Location
Rochester, NY, USA (remote team)
Join Our Team at Basic Capital
Basic Capital is at the forefront of transforming America’s $1 trillion retirement sector. Our innovative approach focuses on developing the mortgage for retirement, granting market access, and ensuring that wealth is within reach for every American. Our mission is to create cutting-edge products, platforms, and a comprehensive credit marketplace that revolutionizes the retirement system.
Our founding team comprises seasoned professionals from prestigious companies like Goldman Sachs, Uber, Block, Stripe, and Robinhood. Supported by top-tier investors such as Lux Capital, Forerunner Ventures, BoxGroup, SVAngel, Inspired Capital, and Henry Kravis, we are located in SoHo, NYC, and are building a dynamic, high-performance team dedicated to mitigating wealth inequality.
Discover more about us by visiting Basic Capital’s website.
Your Role
Become a pivotal member of our Infrastructure & Security team, ensuring platform reliability, security integrity, and compliance that empowers Basic Capital to scale with assurance.
Take ownership of our SOC 2 Type II compliance journey by implementing security measures, documenting essential processes, and collaborating with auditors to secure and uphold certification.
Architect and implement robust observability frameworks, including logging, monitoring, alerting, and distributed tracing across our infrastructure.
Develop and uphold CI/CD pipelines, automate deployments, and establish infrastructure-as-code practices to facilitate rapid, secure releases.
Enhance system performance, reduce latency, and optimize cost-efficiency as we grow to accommodate a larger customer base and increased transaction volumes.
Oversee our cloud infrastructure (AWS), managing networking, security configurations, IAM policies, and compliance measures.
About Us
At Credal, we are committed to leveraging cutting-edge AI technologies to empower those who are dedicated to shaping a brighter future.
Our platform accelerates the complex operations of leading institutions while embedding robust security and governance at every level of our offerings. Our esteemed clientele includes the U.S. Department of Health and Human Services, MongoDB, Comcast NBC Universal, Lattice, Wise (formerly TransferWise), Checkr, and incident.io, among others.
With over $10 million in funding secured, we have ample runway to support our ambitious goals. Join a team of Thoughtful Doers™ hailing from top-tier companies such as Palantir, Cruise, Meta, Uber, and Google. Credal operates in-person from our spacious NYC office, complete with meals provided.
About Decagon
Decagon stands at the forefront of the conversational AI revolution, enabling brands to provide exceptional concierge-level customer experiences. Our cutting-edge technology empowers renowned enterprises such as Avis Budget Group, Block’s Cash App and Square, Chime, Oura Health, and Hunter Douglas to implement AI-driven agents. These agents facilitate personalized interactions across all communication channels including voice, chat, email, and SMS.
Our vision is to transform the landscape of customer interactions, moving beyond traditional support tickets and hold music toward expedited resolutions, enriched conversations, and stronger relationships. We are proudly supported by prestigious investors such as a16z, Accel, Bain Capital Ventures, Coatue, and Index Ventures, all of whom share our ambitious vision.
As an in-office company, we thrive on a collective commitment to excellence and speed. Our core values (Just Get It Done, Invent What Customers Want, Winner’s Mindset, and The Polymath Principle) guide our collaborative efforts and professional growth.
About the Team
The Infrastructure team is responsible for designing and maintaining the essential frameworks that drive Decagon’s operations, including networking, data management, machine learning serving, developer platforms, and real-time voice communication.
We collaborate closely with product, data, and machine learning teams to create high-scale, low-latency systems that meet stringent service level objectives and offer exceptional developer experience.
Our team focuses on five critical areas:
Core Infrastructure: Our foundational cloud architecture encompasses networking, computing, storage, security, and infrastructure-as-code, ensuring reliability, scalability, and cost-effectiveness.
Data Infrastructure: We develop streaming and batch data platforms that drive analytics and business intelligence while enabling customer-facing telemetry for both customer-managed and on-premises environments.
Machine Learning Infrastructure: Our platforms support GPU and model serving for large language model inference, featuring multi-provider routing and capabilities for on-premises or air-gapped deployments.
Developer Experience Platform: We focus on continuous integration and deployment (CI/CD), streamlined processes, and essential services that facilitate fast, safe, and consistent shipping across teams.
Voice Infrastructure: Our telephony and WebRTC stack ensures ultra-low-latency, high-quality voice communication, supported by comprehensive observability.
Our mission is to deliver extraordinary support experiences where AI agents collaborate with human operators to resolve challenges swiftly and accurately.
About the Role
We are seeking a Senior Infrastructure Engineer who will play a pivotal role in shaping our infrastructure landscape and enhancing the efficiency and performance of our systems.
About Us:
At Fireworks AI, we are at the forefront of creating next-generation generative AI infrastructure. Our cutting-edge platform is recognized for delivering the highest-quality models with unparalleled speed and scalability in inference. Independently benchmarked as a leader in LLM inference speed, we drive significant advancements through innovative projects, including our proprietary function calling and multimodal models. As a Series C company valued at $4 billion and backed by leading investors such as Benchmark, Sequoia, Lightspeed, Index, and Evantic, we are a dynamic team of builders, comprised of veterans from Meta PyTorch and Google Vertex AI.
The Role:
We are seeking a talented Software Engineer to join our AI Infrastructure team. In this pivotal role, you will contribute to designing and developing the foundational systems that power Fireworks AI’s generative AI platform. Your focus will be on building robust infrastructure and tools that guarantee the reliability, performance, quality, and availability of our AI systems.
Our mission is to establish Fireworks AI as the most dependable and user-friendly generative AI platform globally. You will collaborate closely with our cloud infrastructure, product, and performance teams to create infrastructure solutions that connect our customers with the high-performance proprietary Fireworks inference engine.
Key Responsibilities:
Design and develop scalable backend infrastructure supporting distributed training, inference, and data pipelines.
Build and maintain essential backend services, including LLM CI/CD pipelines, control planes, and model serving systems.
Enhance performance optimization, cost efficiency, and reliability across compute, storage, and networking layers.
Create frameworks and safeguards to ensure Fireworks AI maintains the highest model quality in the industry.
Work alongside performance, training, and product teams to translate research and product requirements into effective infrastructure solutions.
Engage in code reviews, technical discussions, and continuous integration and deployment processes.
The New York Times is on the lookout for a dynamic Principal Software Engineer to spearhead the architecture and advancement of our data and machine learning infrastructure. This pivotal role will lay the groundwork for innovative data-driven products, analytics, and AI applications. You will be responsible for designing robust systems that facilitate large-scale data processing, reliable pipelines, and efficient machine learning development, including feature engineering and real-time model serving. As a principal engineer, you will collaborate closely with product, data science, and platform teams to establish the technical direction, promote the adoption of reusable frameworks, and mentor engineers throughout the organization. Your focus will be on ensuring that both data and ML platforms are scalable, reliable, cost-efficient, and compliant with privacy and governance standards. Our core Data Platform integrates a data lake on AWS S3 with Apache Iceberg for enhanced reliability, while data ingestion leverages Confluent Kafka for real-time streaming and Fivetran for file ingestion. The transformation layer utilizes Apache Flink for stream processing, AWS Glue (Spark) for core ETL, and dbt/Athena for analytical data models. The platform efficiently serves data through specialized data stores, including Amazon DynamoDB for low-latency applications and Google BigQuery as the primary analytics engine. This is a hybrid role based in our New York City headquarters, reporting directly to the Sr. Director of Engineering. Expect to work in the office 2+ days per week.
Feb 5, 2026