Principal Software Engineer, Data Infrastructure
Experience Level
Senior
Qualifications
About The New York Times
The New York Times Company is driven by its enduring mission to seek the truth and foster a deeper understanding of the world. At the heart of our operations is independent journalism, supported by a world-class newsroom with correspondents in nearly 160 countries. We are committed to enhancing our readers' experiences across various mediums, including print, audio, and digital platforms. Our business strategy revolves around delivering exceptional journalism that our audience finds valuable enough to pay for.
Similar jobs
Peregrine Technologies
At Peregrine Technologies, backed by prominent Silicon Valley investors, we empower public safety organizations, local and state governments, federal agencies, and private sector entities to tackle societal challenges with unmatched speed and precision. Our cutting-edge AI platform transforms isolated data into actionable insights, delivering critical operational intelligence that enables swift, informed decisions to enhance outcomes across numerous sectors. Currently, we serve hundreds of clients across over 30 states and two countries, positively impacting over 125 million individuals. As we scale our operations into the enterprise and global markets, we are poised to amplify our influence even further.
Team
Our engineering team prioritizes empathy in developing superior solutions. Understanding user interactions with our products is essential for finding optimal answers. Engineers are encouraged to collaborate closely with our onsite team to grasp the diverse use cases that Peregrine addresses. We emphasize both ownership and teamwork: you will be entrusted with significant features while collaborating with fellow engineers to ensure successful completion. We believe that humility and empathy are crucial to crafting the right solutions, and you will engage directly with our deployment team and users as we refine our offerings to meet their needs. Perseverance and creativity will be key to realizing our vision.
Role
We are seeking a Staff Data Infrastructure Engineer to join our dynamic team. In this role, you will have substantial ownership of the data layer that supports all of Peregrine's operations. Your responsibilities will include architecting and building systems that ingest, store, and serve vast amounts of real-time operational data, enabling our clients to make critical decisions swiftly and confidently. This role is ideal for an experienced individual contributor who thrives on complex technical challenges and possesses the expertise and judgment to influence foundational infrastructure decisions. You will face a variety of intricate challenges, including:
Designing and managing a high-throughput, real-time data integration platform across diverse client environments.
Architecting a scalable open table format layer for reliable data storage at petabyte scale.
Building and optimizing distributed data processing pipelines using Apache Spark and related streaming technologies.
Enhancing performance, reliability, and cost efficiency across the entire data infrastructure stack.
Collaborating with platform and product engineering teams to establish data contracts, schemas, and integration pathways.
About Knot
At Knot, we are on a mission to revolutionize the way consumers and businesses interact through seamless merchant and banking experiences. Think of us as the 'Plaid for merchant connectivity.' Our innovative platform is designed to connect merchants with the multitude of applications that enhance everyday transactions. Our flagship product, CardSwitcher, empowers consumers to effortlessly update and manage their payment methods across various online merchant accounts like Netflix and PayPal. Additionally, our advanced solution, TransactionLink, allows for the retrieval of detailed transaction data, paving the way for new product development on our unique merchant connectivity platform. We invite you to join us in building these exciting new solutions!
Founded in 2021 by brothers and Thiel Fellows Rory and Kieran O’Reilly, Knot currently facilitates connected online payment experiences for hundreds of thousands of users. Our technology is trusted by industry leaders like American Express, PayPal, Current, BILT, and Step, who integrate Knot’s SDK into their applications to deliver exceptional experiences to their customers.
Backed by a distinguished group of investors including Nava Ventures, 8VC, and prominent figures from companies such as Twitter, Warby Parker, and DraftKings, Knot is well-positioned for continued growth and innovation.
Working at Knot
We pride ourselves on having a world-class team from diverse backgrounds, with a strong emphasis on engineering talent. As we expand our footprint in NYC, we aim to be at the forefront of the financial services landscape. Our team is dedicated to building exceptional products for our users, balancing a serious approach to our work with a fun and engaging work environment. We believe both aspects are integral to our success.
Your Role
Design, architect, deploy, document, and oversee our cloud-based network infrastructure.
Take ownership of critical API infrastructure that handles hundreds of requests per second.
Lead technical decisions, providing justification for designs and coordinating with other teams to ensure alignment on values and requirements.
Continuously enhance your knowledge of our infrastructure's long-term needs and capabilities.
Manage and troubleshoot complex technical issues and incidents, providing support and solutions as necessary.
Genius Sports
At Genius Sports, we combine cutting-edge technology with premier live data to revolutionize the sports experience for fans around the globe. Our mission is to create more immersive, interactive, and personalized experiences than ever before. Discover more about us at geniussports.com.
The Role - Staff Engineer - Infrastructure Platform
We are on the lookout for an exceptional Staff Engineer to spearhead critical projects within our core infrastructure platform. Genius Sports is currently integrating its diverse tech teams and acquisitions under a cohesive technical strategy, and our infrastructure platform is the foundation of this transformation. Our primary objective is to empower engineering teams to efficiently build, deploy, and manage Genius Sports’ extensive product catalog in a consistent manner. In this role, you will collaborate with fellow InfraPlat leaders to define and execute the technical vision and implementation across an array of projects. These initiatives encompass multi-account and multi-region Kubernetes clusters, MLOps, standardized deployment processes, and a centralized authentication platform. You will also engage with stakeholders from product engineering teams to assess requests, identify common challenges, and prioritize initiatives.
Join Privy as a Senior Infrastructure Engineer and play a pivotal role in shaping the future of online privacy and user ownership. Our team is dedicated to creating innovative developer tools that prioritize users, leveraging cutting-edge cryptography to redefine digital ownership. You will collaborate with a passionate engineering team to design and manage robust multi-tenant infrastructures that support billions of requests monthly, ensuring high performance and reliability across our services.
At Angi®, our mission for the past 30 years has been simple: to ensure that jobs are done right. We connect homeowners with trustworthy professionals who possess the necessary skills, while simultaneously linking these pros with homeowners seeking the jobs they desire.
Angi at a glance:
Homeowners have relied on Angi for over 300 million projects.
We cover more than 1,000 home service tasks.
Our team consists of 2,800 dedicated employees worldwide.
Why join Angi:
Angi® is on a mission to redefine the home services industry, fostering an environment where homeowners, professionals, and employees all benefit from a greater number of jobs completed successfully. For homeowners, our platform offers a dependable way to locate skilled professionals. For professionals, we act as a trustworthy business partner, helping them discover the work they want when they want it. For our employees, we provide an exceptional workplace that they can proudly call home. We look forward to welcoming you!
About the team:
We are currently searching for a Senior Data Engineer to join our Data Infrastructure team. This individual will play a pivotal role in constructing and managing the foundational platforms that facilitate data processing, storage, and analytics throughout our organization. The focus of this role will be on advancing our lakehouse architecture, data replication systems, and orchestration frameworks, all while ensuring scalable, reliable, and efficient data workflows.
Please note, although this role is remote, we are a global company seeking candidates located in the Eastern Time Zone to align with our team's working hours.
Stratos Labs Inc.
Overview
Role: Lead Software Engineer - DevOps and Infrastructure
On-site role at our New York City headquarters, 5 days a week
Annual base salary: $175,000 - $250,000
Equity: Competitive initial equity package along with refreshers
Minimum of 3 years of relevant experience required
About Stratos Labs
Stratos Labs is revolutionizing commodity risk management for the $10 trillion physical economy. Our innovative platform merges real-time market data with AI-driven exposure modeling and automated trade generation, empowering operators with precise tools to manage volatility. From instantaneous trade execution to ongoing monitoring, alerts, and actionable recommendations, Stratos Labs transforms complex market risks into a seamless, always-on hedging solution. Founded in 2023 by a former macro market-maker from Barclays and a trading systems engineer from Coinbase, we have successfully raised over $20 million in funding from esteemed investors including Andreessen Horowitz (a16z), Crucible Capital, Neo, and DST Global.
Key Responsibilities
Manage and Optimize AWS Cloud: Take complete ownership of our AWS environment, architecting, scaling, and optimizing our cloud infrastructure to enable new services while ensuring it is cost-effective and integrated with our core trading systems.
Infrastructure Architecture: Lead the transition to a full Infrastructure as Code (IaC) model by developing a sophisticated Terraform stack that eliminates manual configurations.
Security and Compliance: Act as the gatekeeper for our environments, managing container security, CVE remediation, and ensuring compliance with SOC2 standards without hindering team productivity.
Observability Strategy: Design a comprehensive monitoring strategy to proactively identify bottlenecks before they lead to outages.
CI/CD Pipeline Refinement: Enhance our GitHub Actions and CI/CD workflows to streamline the process of deploying Go, C++, and TypeScript services from commit to production.
Reliability and Incident Response: Participate in on-call rotations, conducting thorough post-mortems to ensure that recurring issues are resolved effectively.
Role overview
Braze, Inc. is hiring a Senior Staff Platform Infrastructure Engineer based in New York City. This position centers on designing and improving the core infrastructure that supports Braze’s products and services. The engineer will guide complex technical projects and partner with teams throughout the company.
What you will do
Design and enhance platform infrastructure to strengthen Braze’s offerings.
Take the lead on technical projects, managing them from initial planning through execution.
Work closely with colleagues across departments to deliver reliable and scalable solutions.
Share ideas and technical knowledge to help drive improvements and innovation in the platform.
Join CoreWeave as an Engineering Manager, leading our Data Infrastructure team. You will be at the forefront of designing, building, and scaling our data systems that support our innovative cloud solutions. This role is essential for driving efficiency and performance in our data handling, ensuring our infrastructure meets the demands of our growing customer base.
Harvey develops AI-driven solutions for legal and professional services, serving over 1,000 organizations in more than 60 countries. The company is growing rapidly and has strong support from leading investors. Harvey’s team values ownership, quick decision-making, and close collaboration. Engineers work side by side with leadership and customers to address practical challenges. The company operates in person in New York City and provides relocation support for new hires.
Role overview
The Staff Software Engineer - Core Infrastructure joins a team responsible for designing, building, and scaling Harvey’s core infrastructure. This platform handles billions of prompt tokens and millions of daily requests, forming the backbone of Harvey’s global legal AI services. The work involves both creating new systems and maintaining high operational standards. Reliability, scalability, and security are central as Harvey continues to expand its reach among top law firms and professional service providers. This is a full-time, in-person position based in New York City. Relocation assistance is available.
What you will do
Design and build scalable, fault-tolerant infrastructure systems that support Harvey’s AI platform across multiple cloud regions.
Take ownership of and improve multi-cloud infrastructure (Azure, GCP), focusing on Kubernetes orchestration, networking, and container management.
Lead technical projects in areas such as observability, incident response, and performance optimization.
Speechify aims to remove barriers to learning by transforming text into audio. Over 50 million people use Speechify’s text-to-speech tools to listen to PDFs, books, Google Docs, news, and websites. The product suite covers iOS, Android, Mac, Chrome, and web platforms. Google recognized Speechify as Chrome Extension of the Year, and Apple awarded it the 2025 Design Award for Inclusivity. The company operates fully remotely with a team of nearly 200. Team members include frontend and backend engineers, AI researchers, and professionals from Amazon, Microsoft, Google, Stanford, and founders of successful startups.
Role overview
Speechify is hiring a Software Engineer for the Data Infrastructure & Acquisition team in the AI department. This role centers on managing and improving data collection processes that support model training. The team builds large-scale, high-quality datasets for AI research and development, focusing on both scale and cost efficiency.
Location: Rochester, NY, USA (remote team)
Speechify is seeking a Software Engineer focused on Data Infrastructure and Acquisition based in Ithaca, NY. This position centers on building and refining the core systems that drive the company's data operations. The work includes designing and enhancing data pipelines, supporting efficient processing, and developing methods for acquiring new data sources.
Key Responsibilities
Develop and optimize data infrastructure that supports Speechify’s products and services.
Implement and improve processes for acquiring and handling data.
Collaborate with engineers and other teams to strengthen the overall technology stack.
Contribute to how data is managed, processed, and integrated across the company.
Collaboration
This role works alongside a skilled team dedicated to advancing Speechify’s approach to data. Team members share knowledge, solve problems together, and drive ongoing improvements in data systems.
Join Spotify as a Backend Engineer within our innovative Data Infrastructure engineering team! In this exciting role, you will pioneer new methods that empower our teams to create insightful analytics for both internal use and the music industry. Our cutting-edge platform provides essential functionalities, from revealing the number of streams for artists' latest releases to assisting internal teams in managing cloud resource utilization.
As a vital member of our platform team, your contributions will be instrumental in exemplifying, measuring, and enhancing the reliability of our data infrastructure across various squads within Spotify. Collaborating closely with fellow engineers, you will deliver OLAP capabilities to facilitate dynamic and dependable data visualizations, while also sharing the responsibility of diagnosing, resolving, and preventing production issues. We believe in empowering engineering teams to take operational ownership of their products and are dedicated to providing the support they need to succeed.
NBCUniversal Media, LLC
Join our innovative team at NBCUniversal as a Staff Software Engineer specializing in AI Infrastructure and Python development. This role involves designing, building, and maintaining scalable AI systems. We seek a passionate engineer who thrives in a collaborative environment and is eager to contribute to cutting-edge projects that impact millions of users.
SecurityScorecard
About SecurityScorecard
SecurityScorecard provides cybersecurity ratings and monitors over 12 million companies across 64 countries. Founded in 2013 by Dr. Alex Yampolskiy and Sam Kassoumeh, the company’s patented technology supports more than 25,000 organizations with self-monitoring, third-party risk management, board reporting, and cyber insurance underwriting. Headquartered in New York City, SecurityScorecard has earned recognition from Inc Magazine as a 'Best Workplace' and from Crain’s NY as one of the 'Best Places to Work in NYC.' The company was named among the 10 hottest SaaS startups in New York for two years running. In 2023, SecurityScorecard appeared on Fast Company’s annual list of the World’s Most Innovative Companies and received the Achievers 50 Most Engaged Workplaces award. Investors include Silver Lake Waterman, Moody’s, Sequoia Capital, GV, and Riverwood Capital.
Role Overview: Staff Infrastructure Engineer (Hybrid, NYC)
SecurityScorecard is hiring a Staff Infrastructure Engineer to oversee and improve the systems that support company operations. This is a senior, hands-on position based in New York City with a hybrid work arrangement. The Staff Infrastructure Engineer will take primary technical ownership of corporate identity, endpoint management, collaboration platforms, and AI workflow tools. The role involves close collaboration with the CISO and coordination with an IT counterpart in Austin. From the first day, this engineer will manage IT operations and is expected to assume full ownership of the technology stack within the first 90 days.
Speechify builds technology to make reading more accessible for everyone. Our text-to-speech tools help over 50 million people turn PDFs, books, Google Docs, news articles, and websites into audio, making it easier to read, learn, and retain information. Our products span iOS, Android, Mac, and a Chrome extension. Google named our Chrome extension Extension of the Year, and Apple recognized us with the 2025 Design Award for Inclusivity. Our team is fully remote and includes nearly 200 people from backgrounds at Amazon, Microsoft, Google, and top universities such as Stanford. We value inclusion and collaboration across all levels.
Role Overview
The Data team within Speechify’s AI division is hiring a Software Engineer focused on data infrastructure and acquisition. This engineer will work on building and maintaining systems for large-scale data collection, supporting model training efforts, and helping us create high-quality datasets at petabyte scale.
Location: New York, NY, USA
Peregrine Technologies
At Peregrine Technologies, we are backed by prominent Silicon Valley investors and are committed to aiding public safety organizations, state and local governments, federal agencies, and private-sector institutions in meeting society's challenges with remarkable speed and precision. Our cutting-edge AI-enabled platform converts fragmented and isolated data into actionable operational intelligence, providing immediate access to crucial information that empowers swift and informed decision-making, enhancing outcomes across all interactions. Presently, Peregrine serves hundreds of clients across over 30 states and two countries, benefiting more than 125 million people, and we are on a mission to expand our impact globally.
Team
We believe that empathy is essential in crafting superior solutions. Understanding how users interact with our products is a priority, allowing us to arrive at the best answers effectively. Engineers will closely collaborate with our onsite team to comprehend the diverse use cases that Peregrine addresses.
We are actively seeking a Data Governance Engineer to join our dynamic engineering teams, tackling a variety of challenges, from facilitating real-time user collaboration on intricate maps to constructing high-scale backend architectures capable of processing billions of data points.
The Data Governance team is responsible for developing services, systems, and product features that enable our customers to manage their data assets throughout their lifecycle within the Peregrine platform. We ensure robust, secure data access and comprehensive audit capabilities.
Role
As a Software Engineer on our expanding team, you will enjoy significant ownership over our technology stack. You will play an integral role in designing and constructing the next iteration of our core access control and governance systems, which manage access to over 7 billion data points within Peregrine's multi-tenant data platform. Your work will involve implementing fine-grained permissions, audit functionality, policy enforcement, metadata management, data labeling, and compliance across the platform.
You will collaborate closely with Product Managers to guide projects from conceptualization to design and execution. Your design and development of data governance features will involve making thoughtful trade-offs to achieve an optimal balance between security and user experience.
Our technology stack is continually evolving, built on a robust backend foundation comprising Python, Django, Celery, Airflow, and Kafka, with a frontend developed using React, Redux, and Mapbox. We utilize data stores including PostgreSQL and Elasticsearch, and host machine learning models on Bedrock and Sagemaker, while leveraging AWS, Pulumi, Terraform, and Kubernetes as our foundational infrastructure.
The New York Times
The New York Times is on the lookout for a dynamic Principal Software Engineer to spearhead the architecture and advancement of our data and machine learning infrastructure. This pivotal role will lay the groundwork for innovative data-driven products, analytics, and AI applications. You will be responsible for designing robust systems that facilitate large-scale data processing, reliable pipelines, and efficient machine learning development, including feature engineering and real-time model serving. As a principal engineer, you will collaborate closely with product, data science, and platform teams to establish the technical direction, promote the adoption of reusable frameworks, and mentor engineers throughout the organization. Your focus will be on ensuring that both data and ML platforms are scalable, reliable, cost-efficient, and compliant with privacy and governance standards. Our core Data Platform integrates a data lake on AWS S3 with Apache Iceberg for enhanced reliability, while data ingestion leverages Confluent Kafka for real-time streaming and Fivetran for file ingestion. The transformation layer utilizes Apache Flink for stream processing, AWS Glue (Spark) for core ETL, and dbt/Athena for analytical data models. The platform efficiently serves data through specialized data stores, including Amazon DynamoDB for low-latency applications and Google BigQuery as the primary analytics engine. This is a hybrid role based in our New York City headquarters, reporting directly to the Sr. Director of Engineering. Expect to work in the office 2+ days per week.
At Hugging Face, we're on a mission to democratize outstanding AI solutions. Our platform is rapidly becoming the premier destination for AI developers, boasting over 5 million users and 100,000 organizations that have shared more than 1 million models, 300,000 datasets, and 300,000 applications. Our open-source libraries have garnered over 400,000 stars on GitHub.
About the Role
As the first Data Infrastructure Advocate Engineer at Hugging Face, you will play a crucial role in connecting innovative data infrastructure with a vibrant community of data engineers, researchers, and developers. You will advocate for Xet storage on the Hugging Face Hub, enabling users to efficiently store, version, and collaborate on large datasets. This position is ideal for someone who excels at the intersection of technical expertise (storage, Parquet, deduplication) and community engagement, helping shape the future of open data workflows.
In this role, you will collaborate with various teams such as Datasets, Hub, and Infrastructure to enhance how developers interact with data on our platform, inspiring a community to create better, faster, and more scalable data pipelines.
Your Key Responsibilities:
Build and support the open-source data and infrastructure community by launching initiatives, collaborating with data-focused groups, and organizing events or challenges. Engage with communities such as Apache Parquet, Open Tables Formats, and data engineering forums to promote best practices and Hugging Face tools.
Position the Hugging Face Hub as the leading platform for data storage, versioning, and collaboration by curating and showcasing datasets, benchmarks, and tools like Xet.
Demonstrate the Hub's value for data workflows by highlighting use cases such as efficient large dataset updates, Parquet editing, and deduplication.
Develop demos, benchmarks, and tools (e.g., Colab notebooks) to showcase best practices for data storage and versioning. Experiment with Xet, Parquet, and other data formats to reveal their potential in machine learning and data engineering.
Create informative tutorials, blog posts, and videos that simplify complex topics.
Share valuable insights on storage optimization, dataset versioning, and deduplication to empower developers.
Engage actively in online communities (Discord, GitHub, forums) to showcase contributions, answer queries, and encourage collaboration.
Ensure comprehensive documentation for datasets and tools released on the Hub, including clear examples, benchmarks, and use cases.
About You
You are an ideal candidate if you:
Possess strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
Are a hands-on developer who enjoys experimenting with data tools, storage optimization, and dataset versioning.
Can clearly articulate complex concepts to varied audiences.
Join rogo as a Staff Backend Engineer specialized in Financial Data, where you will play a crucial role in developing robust backend systems that handle critical financial data processes. You will work closely with cross-functional teams to create scalable solutions that meet the evolving needs of our clients.
rowspace
Role overview
rowspace is looking for an Infrastructure Engineer based in New York City. This position centers on building and securing the core systems that power our AI data platform. The work involves designing infrastructure that processes large volumes of sensitive financial data, with particular attention to security and compliance. Integrating both public and private, tenant-specific customer data in real time and at scale is a key part of this role.
What you will do
Design and build scalable infrastructure for an AI knowledge engine that works with structured and unstructured financial data.
Develop secure architectures for private cloud environments, ensuring alignment with financial services compliance standards.
Create data ingestion pipelines for sources such as CapIQ feeds and internal SharePoint documents.
Develop monitoring and alerting tools for our Bring Your Own Cloud (BYOC) platform.
Set up access controls and audit trails to trace AI interactions back to original data sources.
Collaborate with AI Research and Product teams to optimize infrastructure for large language model (LLM) inference, training, and agent development.
Implement CI/CD workflows and infrastructure-as-code for reliable deployments across multiple cloud providers.

