Principal Software Engineer, Data Infrastructure
Experience Level
Senior
About The New York Times
The New York Times Company is driven by its enduring mission to seek the truth and foster a deeper understanding of the world. At the heart of our operations is independent journalism, supported by a world-class newsroom with correspondents in nearly 160 countries. We are committed to enhancing our readers' experiences across various mediums, including print, audio, and digital platforms. Our business strategy revolves around delivering exceptional journalism that our audience finds valuable enough to pay for.
Similar jobs
At Angi®, our mission for the past 30 years has been simple: to ensure that jobs are done right. We connect homeowners with trustworthy professionals who possess the necessary skills, while simultaneously linking these pros with homeowners seeking the jobs they desire.
Angi at a glance:
- Homeowners have relied on Angi for over 300 million projects.
- We cover more than 1,000 home service tasks.
- Our team consists of 2,800 dedicated employees worldwide.
Why join Angi:
Angi® is on a mission to redefine the home services industry, fostering an environment where homeowners, professionals, and employees all benefit from a greater number of jobs completed successfully.
For homeowners, our platform offers a dependable way to locate skilled professionals. For professionals, we act as a trustworthy business partner, helping them discover the work they want when they want it. For our employees, we provide an exceptional workplace that they can proudly call home. We look forward to welcoming you!
About the team:
We are currently searching for a Senior Data Engineer to join our Data Infrastructure team. This individual will play a pivotal role in constructing and managing the foundational platforms that facilitate data processing, storage, and analytics throughout our organization. The focus of this role will be on advancing our lakehouse architecture, data replication systems, and orchestration frameworks, all while ensuring scalable, reliable, and efficient data workflows.
Please note, although this role is remote, we are a global company seeking candidates located in the Eastern Time Zone to align with our team's working hours.
At Angi®, our mission for the past 30 years has been simple: to ensure that jobs are completed effectively. We achieve this by facilitating connections between homeowners and trustworthy professionals who possess the necessary skills, while simultaneously helping professionals discover the opportunities they desire.
Angi at a glance:
- Homeowners have relied on Angi for over 300 million projects.
- We cover more than 1,000 home service tasks.
- Our team consists of over 2,800 employees globally.
Why join Angi:
At Angi®, we are revolutionizing the home services landscape by fostering an environment where homeowners, professionals, and employees can thrive together. For homeowners, our platform is a dependable resource for finding skilled service providers. For professionals, we serve as a trusted business ally, helping them secure the work they want at their convenience. For employees, Angi is an exceptional place to grow and contribute. We look forward to welcoming you to our team.
About the team:
Our product-engineering team operates at the intersection of partnerships and artificial intelligence, developing integrations and LLM-enabled features that are steering the direction of our business. While we move swiftly, we prioritize accuracy and quality: our engineers are just as passionate about clean architecture as they are about delivering results.
As a close-knit, remote-first team that spans multiple time zones, we have cultivated a culture rooted in transparent communication, thorough documentation, and mutual trust. If you thrive on ownership of significant challenges and aspire to evolve alongside your colleagues, this is the ideal team for you.
Please note: This is a remote position, but we are a global company seeking applicants located in the Eastern Time Zone to align with our team's working hours.
What you’ll do:
- Build and Ship: Develop, modify, and review full-stack code that adheres to our stringent standards for performance, reliability, security, and testability.
- Architect Thoughtfully: Contribute to discussions about system design and assist in converting business requirements into scalable technical solutions.
- Own Quality: Create robust automated tests to ensure that every deployment is efficient, reliable, and seamless.
- Work at the Frontier: Engage with LLM-powered features and partnership integrations that are influencing our product's future direction.
- Communicate Clearly: Foster clear communication within the team and with stakeholders to ensure alignment and understanding.
Join Spotify as a Backend Engineer within our innovative Data Infrastructure engineering team! In this exciting role, you will pioneer new methods that empower our teams to create insightful analytics for both internal use and the music industry. Our cutting-edge platform provides essential functionalities, from revealing the number of streams for artists' latest releases to assisting internal teams in managing cloud resource utilization.
As a vital member of our platform team, your contributions will be instrumental in exemplifying, measuring, and enhancing the reliability of our data infrastructure across various squads within Spotify. Collaborating closely with fellow engineers, you will deliver OLAP capabilities to facilitate dynamic and dependable data visualizations, while also sharing the responsibility of diagnosing, resolving, and preventing production issues. We believe in empowering engineering teams to take operational ownership of their products and are dedicated to providing the support they need to succeed.
Peregrine Technologies
At Peregrine Technologies, backed by prominent Silicon Valley investors, we empower public safety organizations, local and state governments, federal agencies, and private sector entities to tackle societal challenges with unmatched speed and precision. Our cutting-edge AI platform transforms isolated data into actionable insights, delivering critical operational intelligence that enables swift, informed decisions to enhance outcomes across numerous sectors. Currently, we serve hundreds of clients across over 30 states and two countries, positively impacting over 125 million individuals. As we scale our operations into the enterprise and global markets, we are poised to amplify our influence even further.
Team
Our engineering team prioritizes empathy in developing superior solutions. Understanding user interactions with our products is essential for finding optimal answers. Engineers are encouraged to collaborate closely with our onsite team to grasp the diverse use cases that Peregrine addresses.
We emphasize both ownership and teamwork: you will be entrusted with significant features while collaborating with fellow engineers to ensure successful completion. We believe that humility and empathy are crucial to crafting the right solutions, and you will engage directly with our deployment team and users as we refine our offerings to meet their needs. Perseverance and creativity will be key to realizing our vision.
Role
We are seeking a Staff Data Infrastructure Engineer to join our dynamic team. In this role, you will have substantial ownership of the data layer that supports all of Peregrine's operations. Your responsibilities will include architecting and building systems that ingest, store, and serve vast amounts of real-time operational data, enabling our clients to make critical decisions swiftly and confidently.
This role is ideal for an experienced individual contributor who thrives on complex technical challenges and possesses the expertise and judgment to influence foundational infrastructure decisions. You will face a variety of intricate challenges, including:
- Designing and managing a high-throughput, real-time data integration platform across diverse client environments
- Architecting a scalable open table format layer for reliable data storage at petabyte scale
- Building and optimizing distributed data processing pipelines using Apache Spark and related streaming technologies
- Enhancing performance, reliability, and cost efficiency across the entire data infrastructure stack
- Collaborating with platform and product engineering teams to establish data contracts, schemas, and integration pathways
Speechify is seeking a Software Engineer focused on Data Infrastructure and Acquisition based in Ithaca, NY. This position centers on building and refining the core systems that drive the company's data operations. The work includes designing and enhancing data pipelines, supporting efficient processing, and developing methods for acquiring new data sources.
Key Responsibilities
- Develop and optimize data infrastructure that supports Speechify’s products and services
- Implement and improve processes for acquiring and handling data
- Collaborate with engineers and other teams to strengthen the overall technology stack
- Contribute to how data is managed, processed, and integrated across the company
Collaboration
This role works alongside a skilled team dedicated to advancing Speechify’s approach to data. Team members share knowledge, solve problems together, and drive ongoing improvements in data systems.
Join CoreWeave as an Engineering Manager, leading our Data Infrastructure team. You will be at the forefront of designing, building, and scaling our data systems that support our innovative cloud solutions. This role is essential for driving efficiency and performance in our data handling, ensuring our infrastructure meets the demands of our growing customer base.
Genius Sports
At Genius Sports, we are pioneering a new era in sports engagement by combining advanced technology with the highest quality live data. Our goal is to create immersive, interactive, and personalized experiences for sports fans around the globe. Learn more about us at geniussports.com.
Position Overview - Senior Software Engineer, Data Platform
We are seeking a Senior Data Platform Engineer to architect and scale the core systems that underpin our data ecosystem. In this pivotal role, you will lead the design and modernization of our data infrastructure, empowering teams throughout the organization to swiftly, reliably, and securely access and utilize data. You will spearhead significant platform initiatives from inception to production, carefully navigate architectural decisions, and closely collaborate with product teams and cross-functional stakeholders to drive engineering strategy.
This role is perfect for a technically adept leader with extensive experience in distributed systems, modern data lakehouse and streaming architectures, and large-scale data operations. A strong passion for mentorship, technical excellence, and creating systems that yield substantial business impact is essential.
Key Responsibilities
- Design, construct, and oversee the foundational systems and features of the Data Platform, facilitating internal teams' ability to access and utilize data efficiently and securely.
- Lead the modernization of our data infrastructure while managing technical complexities and making informed architectural decisions to enhance our foundation and boost organizational productivity.
- Act as a technical leader for extensive Data Platform initiatives, influencing architectural choices and engineering direction in collaboration with product and cross-functional teams.
- Mentor junior and mid-level engineers, enhancing their skills and preparing them for greater responsibilities.
- Stay informed on emerging technologies and industry advancements in data systems and platform development.
Speechify aims to remove barriers to learning by transforming text into audio. Over 50 million people use Speechify’s text-to-speech tools to listen to PDFs, books, Google Docs, news, and websites. The product suite covers iOS, Android, Mac, Chrome, and web platforms. Google recognized Speechify as Chrome Extension of the Year, and Apple awarded it the 2025 Design Award for Inclusivity. The company operates fully remotely with a team of nearly 200. Team members include frontend and backend engineers, AI researchers, and professionals from Amazon, Microsoft, Google, Stanford, and founders of successful startups.
Role overview
Speechify is hiring a Software Engineer for the Data Infrastructure & Acquisition team in the AI department. This role centers on managing and improving data collection processes that support model training. The team builds large-scale, high-quality datasets for AI research and development, focusing on both scale and cost efficiency.
Location
Rochester, NY, USA (remote team)
Hinge
Join Hinge: The Dating App Designed for Lasting Connections
In an age where meaningful relationships are increasingly elusive, Hinge is dedicated to fostering intimate connections that combat loneliness. Our team is passionate about analyzing user behavior to enhance the dating experience, with our ultimate success measured by the number of successful dates we facilitate. With a global user base in the millions, Hinge has become the premier app for those seeking meaningful relationships.
Role Overview:
As a Senior Data Engineer at Hinge, you will play a vital role in developing key data processes and contributing to a cutting-edge data pipeline that supports our strategic decision-making. Your contributions will not only solve complex challenges but also empower our analytical teams, directly impacting the love lives of countless individuals. You will enhance core functionalities and implement essential data pipelines that serve both the Data Engineering team and the broader organization, all while tackling real-world problems within a big data architecture.
Join CoreWeave as a Senior Site Reliability Engineer specializing in Data Infrastructure. In this pivotal role, you will ensure the reliability and sustainability of our data systems, working closely with our development teams to optimize performance and availability. You will be instrumental in enhancing our infrastructure to support the growing needs of our clients.
CGS Federal
Join CGS as a Senior Data Engineer
Employment Type: Full-Time, Mid-level
Department: Business Intelligence
CGS Federal is actively looking for a dedicated and innovative Senior Data Engineer to enhance our expanding Data Analytics and Business Intelligence platform. Our focus is on providing impactful solutions that enable our federal clients to transform data into actionable insights. The ideal candidate will be a proactive problem solver and lifelong learner, eager to explore diverse technologies while tackling some of the most challenging issues faced by our clients.
At CGS, we unite driven, highly skilled, and inventive individuals to address the government's most pressing challenges through state-of-the-art technology. We seek candidates who are enthusiastic about contributing to governmental innovation, value teamwork, and can anticipate the needs of others. Our work environment fosters support and encourages professional development through various learning opportunities.
Key Responsibilities:
- Develop comprehensive data pipelines to efficiently manage and provision data to stakeholders.
- Collaborate actively within an Agile/Scrum team, adhering to best practices.
- Write robust code to ensure optimal performance and reliability in data processing and extraction.
- Drive automation processes for seamless data ingestion.
- Uphold technical excellence by following lean-agile engineering principles, including API-first design and automated testing.
- Engage with program management and technical teams to document evolving requirements.
- Foster a culture of exceptional customer service, innovation, collaboration, and teamwork.
- Work in a cross-functional team including UX researchers, designers, product managers, engineers, and specialists.
Qualifications:
- Must be a U.S. Citizen.
- Ability to obtain a Public Trust Clearance is required.
- At least 7 years of IT experience, particularly in the design and management of substantial data sets and models.
- Proven experience in developing data pipelines from a variety of structured and unstructured data sources.
- Proficient in ETL processes, including testing and validation steps.
- Strong skills in data manipulation using Python, R, SQL, or SAS.
- In-depth knowledge of big data analysis and storage solutions.
City of New York
Join the City of New York as a Senior Data Engineer, where your expertise in data engineering will drive impactful decisions and enhance city services. You will collaborate with cross-functional teams to design, build, and maintain robust data pipelines, ensuring data integrity and accessibility for analytics.
January
At January, we are revolutionizing the credit landscape. Our innovative, data-driven platform is designed to restore trust, achieve tangible results, and empower millions to progress toward a brighter financial future, while also humanizing the consumer finance experience. By harnessing data intelligence, we strive to build trust and deliver improved outcomes for both consumers and creditors.
Our mission is straightforward: to broaden access to credit while equipping consumers with the tools they need to attain long-lasting stability and control over their financial lives. We initiated our journey by creating a robust foundation for creditors to engage and support borrowers throughout the entire debt lifecycle. By merging exceptional performance with unique consumer satisfaction and strict compliance, we have perfected outsourced collections. Our journey is just beginning, and together we are forging a financial system where trust and opportunity catalyze enduring change in people's lives.
About the Role
As the pioneering Senior Data Engineer at January, you will redefine our approach to data utilization in expanding access to credit, not merely by fixing existing issues but by unlocking new possibilities. You will take complete ownership of our modern data stack, transforming it from a system supported sporadically by analysts and engineers into a premier platform that anticipates and facilitates our most ambitious data initiatives. You will architect the data infrastructure that empowers millions to achieve financial stability, ensuring that insights flow seamlessly from production to decision-makers. By establishing data engineering as a core discipline at January, you will allow our analysts to focus on insights while you construct the scalable foundation that propels our next growth phase.
What You'll Do
- Own and enhance our comprehensive data platform: transforming our Snowflake warehouse from analyst-managed to engineer-optimized while standardizing data models for customer reporting, operational dashboards, and machine learning features.
- Create robust, self-healing data pipelines: designing ETL processes that scale automatically with data volume, implementing monitoring systems that preemptively identify issues, and optimizing costs without compromising performance.
- Facilitate accessible data utilization: developing intuitive models that empower PMs, analysts, and operations teams to independently find answers while adhering to security and compliance standards.
- Integrate engineering with analytics: establishing feedback mechanisms between production systems and analytical demands, ensuring schema changes do not disrupt downstream dependencies, and influencing how new features generate data.
- Lead data initiatives: championing projects that enhance our data capabilities, contributing to strategic data-driven decisions, and mentoring junior engineers.
At Hugging Face, we're on a mission to democratize outstanding AI solutions. Our platform is rapidly becoming the premier destination for AI developers, boasting over 5 million users and 100,000 organizations that have shared more than 1 million models, 300,000 datasets, and 300,000 applications. Our open-source libraries have garnered over 400,000 stars on GitHub.
About the Role
As the first Data Infrastructure Advocate Engineer at Hugging Face, you will play a crucial role in connecting innovative data infrastructure with a vibrant community of data engineers, researchers, and developers. You will advocate for Xet storage on the Hugging Face Hub, enabling users to efficiently store, version, and collaborate on large datasets. This position is ideal for someone who excels at the intersection of technical expertise (storage, Parquet, deduplication) and community engagement, helping shape the future of open data workflows.
In this role, you will collaborate with various teams such as Datasets, Hub, and Infrastructure to enhance how developers interact with data on our platform, inspiring a community to create better, faster, and more scalable data pipelines.
Your Key Responsibilities:
- Build and support the open-source data and infrastructure community by launching initiatives, collaborating with data-focused groups, and organizing events or challenges. Engage with communities such as Apache Parquet, Open Table Formats, and data engineering forums to promote best practices and Hugging Face tools.
- Position the Hugging Face Hub as the leading platform for data storage, versioning, and collaboration by curating and showcasing datasets, benchmarks, and tools like Xet.
- Demonstrate the Hub's value for data workflows by highlighting use cases such as efficient large dataset updates, Parquet editing, and deduplication.
- Develop demos, benchmarks, and tools (e.g., Colab notebooks) to showcase best practices for data storage and versioning. Experiment with Xet, Parquet, and other data formats to reveal their potential in machine learning and data engineering.
- Create informative tutorials, blog posts, and videos that simplify complex topics.
- Share valuable insights on storage optimization, dataset versioning, and deduplication to empower developers.
- Engage actively in online communities (Discord, GitHub, forums) to showcase contributions, answer queries, and encourage collaboration.
- Ensure comprehensive documentation for datasets and tools released on the Hub, including clear examples, benchmarks, and use cases.
About You
You are an ideal candidate if you:
- Possess strong technical skills in Python, data libraries (e.g., pandas, pyarrow, huggingface/datasets), and storage systems (Parquet, Open Table Formats, S3).
- Are a hands-on developer who enjoys experimenting with data tools, storage optimization, and dataset versioning.
- Can clearly articulate complex concepts to varied audiences.
Speechify builds technology to make reading more accessible for everyone. Our text-to-speech tools help over 50 million people turn PDFs, books, Google Docs, news articles, and websites into audio, making it easier to read, learn, and retain information. Our products span iOS, Android, Mac, and a Chrome extension. Google named our Chrome extension Extension of the Year, and Apple recognized us with the 2025 Design Award for Inclusivity. Our team is fully remote and includes nearly 200 people from backgrounds at Amazon, Microsoft, Google, and top universities such as Stanford. We value inclusion and collaboration across all levels.
Role Overview
The Data team within Speechify’s AI division is hiring a Software Engineer focused on data infrastructure and acquisition. This engineer will work on building and maintaining systems for large-scale data collection, supporting model training efforts, and helping us create high-quality datasets at petabyte scale.
Location
New York, NY, USA
NBCUniversal Media, LLC
Join NBCUniversal Media, a global leader in media and entertainment, as a Senior Staff Data Engineer. In this pivotal role, you will design, build, and maintain scalable data pipelines and architectures to support our data-driven initiatives. Collaborate with cross-functional teams to leverage big data technologies and drive innovation in data processing and analytics.
Charlie Health
We are seeking a highly skilled Senior Data Engineer to join our dynamic team at Charlie Health. In this pivotal role, you will be responsible for designing, building, and maintaining scalable data pipelines that drive our analytics and reporting capabilities. You will work closely with data scientists and analysts to ensure that our data infrastructure meets the needs of our growing organization.
The ideal candidate will have a strong background in data engineering, a passion for working with large datasets, and a desire to make a difference in the healthcare sector.
The New York Times
The New York Times is on the lookout for a dynamic Principal Software Engineer to spearhead the architecture and advancement of our data and machine learning infrastructure. This pivotal role will lay the groundwork for innovative data-driven products, analytics, and AI applications. You will be responsible for designing robust systems that facilitate large-scale data processing, reliable pipelines, and efficient machine learning development, including feature engineering and real-time model serving. As a principal engineer, you will collaborate closely with product, data science, and platform teams to establish the technical direction, promote the adoption of reusable frameworks, and mentor engineers throughout the organization. Your focus will be on ensuring that both data and ML platforms are scalable, reliable, cost-efficient, and compliant with privacy and governance standards. Our core Data Platform integrates a data lake on AWS S3 with Apache Iceberg for enhanced reliability, while data ingestion leverages Confluent Kafka for real-time streaming and Fivetran for file ingestion. The transformation layer utilizes Apache Flink for stream processing, AWS Glue (Spark) for core ETL, and dbt/Athena for analytical data models. The platform efficiently serves data through specialized data stores, including Amazon DynamoDB for low-latency applications and Google BigQuery as the primary analytics engine. This is a hybrid role based in our New York City headquarters, reporting directly to the Sr. Director of Engineering. Expect to work in the office 2+ days per week.
DoubleVerify
Join DoubleVerify as a Senior Data Engineer I, where you'll play a pivotal role in transforming the way our clients access and analyze data within our innovative platforms. In this position, you will leverage your expertise to build scalable data pipelines, ensuring the integrity and accessibility of our data assets.
Your responsibilities will include collaborating with cross-functional teams to design, implement, and maintain data solutions that drive business decisions. This role demands strong analytical skills and a passion for problem-solving in a dynamic environment.
Join versant3 as a Senior Data Engineer and play a pivotal role in shaping our data infrastructure. You will design and implement scalable data pipelines, ensuring data integrity and accessibility for our analytics and machine learning initiatives.

