companyThe New York Times logo

Principal Software Engineer, Data Infrastructure

The New York TimesNew York, NY
Hybrid Full-time $198K/yr - $220K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

The ideal candidate will possess a deep understanding of data architecture, machine learning frameworks, and cloud technologies. A minimum of 8 years of experience in software engineering, with a proven track record of leading complex projects. Proficiency in modern programming languages such as Python, Java, or Scala is essential. Experience with AWS technologies, data lakes, and real-time data processing systems is highly desirable. Strong problem-solving skills, the ability to work in a collaborative environment, and excellent communication skills are a must. A passion for journalism and the mission of The New York Times will be a significant plus.

About the job

The New York Times is on the lookout for a dynamic Principal Software Engineer to spearhead the architecture and advancement of our data and machine learning infrastructure. This pivotal role will lay the groundwork for innovative data-driven products, analytics, and AI applications. You will be responsible for designing robust systems that facilitate large-scale data processing, reliable pipelines, and efficient machine learning development, including feature engineering and real-time model serving. As a principal engineer, you will collaborate closely with product, data science, and platform teams to establish the technical direction, promote the adoption of reusable frameworks, and mentor engineers throughout the organization. Your focus will be on ensuring that both data and ML platforms are scalable, reliable, cost-efficient, and compliant with privacy and governance standards. Our core Data Platform integrates a data lake on AWS S3 with Apache Iceberg for enhanced reliability, while data ingestion leverages Confluent Kafka for real-time streaming and Fivetran for file ingestion. The transformation layer utilizes Apache Flink for stream processing, AWS Glue (Spark) for core ETL, and dbt/Athena for analytical data models. The platform efficiently serves data through specialized data stores, including Amazon DynamoDB for low-latency applications and Google BigQuery as the primary analytics engine. This is a hybrid role based in our New York City headquarters, reporting directly to the Sr. Director of Engineering. Expect to work in the office 2+ days per week.

About The New York Times

The New York Times Company is driven by its enduring mission to seek the truth and foster a deeper understanding of the world. At the heart of our operations is independent journalism, supported by a world-class newsroom with correspondents in nearly 160 countries. We are committed to enhancing our readers' experiences across various mediums, including print, audio, and digital platforms. Our business strategy revolves around delivering exceptional journalism that our audience finds valuable enough to pay for.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.