Software Engineer - Data Infrastructure & Acquisition

Speechify · Atlanta, GA, USA
Remote · Full-time · $140K/yr - $200K/yr

Experience Level

Mid to Senior

Qualifications

Ideal Candidate Qualifications

  • Holder of a BS, MS, or PhD in Computer Science or a related discipline.
  • Minimum of 5 years of relevant industry experience in software development.
  • Strong proficiency in bash/Python scripting within Linux environments.
  • Solid understanding of Docker and Infrastructure-as-Code practices, with professional experience in at least one major Cloud Provider (GCP preferred).
  • Experience with web crawlers and large-scale data processing workflows is an advantage.
  • Ability to manage multiple priorities and adapt to evolving challenges.
  • Excellent communication skills, both verbal and written.

About the job

Speechify builds tools that turn reading materials into audio, helping over 50 million people access content in new ways. Our products work across iOS, Android, Mac, Chrome Extension, and Web. Google named us Chrome Extension of the Year, and Apple recognized us with the 2025 Design Award for Inclusivity.

Our remote team includes nearly 200 professionals from leading tech companies and top universities. Collaboration happens across frontend and backend engineering, AI research, and more, without a central office.

About the Role

The Data team in Speechify’s AI group is hiring a Software Engineer focused on data infrastructure and acquisition. This role is central to collecting and managing the large-scale datasets that support our model training. The work combines infrastructure, engineering, and research to deliver high-quality data at petabyte scale and low cost.

What You Will Do

  • Find and source new audio data for our ingestion pipeline.
  • Manage and improve our cloud infrastructure for data ingestion, currently on Google Cloud Platform (GCP) and maintained with Terraform.
  • Work with Scientists to optimize cost, throughput, and data quality, supporting larger and richer datasets for advanced models.
  • Partner with the AI team and company leadership to shape the dataset roadmap for future consumer and enterprise products.

Location

This position is based in Atlanta, GA, USA.

About Speechify

Speechify is dedicated to revolutionizing how individuals approach reading and learning. With cutting-edge text-to-speech technology, we remove the barriers that hinder comprehension and retention, making information accessible to everyone. Our global team, composed of experts from leading technology firms and academia, collaborates remotely to continuously improve our products.
