About the Job
Speechify builds tools that turn reading materials into audio, helping over 50 million people access content in new ways. Our products work across iOS, Android, Mac, Chrome Extension, and Web. Google named us Chrome Extension of the Year, and Apple recognized us with the 2025 Design Award for Inclusivity.
Our remote team includes nearly 200 professionals from leading tech companies and top universities. We have no central office; the team collaborates across frontend and backend engineering, AI research, and more.
About the Role
The Data team in Speechify’s AI group is hiring a Software Engineer focused on data infrastructure and acquisition. This role is central to collecting and managing the large-scale datasets that support our model training. The work combines infrastructure, engineering, and research to deliver high-quality data at petabyte scale and low cost.
What You Will Do
- Find and source new audio data for our ingestion pipeline.
- Manage and improve our cloud infrastructure for data ingestion, currently on Google Cloud Platform (GCP) and maintained with Terraform.
- Work with scientists to optimize cost, throughput, and data quality, supporting larger and richer datasets for advanced models.
- Partner with the AI team and company leadership to shape the dataset roadmap for future consumer and enterprise products.
Location
This position is based in Atlanta, GA, USA.

