About the job
The mission of Speechify is to break down barriers to learning through innovative technology.
With over 50 million users, Speechify's text-to-speech products transform reading materials—be it PDFs, books, Google Docs, news articles, or websites—into audio, enabling faster comprehension and retention. Our suite includes cutting-edge applications across iOS, Android, Mac, Chrome, and the web. We’re proud to have been recognized by Google as the Chrome Extension of the Year and by Apple as the 2025 Design Award winner for Inclusivity.
Today, our diverse team of nearly 200 professionals operates in a fully distributed environment, without a physical office. Our talented members hail from illustrious backgrounds, including Amazon, Microsoft, and Google, as well as leading academic programs like Stanford and high-growth startups such as Stripe, Vercel, and Bolt.
Role Overview
We are seeking a dedicated Software Engineer to enhance our AI team’s data operations. This role is pivotal in managing data collection to support our model training, allowing us to create high-quality datasets on a petabyte scale efficiently through the synergy of infrastructure, engineering, and research.
Key Responsibilities
- Identify and source new audio data for integration into our ingestion pipeline.
- Manage and extend our cloud infrastructure on GCP, utilizing Terraform.
- Work closely with our scientists to optimize cost, throughput, and quality, delivering enriched data at scale to enhance our next-generation models.
- Collaborate with the AI team and Speechify leadership to develop a dataset roadmap that supports innovative consumer and enterprise products.
Ideal Candidate Qualifications
- BS/MS/PhD in Computer Science or a related field.
- 5+ years of experience in software development.
- Proficiency in bash/Python scripting within Linux environments.
- Expertise with Docker and infrastructure as code, with experience in at least one major cloud provider (GCP preferred).
- Familiarity with web crawlers and large-scale data processing workflows is advantageous.
- Strong multitasking abilities and adaptability to shifting priorities.
- Excellent communication skills, both written and verbal.

