Qualifications
Responsibilities:Identify and source new audio data to enhance our data ingestion pipeline. Manage and optimize our cloud infrastructure for data ingestion, currently utilizing GCP and Terraform. Collaborate with our AI scientists to enhance data cost-effectiveness, throughput, and quality to support advanced model training. Work closely with the AI team and Speechify leadership to define our dataset roadmap for future consumer and enterprise products. Ideal Candidate Qualifications:BS, MS, or PhD in Computer Science or a related field. A minimum of 5 years of professional software development experience. Strong proficiency in bash and Python scripting within Linux environments. Experience with Docker and Infrastructure-as-Code, with professional familiarity with at least one major cloud provider (GCP preferred). Exposure to web crawlers and large-scale data processing workflows is advantageous. Ability to manage multiple tasks and adapt to evolving priorities. Excellent written and verbal communication skills are essential.
About the job
Speechify builds tools that turn written content into audio, helping millions of people read and learn in new ways. With over 50 million users, our text-to-speech products support formats like PDFs, books, Google Docs, news articles, and websites. Our lineup spans iOS, Android, Mac, a Chrome Extension, and a web app. Google named us Chrome Extension of the Year, and Apple recognized our work with the 2025 Design Award for Inclusivity.
Our team is fully remote and includes nearly 200 professionals around the world. Engineers, AI researchers, and specialists from companies like Amazon, Microsoft, and Google work alongside alumni from top universities, including Stanford.
Role overview
Speechify is hiring a Software Engineer for the AI team, with a focus on data infrastructure and acquisition. This role centers on building and maintaining systems that collect and process large-scale datasets for model training. The work involves integrating data pipelines and infrastructure to keep data quality high while controlling costs.
About Speechify
Speechify is dedicated to ensuring that reading is never a barrier to learning. With its award-winning text-to-speech technology, the company empowers millions to improve their reading speed and comprehension. Speechify's innovative approach has earned it recognition from industry leaders, and its fully remote team comprises top talent from various sectors.