Qualifications
Key Responsibilities:Proactively identify and source new audio data to integrate into our ingestion pipeline. Manage and enhance our cloud infrastructure on GCP, utilizing Terraform for orchestration. Work collaboratively with data scientists to improve data quality, throughput, and cost-efficiency, facilitating the development of next-generation models. Partner with AI Team members and leadership to establish a comprehensive dataset roadmap that fuels innovative consumer and enterprise products. Ideal Candidate Qualifications:BS, MS, or PhD in Computer Science or a related discipline. A minimum of 5 years of professional experience in software development. Expertise in bash/Python scripting within Linux environments. Proficient in Docker, Infrastructure-as-Code practices, and experience with a major Cloud Provider (GCP preferred). Familiarity with web crawlers and large-scale data processing workflows is advantageous. Demonstrated ability to manage multiple tasks and adapt to evolving priorities. Exceptional written and verbal communication skills.
About the job
Speechify builds technology to make reading more accessible for everyone. Our text-to-speech tools help over 50 million people turn PDFs, books, Google Docs, news articles, and websites into audio, making it easier to read, learn, and retain information.
Our products span iOS, Android, Mac, and a Chrome extension. Google named our Chrome extension Extension of the Year, and Apple recognized us with the 2025 Design Award for Inclusivity.
Our team is fully remote and includes nearly 200 people from backgrounds at Amazon, Microsoft, Google, and top universities such as Stanford. We value inclusion and collaboration across all levels.
Role Overview
The Data team within Speechify’s AI division is hiring a Software Engineer focused on data infrastructure and acquisition. This engineer will work on building and maintaining systems for large-scale data collection, supporting model training efforts, and helping us create high-quality datasets at petabyte scale.
Location
New York, NY, USA
About Speechify
Speechify is dedicated to making reading accessible for everyone. Our cutting-edge text-to-speech products empower millions of users to consume content more efficiently. We pride ourselves on our diverse, fully remote workforce that fosters innovation and inclusivity, aiming to continuously enhance our offerings in the digital reading space.