companySpeechify logo

Software Engineer - Data Infrastructure & Acquisition

SpeechifySydney, Australia
Remote Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

Preferred Qualifications:Bachelor's, Master's, or PhD in Computer Science or a related field. At least 5 years of professional experience in software development. Strong proficiency in bash/Python scripting within Linux environments. Experience with Docker and Infrastructure-as-Code principles, with professional exposure to at least one leading Cloud Provider (we use GCP). Familiarity with web crawlers and large-scale data processing workflows is a plus. Proven ability to multitask and adapt to evolving priorities. Excellent written and verbal communication skills.

About the job

Speechify helps over 50 million people turn reading materials, PDFs, books, Google Docs, news articles, and websites, into audio. Our text-to-speech apps span iOS, Android, Mac, Chrome, and web. Google named us Chrome Extension of the Year, and Apple awarded us the 2025 Design Award for Inclusivity.

Our fully remote team of about 200 includes engineers, AI researchers, and alumni from Amazon, Microsoft, Google, Stripe, Vercel, Bolt, and top academic programs. We work across time zones to build tools that make reading more accessible for everyone.

Role overview

Speechify’s AI division is looking for a Software Engineer focused on Data Infrastructure & Acquisition. This role centers on collecting and managing large-scale audio data to support model training. The team builds and maintains systems that create high-quality datasets at petabyte scale, balancing efficiency and cost.

What you will do

  • Find and source new audio data, then integrate it into our ingestion pipeline.
  • Maintain and improve our cloud infrastructure for data ingestion, currently running on Google Cloud Platform and managed with Terraform.
  • Partner with scientists to optimize for cost, throughput, and data quality, delivering large-scale datasets efficiently for new model development.
  • Work with the AI team and company leadership to plan the dataset roadmap for future consumer and enterprise products.

Location

This position is based in Sydney, Australia.

About Speechify

Speechify is committed to breaking down barriers to learning through innovative text-to-speech technology. With a global team working remotely, we have built a diverse and talented workforce dedicated to creating inclusive and effective solutions for our users.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.