About the job
About Aqua Voice
Aqua Voice is pioneering the voice input landscape for the AI era. By training our own models and creating deep OS integrations, we ensure that voice technology is executed flawlessly across the entire stack.
The evolution of work is upon us; individual contributor roles are transforming as we now manage AI agents, a task that naturally aligns with voice interactions.
We are not focused on traditional conversational agents; instead, we champion a voice-in-text-out (VITO) approach, believing that voice functionality should operate above traditional applications, and that innovative startups can lead in this domain.
With an unwavering commitment to this vision, our progress has been promising, and we invite you to be a part of our journey.
The Role
As a growing team, you will have the opportunity to engage with all facets of our system.
Recent projects include:
- Development of a real-time transcription server capable of managing thousands of concurrent audio streams.
- Training and deploying custom speech recognition models.
- Creating native integrations for macOS and Windows utilizing advanced system APIs.
Technology Stack
Frontend: TypeScript, React, Next.js, Electron
Backend: Python, real-time server (Bun/Node.js), WebSockets
Native: Swift (macOS), C# (Windows)
Machine Learning: Custom speech recognition models, inference pipelines
Infrastructure: Terraform, Stripe
Prior experience with all these technologies is not required.
Qualifications
- Showcase your previous work and projects.
- Adaptable in switching between programming languages and domains.
- Capable of taking ownership of projects from concept to production.
- Proficient in writing clean and maintainable code.
- Experience with production systems is essential.
- A positive attitude and a collaborative spirit are a must!

