About the job
About Us:
At Fireworks AI, we are at the forefront of generative AI infrastructure innovation. We provide cutting-edge models with unmatched inference speed and scalability, establishing ourselves as leaders in the industry. Our projects include groundbreaking function calling and multimodal models, solidifying our reputation for excellence. As a Series C company valued at $4 billion, we are backed by esteemed investors such as Benchmark, Sequoia, Lightspeed, Index, and Evantic. Our dynamic team, composed of veterans from Meta PyTorch and Google Vertex AI, thrives on collaboration and ambition.
The Role
Join us in developing the fundamental systems that drive Fireworks AI, ranging from customer-centric APIs and product features to the distributed infrastructure facilitating AI workloads on a massive scale.
This position is a comprehensive full-stack backend and infrastructure role. You will design systems, deliver products, and take ownership of the entire process from inception to deployment.
What You’ll Work On
- APIs, web backend, and developer tooling
- Model training, fine-tuning, and inference orchestration
- Job scheduling, autoscaling, and model serving
- Billing, enterprise features, and access control
- Cross-cloud infrastructure (compute, storage, networking)
- Global scale GPU cluster management
What You’ll Do
- Develop and scale backend services and distributed systems
- Ensure system reliability from design through production
- Collaborate directly with customers to address real-world challenges
- Enhance performance, cost-effectiveness, and developer experience
- Rapidly implement AI tools to automate processes
You Might Be a Fit If
- You are eager to engage in the AI revolution
- You enjoy building infrastructure and backend systems that enhance products
- You think critically about systems, trade-offs, and their impacts
- You demonstrate ownership and drive initiatives across teams

