About the job
Founding Platform & Reliability Engineer
About OpenArt
OpenArt is an AI-driven storytelling and visual creation platform used by millions of people around the globe. Our mission is to build the next generation of creative tools powered by advanced AI, so that users can generate videos, visuals, characters, and narratives faster and with more creative range than earlier tools allowed. We envision a future where creativity is inherently AI-native, and we are at the forefront of this transformation.
Why Join OpenArt?
Be part of a small, dynamic team where senior engineers are responsible for significant systems, not just fragments.
Contribute to large-scale projects where your work reaches millions of users quickly.
Benefit from a founder-led engineering culture where both founders are technical and actively engaged in product and architectural decisions.
Work on an AI-native product, crafting how state-of-the-art AI models translate into tangible user experiences.
Experience high ownership with minimal bureaucracy, emphasizing judgment, clarity, and speed.
Join us during a period of significant growth, with a 7-10X revenue increase over the past two years, and play a pivotal role in scaling the company to new heights.
About the Role
We are seeking a Founding Platform & Reliability Engineer to take charge of the design, scalability, and reliability of our entire infrastructure stack, from high-level architectural choices to hands-on implementation, observability, and cost management.
This role is not suited for traditional operators or narrow DevOps specialists. You should be adept at navigating cloud infrastructure, distributed systems, backend services, and developer tools, making practical decisions that optimize product velocity, system reliability, and cost efficiency, particularly in a fast-paced AI-centric landscape.
You will collaborate closely with the founders and product engineers to design and refine the platform that powers OpenArt. You will influence key decisions such as serverless versus containerized architecture, multi-provider AI reliability, and scaling systems for millions of users, and you will serve as a force multiplier for the entire engineering team.
What You’ll Do
Establish and operationalize SLOs/SLIs across essential user journeys (generation, editing, payments/credits, uploads, etc.), and use them, together with error budgets, to guide prioritization.
Lead the design and implementation of robust infrastructure solutions that effectively support OpenArt's rapid growth and evolving needs.

