About the job
Join the Team at Bretton AI
Bretton AI stands at the forefront of artificial intelligence in the financial services sector, providing an essential platform that organizations like Robinhood, Mercury, and Gusto rely on to streamline critical tasks, including anti-money laundering and counter-terrorism efforts.
With over $95 million raised from renowned investors such as Greylock, Y Combinator, and Thomson Reuters Ventures, we are located in the heart of San Francisco. Our talented team hails from prestigious companies including SpaceX, Google, Netflix, Stripe, and Plaid.
Your Role
As a Platform Engineer, you will be instrumental in constructing and maintaining the infrastructure that underpins our compliance solutions at scale. You will take charge of reliability, performance, and scalability across our core platforms.
Your responsibilities will include designing Postgres schemas, optimizing queries, developing high-throughput services, orchestrating long-running workflows with Temporal, and ensuring we maintain a commitment to a remarkable 99.9%+ uptime for our clients who process thousands of cases daily.
In this role, you'll produce backend code for production, design distributed systems, and manage the infrastructure that allows our product teams to deploy rapidly and efficiently.
Key Responsibilities
- Develop APIs and services capable of managing thousands of concurrent requests.
- Create ingestion → assessment → delivery pipelines for our case management.
- Implement Temporal workflows to manage long-running, stateful processes.
- Design and refine PostgreSQL schemas and queries for compliance workflows.
- Oversee CI/CD, deployments, and infrastructure-as-code while enhancing observability through logging, tracing, and metrics.
- Troubleshoot slow queries, memory leaks, and race conditions in our production systems.
What We're Seeking
Essential Qualifications
- 4+ years for Engineer / 6+ years for Sr. Engineer / 8+ years for Staff Engineer in backend or infrastructure roles.
- Strong understanding of systems fundamentals, including distributed systems, databases, APIs, and concurrency.
- Demonstrated production experience with live systems, including debugging, incident response, and participation in on-call rotations.

