About the job
Ramp builds intelligent infrastructure for finance teams, helping companies manage over $100 billion in annual spending. The platform automates payment authorization, risk reduction, spend categorization, and bookkeeping for more than 50,000 organizations. Ramp’s approach centers on solving complex, data-driven challenges with a focus on ownership, urgency, and measurable impact.
Customers typically see a 5% reduction in costs and a 16% increase in revenue within their first year on Ramp. Every team member is empowered to make decisions that shape the company’s outcomes, with an emphasis on action over credentials.
Role overview
This Software Engineer position sits on the Engineering Platform team in New York, NY (HQ). Rather than building customer-facing features, this team creates and maintains the core infrastructure that supports over 300 engineers at Ramp. Work includes developing and refining CI/CD pipelines, merge queues, deployment systems, service templates, and on-call tools, as well as the AI-driven layer that connects these systems.
When internal engineering processes slow down or fail, the effects are felt company-wide. This role carries responsibility for the entire incident lifecycle, from diagnosis and remediation to postmortem analysis and prevention. The work is foundational and high-impact, supporting a fast-moving engineering organization.
What you will do
- Design and enhance systems that increase engineering productivity, such as merge queues, deployment strategies, CI/CD frameworks, Smart Reviewer, and stack tools. Respond rapidly to slowdowns or issues.
- Lead incident response from start to finish: identify root causes (for example, database serialization failures or Redis latency spikes), implement solutions, document postmortems, and develop strategies to prevent future incidents.
- Establish standardized service pathways, including API design, database migrations, Kafka, Temporal, and secret management, to help teams build infrastructure efficiently and avoid redundant work.
- Oversee the full suite of developer tools (including Devportal, Graphite, Buildkite, Inspect, deployment processes, LaunchDarkly, and Datadog), with a strong understanding of their interactions, especially under pressure.
- Treat the Software Development Life Cycle (SDLC) as an operational system: monitor metrics like pull request throughput, CI/CD cycle times, and defect rates, and proactively address bottlenecks.

