Site Reliability Engineer at Air Apps | London

Air Apps, London Metropolitan Area
On-site Full-time

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or related field.
  • Proven experience in systems administration, software development, or DevOps.
  • Strong knowledge of cloud platforms (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).
  • Experience with monitoring and observability tools.
  • Proficiency in scripting languages (e.g., Python, Bash).
  • Excellent problem-solving skills and attention to detail.
  • Strong communication skills and ability to work collaboratively in a team environment.

About the job

Air Apps builds technology to help people plan, work, and live better. Founded in Lisbon in 2018 and still family-led, the company has expanded to San Francisco and now London. It remains self-funded and has reached over 100 million downloads.

Every day, teams at Air Apps challenge assumptions and develop AI-powered products that make a difference for users worldwide. The company values creativity and aims to improve how people manage their resources and how its products affect their lives.

Role Overview

The Site Reliability Engineer (SRE) will focus on keeping Air Apps systems reliable, available, and scalable. This role connects software development and operations, using automation, monitoring, and performance tuning to reduce downtime and strengthen system resilience.

This is a fully on-site position based in the London Metropolitan Area. Air Apps will consider relocation support for the right candidate. The SRE will work closely with cross-functional teams in a busy office setting.

What You Will Do

  • Design and implement systems that are scalable, reliable, and fault-tolerant across cloud platforms.
  • Develop and maintain observability tools for monitoring, logging, and alerting (such as Prometheus, Grafana, Datadog, ELK).
  • Automate infrastructure provisioning, deployment, and incident response using Infrastructure as Code tools like Terraform or CloudFormation.
  • Improve system performance, scalability, and incident response processes to maximize uptime.
  • Work with development and DevOps teams to strengthen system designs for reliability.
  • Conduct root cause analysis and implement steps to prevent future failures.
  • Design and maintain strategies for load balancing, failover, and disaster recovery to ensure high availability.

About Air Apps

Air Apps is a pioneering technology company focused on creating an AI-powered Personal & Entrepreneurial Resource Planner (PRP). Our innovative solutions aim to revolutionize how individuals manage their resources, enhancing productivity and efficiency across the globe.
