About the job
Archer Aviation is a pioneering aerospace company headquartered in San Jose, California, dedicated to developing innovative all-electric vertical takeoff and landing aircraft. Our mission is to enhance sustainable air mobility, providing a quiet and eco-friendly air travel solution for four passengers.
We embrace challenges with high aspirations and strongly believe that a diverse workplace enriches our insights and drives success. Committed to fostering an equitable and inclusive environment, we celebrate the unique contributions of all our team members.
Senior DevOps Engineer
The Role
As a Senior DevOps Engineer, you will play a crucial role in shaping our infrastructure strategy, prioritizing automation, reliability, and performance within both cloud and on-premise settings. Your expertise will guide best practices in CI/CD, configuration management, and monitoring, with a targeted emphasis on enhancing the deployment and operation of large language models (LLMs) and associated technologies.
Responsibilities
- Architect, deploy, and oversee scalable infrastructure utilizing Kubernetes and Docker across public cloud platforms (such as AWS, GCP, Azure) and on-premise data centers.
- Craft and maintain robust Configuration Management solutions (e.g., Ansible, Terraform) to ensure consistent environment provisioning and oversight.
- Establish and manage CI/CD pipelines to support quick, dependable, and automated software releases.
- Administer and troubleshoot operating systems, including both Linux and Windows environments.
- Enhance observability practices with monitoring tools like Datadog for effective logging, tracing, and alerting.
- Lead the operational deployment, scaling, and maintenance of LLM infrastructure, utilizing tools such as LiteLLM, OpenRouter, or similar LLM orchestration technologies.
- Automate routine tasks and system operations through scripting languages, predominantly Bash and Python.
- Collaborate closely with development, MLOps, and security teams to ensure infrastructure aligns with product requirements and compliance standards.
- Participate in an on-call rotation to maintain service reliability and responsiveness.

