companyxAI logo

Software Engineer - Observability

xAIPalo Alto, CA
On-site Full-time $180K/yr - $440K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

Bachelor's degree in Computer Science or a related field; 3+ years of experience in software development; strong proficiency in programming languages such as Go, Python, or Java; experience with observability tools and frameworks; solid understanding of metrics, logging, and tracing concepts; excellent problem-solving skills; ability to work collaboratively in a fast-paced environment.

About the job

About xAI

At xAI, we're on a mission to develop AI systems that not only understand the universe but also empower humanity in its quest for knowledge. Our team is compact, driven, and dedicated to engineering excellence. We welcome individuals who thrive on intellectual challenges and have a passion for curiosity. Operating within a flat organizational structure, we encourage all team members to be hands-on in contributing to our mission. Leadership opportunities are available for those who demonstrate initiative and consistently deliver outstanding results. Strong work ethic and prioritization skills are paramount. Effective communication is a must, as team members should be adept at sharing insights and knowledge with their peers.

 

About the Team

The Observability team is responsible for constructing and managing the essential infrastructure that allows engineers to monitor, troubleshoot, and enhance the performance and reliability of their systems. We process telemetry at an enormous scale, managing billions of time series and petabytes of logs, all while adhering to rigorous performance and availability standards.

About the Role

As part of a dynamic and impactful team, you will play a vital role in developing and maintaining xAI’s observability platform. You will take ownership of critical systems that facilitate metrics, logs, tracing, and alerting, enabling engineering teams to operate services at scale, preemptively identify issues before they affect users, and drive systemic improvements in reliability.

What You’ll Do

  • Design and implement scalable observability infrastructure for metrics, logging, and tracing.
  • Build high-performance telemetry pipelines capable of managing extensive ingestion volumes.
  • Develop APIs, query engines, and user interfaces that deliver real-time insights into services.
  • Establish and reinforce best practices for instrumentation, alerting, and reliability throughout the organization.
  • Collaborate with infrastructure and product teams to seamlessly integrate observability into our internal platforms.
  • Maintain end-to-end ownership of the reliability, scalability, and performance of the observability stack.

About xAI

xAI is at the forefront of innovation in artificial intelligence, dedicated to crafting systems that enhance human understanding of the universe. Our small yet passionate team thrives in an environment that encourages curiosity and hands-on contributions. Join us in our mission to push the boundaries of knowledge and technology.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.