Senior Software Engineer - Distributed Data Systems

Databricks | San Francisco, California
On-site | Full-time | $166K/yr - $225K/yr

Experience Level

Senior

Qualifications

Desired qualifications include:

  • A Bachelor's degree (or higher) in Computer Science, a related technical field, or equivalent practical experience.
  • Proficiency and comfort working in a multi-dimensional technical landscape.
  • Experience with distributed systems and data processing.
  • Strong problem-solving skills and the ability to work collaboratively in a fast-paced environment.

About the job

At Databricks, we are driven by a passion for empowering data teams to tackle the world's most challenging problems, from transforming transportation to accelerating medical innovations. We do this by building and running the leading data and AI infrastructure platform, so that our customers can use deep data insights to improve their business. Founded by engineers with a customer-first mentality, we eagerly embrace every opportunity to tackle complex technical challenges, ranging from designing next-generation UI/UX for data interactions to scaling our services across millions of virtual machines. Our journey has just begun.

As a member of the Runtime team at Databricks, you will be instrumental in developing the next generation of distributed data storage and processing systems. These systems will surpass specialized SQL query engines in relational query performance while offering the programming abstractions necessary to support a variety of workloads, from ETL to data science.

Example projects include:

  • Apache Spark™: Contribute to the de facto open-source standard framework for big data.
  • Data Plane Storage: Develop reliable and high-performance services and client libraries for managing vast amounts of data within cloud storage backends like AWS S3 and Azure Blob Store.
  • Delta Lake: Design a storage management system that merges the scalability and cost-effectiveness of data lakes with the performance and reliability of data warehouses, providing features like ACID transactions and time travel.
  • Delta Pipelines: Simplify the orchestration and operation of numerous data pipelines, enabling clients to deploy, test, and upgrade pipelines effortlessly.
  • Performance Engineering: Create the next-generation query optimizer and execution engine that is fast, scalable, and robust.

About Databricks

Databricks is at the forefront of data and AI innovation, committed to enabling organizations to harness data effectively. With a team of dedicated engineers and a strong customer focus, we are continuously evolving our platform to meet the demands of modern data analysis and machine learning, empowering users to unlock the full potential of their data.
