companyDatabricks logo

Staff Software Engineer - Database Engine Internals

DatabricksSan Francisco, California
On-site Full-time $192K/yr - $260K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

What We Are Looking For: A strong passion for database systems, storage solutions, distributed systems, language design, or performance optimization Experience in working toward a multi-year vision with incremental deliverables A drive to deliver meaningful customer value and impact 8+ years of relevant experience in a related field (preferred) Optional: PhD in databases or distributed systems

About the job

P-188

At Databricks, we are on a mission to revolutionize the data lifecycle—from ingestion and ETL to business intelligence and advanced machine learning. Our vision is centered around a unified platform that replaces the conventional data warehouse architecture with a cutting-edge Lakehouse model (CIDR 2021 paper). This innovative architecture aims to tackle significant challenges such as data staleness, reliability, total cost of ownership, data lock-in, and limited support for diverse use cases.

A pivotal element of achieving this vision is the development of the next generation of decoupled query engines and structured storage systems that can surpass the performance of specialized data warehouses while retaining the versatility of general-purpose systems like Apache Spark™. This capability is essential for supporting a wide range of workloads, from ETL processes to complex data science applications.

As a key member of this team, you will engage in one or more of the following areas to design and implement systems that set new standards in the industry:

  • Query compilation and optimization
  • Distributed query execution and scheduling
  • Vectorized execution engine
  • Data security
  • Resource management
  • Transaction coordination
  • Efficient storage structures (encodings, indexes)
  • Automatic physical data optimization

About Databricks

Databricks is a leader in data and AI, dedicated to simplifying the data lifecycle with innovative solutions that bridge the gap between data warehousing and advanced analytics. Our aim is to empower organizations to make data-driven decisions with confidence and efficiency.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.