companyDatabricks logo

Staff Software Engineer - Database Engine Internals

DatabricksMountain View, California
On-site Internship $192K/yr - $260K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Mid to Senior

Qualifications

What We Seek: A strong passion for database systems, storage systems, distributed systems, language design, or performance optimization. Proven experience in achieving a multi-year vision with incremental milestones. A drive to deliver significant customer value and impact. 8+ years of experience in a relevant system (preferred). Optional: PhD in databases or distributed systems.

About the job

P-188

At Databricks, we are on a mission to fundamentally transform the data lifecycle, simplifying processes from data ingestion to ETL, Business Intelligence (BI), and extending to Machine Learning (ML) and Artificial Intelligence (AI) through a unified platform. We envision a future where traditional data warehouse architectures are replaced by innovative solutions like the Lakehouse architecture (CIDR 2021 paper), which integrate data warehousing and advanced analytics, effectively addressing critical challenges such as data staleness, reliability, cost of ownership, data lock-in, and limited use-case support.

To turn this vision into reality, we are developing a next-generation decoupled query engine and structured storage system designed to surpass specialized data warehouses in relational query performance while maintaining the versatility of general-purpose systems like Apache Spark™. This system is intended to support a wide array of workloads, from ETL processes to data science applications.

As a member of our team, you will engage in one or more of the following areas, contributing to the design and implementation of cutting-edge systems that redefine industry standards:

  • Query compilation and optimization
  • Distributed query execution and scheduling
  • Vectorized execution engine
  • Data security measures
  • Resource management strategies
  • Transaction coordination processes
  • Efficient storage structures (encodings, indexes)
  • Automatic physical data optimization techniques

About Databricks

Databricks is a leader in simplifying the data lifecycle through innovative and scalable solutions. Our commitment to creating a unified platform empowers organizations to harness the full potential of their data, driving insights and efficiencies across the board.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.