companyDatabricks logo

Senior Software Engineer - Data Ingestion

DatabricksBengaluru, India
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

The Impact You Will Have: Tackle real business challenges at scale using your software engineering expertise. Deliver a robust, scalable, and fault-tolerant engine capable of processing hundreds of TB of data daily across a multitude of clients. Engage in low-level systems debugging, performance measurement, and optimization on extensive production clusters. Contribute to architectural design, influence the product roadmap, and take ownership of new projects. Utilize your extensive experience to help prevent and resolve production issues. Plan and lead complex technical projects involving collaboration with multiple teams within the organization. Quickly decompose intricate problems into actionable solutions, identify knowns and unknowns, and mitigate risks through prototyping and validation. What We Look For: Bachelor's degree or higher in Computer Science or a related field. 6+ years of production-level experience in one of the following: Python, Java, Scala, C++, or a similar programming language. Experience developing and managing data ingestion and integration processes. Strong understanding of database internals, including OLTP systems. Proficiency in designing scalable data architectures. Excellent problem-solving skills and the ability to work collaboratively in a team environment.

About the job

P-1403

At Databricks, we are dedicated to empowering data teams to tackle the most challenging problems in the world—ranging from transforming transportation to accelerating groundbreaking medical advancements. Our mission is realized through the development and operation of the premier data and AI infrastructure platform, enabling our clients to leverage deep data insights for enhanced business performance.

The ingestion of data into the Lakehouse represents a pivotal investment area for Databricks, serving as a vital enabler for Data and AI processes. The Lakeflow Connect initiative aims to address this challenge by offering intuitive, ready-to-use connectors for a diverse array of sources, including enterprise applications (such as Salesforce, Workday, ServiceNow, SharePoint), databases (e.g., SQL Server), cloud storage, message queues, and local files.

In addition to being a crucial component of Lakeflow and Data Engineering, Connect is a fundamental platform capability. Every interface at Databricks (Dashboards, Notebooks, SQL, AI) relies on ingestion functionality, and the leader in this role will collaborate closely with other product teams to integrate Connect into these interfaces.

We are seeking engineers who possess a strong foundation in core database internals to join our Lakeflow Connect team. A significant aspect of Connect involves extracting data from OLTP systems while minimizing the impact on production environments. To achieve this efficiently, we are developing systems that implement techniques such as incremental data capture and log parsing. We are looking for hands-on engineers eager to make a substantial impact on a critical challenge facing the company.

About Databricks

Databricks is a leader in the data and AI space, providing companies with the tools necessary to harness the power of their data. Our innovative platform allows organizations to leverage big data and machine learning to drive insights and achieve their goals. Join us in our mission to revolutionize how businesses interact with data.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.