About the job
Join Skydio, the foremost drone company in the United States and a pioneer in autonomous flight technology, which is set to revolutionize the future of drones and aerial mobility. Our team excels in artificial intelligence, cutting-edge hardware and software development, operational excellence, and a relentless focus on customer satisfaction. We aim to empower a diverse range of drone users, including utility inspectors, first responders, and military personnel in challenging environments.
About the Role
We are looking for a Data Mining Engineer who possesses a systems-oriented perspective to streamline access to the vast data generated by our fleet of autonomous drones. In this role, you will significantly enhance the interaction of engineers, researchers, and product stakeholders with logs, video, sensor data, and analytics, allowing for better insights and the development of advanced autonomy features.
This position requires close collaboration with autonomy engineers, machine learning researchers, quality assurance, and platform teams to design robust data pipelines, establish indexing/search functionalities, and create tools that unlock the potential of our data.
Key Responsibilities:
Your contributions will be vital in the following areas:
Data Discovery & Accessibility: Develop systems to consolidate diverse data sources (logs, telemetry, analytics, media, etc.) into formats that are easy to discover and query.
Smart Dataset Generation: Facilitate the effective curation of machine learning datasets through tagging, indexing, and filtering based on relevant scenarios (e.g., environmental conditions, sensor behavior, scene attributes).
Telemetry & Log Intelligence: Create tools to automatically identify anomalies, regressions, or significant patterns in logs and telemetry data (e.g., CPU usage spikes, sensor noise, adverse conditions).
Software Performance Monitoring & Tooling: Develop systems for rapid comparisons between releases and for surfacing regressions in performance metrics, resource utilization, and data integrity.
Your Impact:
Design and maintain scalable data pipelines and services to index, enrich, and query multimodal autonomy data (e.g., time series, media, tabular analytics).
Work alongside autonomy and ML teams to understand data usage patterns and develop tools that enhance their workflows.
Create efficient search, tagging, and filtering methods for both structured and unstructured data.

