About the job
At Intuition Machines, we use AI and machine learning to build enterprise security products. Our research shapes systems that serve hundreds of millions of people worldwide, and our team is distributed across many locations. You may recognize our flagship product, the hCaptcha security suite. We favor minimal overhead, small teams, and rapid prototyping.
As a Senior Machine Learning Data Engineer, you will play a pivotal role in designing and improving the data pipelines that power our products and research. Working across teams, you will ensure that our data is accessible, reliable, and scalable enough to meet the needs of both users and internal stakeholders.
Key Responsibilities:
- Enhance existing data and machine learning workflows while developing new processes for handling high-velocity data streams.
- Create interfaces and systems that empower machine learning engineers and researchers to generate datasets as required.
- Influence strategies for data storage and processing.
- Collaborate closely with machine learning, frontend, and backend teams to fortify our data platform.
- Accelerate deployment timelines for dashboards and machine learning models.
- Establish best practices and craft pipelines and software to facilitate efficient dataset creation and usage for ML engineers and researchers.
- Manage large datasets within performance constraints akin to those found in leading corporations.
- Adopt an agile approach, prioritizing early and frequent product iterations to enable rapid deployment to millions of users.
Desired Qualifications:
- A minimum of 3 years of experience in a data-centric role focused on designing and building data repositories, feature engineering, and creating dependable data pipelines for high-load scenarios.
- At least 2 years of professional software development experience outside of data engineering.
- Strong proficiency in Python, with hands-on experience in Kafka infrastructure and distributed data systems.
- In-depth knowledge of SQL and NoSQL databases, with a preference for ClickHouse.
- Familiarity with cloud platforms such as AWS or Azure.
- Experience with CI/CD processes, containerization, orchestration tools such as Kubernetes, and microservice architecture.
- A proven track record of making independent decisions regarding data processing strategy and architecture.
- A self-motivated individual capable of thriving in a fast-paced environment.
Preferred Qualifications:
- Experience in cross-functional collaboration with machine learning, backend, and frontend teams.
- Understanding of machine learning principles, encompassing model training, inference, and frameworks like PyTorch or TensorFlow.
What We Offer:
- A fully remote position with flexible working hours.
- An inspiring team of colleagues from around the globe.
- Modern and pleasant development and deployment workflows that embrace early and frequent shipping.

