Reflection AI

Technical Staff Member - Data Platform

On-site, Full-time

Experience Level

Entry Level

Qualifications

Key Responsibilities: Design and implement scalable data ingestion systems and orchestration frameworks. Work with technologies such as Spark, Flink, Beam, Airflow, Dagster, Kafka, and PubSub. Ensure data quality and governance through established standards and practices. Collaborate with cross-functional teams to integrate data solutions effectively. Continuously optimize data processing systems for performance and cost-effectiveness.

About the job

Our Mission

At Reflection AI, our mission is to create open superintelligence and ensure its accessibility for everyone.

We are building open-weight models for individuals, organizations, and even nations. Our team comprises AI researchers and entrepreneurs from institutions such as DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic, and more.

Foundations

Vision:

Establish and maintain a comprehensive company-wide foundations platform that empowers every team by delivering dependable, scalable developer infrastructure, Site Reliability Engineering (SRE) capabilities, and high-throughput data ingestion tools, enabling Reflection to accelerate as we grow.

What This Team Does

We are responsible for developing and managing the essential data systems and pipelines that fuel our research, training, and production environments. This platform facilitates rapid experimentation, reliable model development, and scalable production workflows by integrating ingestion, processing, and orchestration throughout the data lifecycle.

  • Design ingestion and orchestration patterns for both batch and streaming data workloads.

  • Construct scalable compute and storage foundations (formats, engines, runtimes) that support extensive data processing.

  • Guarantee reproducible pipelines through versioning, backfills, and isolated execution environments.

  • Deliver trusted data quality, lineage, and governance signals to empower teams in making informed production decisions.

  • Sustain predictable cost and performance through established guardrails, budgets, and ongoing system optimization.

  • Facilitate a unified data layer that supports research, training, and production across the model development lifecycle.

About the Role

You will play a pivotal role in constructing the core data systems and pipelines that drive our research, training, and production environments. Your responsibilities will include designing and implementing reliable, scalable ingestion and orchestration patterns for batch and streaming workloads, developing storage and compute foundations that enable reproducible experimentation and rapid iteration, and establishing data quality and governance standards that teams can rely on for production decisions. You will also provide the foundational data layer that unifies ingestion, processing, and workflow management throughout model development.

About Reflection AI

Reflection AI is at the forefront of artificial intelligence research and development, aiming to democratize access to cutting-edge technologies. Our team brings together some of the brightest minds in the industry, fostering innovation and collaboration to tackle the most pressing challenges in AI.
