Our MissionAt Reflection AI, our mission is to create open superintelligence and ensure its accessibility for everyone.We are crafting open weight models for individuals, organizations, and even nations. Our talented team comprises AI researchers and entrepreneurs hailing from renowned institutions such as DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic, and more.FoundationsVision:Establish and maintain a comprehensive company-wide foundations platform that empowers every team by delivering dependable, scalable developer infrastructure, Site Reliability Engineering (SRE) capabilities, and high-throughput data ingestion tools, enabling Reflection to accelerate as we grow.What This Team DoesWe are responsible for developing and managing the essential data systems and pipelines that fuel our research, training, and production environments. This platform facilitates rapid experimentation, reliable model development, and scalable production workflows by integrating ingestion, processing, and orchestration throughout the data lifecycle.Design ingestion and orchestration patterns for both batch and streaming data workloads.Construct scalable compute and storage foundations (formats, engines, runtimes) that support extensive data processing.Guarantee reproducible pipelines through versioning, backfills, and isolated execution environments.Deliver trusted data quality, lineage, and governance signals to empower teams in making informed production decisions.Sustain predictable cost and performance through established guardrails, budgets, and ongoing system optimization.Facilitate a unified data layer that supports research, training, and production across the model development lifecycle.About the RoleYou will play a pivotal role in constructing the core data systems and pipelines that drive our research, training, and production environments. Your responsibilities will include designing and implementing reliable, scalable ingestion and orchestration patterns for batch and streaming workloads, developing storage and compute foundations that enable reproducible experimentation and rapid iteration, and establishing data quality and governance standards that teams can rely on for production decisions. You will also provide the foundational data layer that unifies ingestion, processing, and workflow management throughout model development.
Mar 12, 2026