About the job
About Anyscale:
At Anyscale, we are dedicated to revolutionizing distributed computing, making it accessible to developers of all backgrounds. Our flagship product, Ray, is an influential open-source framework that fosters a thriving ecosystem of libraries for scalable machine learning. Industry giants like OpenAI, Uber, Spotify, Instacart, and Cruise leverage Ray to enhance their AI initiatives, propelling them into practical applications.
With Anyscale, we aim to establish the ultimate environment for Ray, empowering developers and data scientists to effortlessly scale machine learning applications from their local machines to large clusters, without requiring deep expertise in distributed systems.
We are proud to be supported by Andreessen Horowitz, NEA, and Addition, having successfully raised over $250 million to date.
About the Role:
Ray aspires to deliver a universal API for crafting distributed applications, such as machine learning pipelines that encompass feature engineering, model training, and evaluation. Data is a fundamental component that interconnects these various stages, significantly influencing Ray's usability, performance, and stability. We are seeking talented engineers to enhance, optimize, and scale Ray’s Datasets library and its overall data processing capabilities.

