Software Engineer - Infrastructure Team at Anyscale | San Francisco, CA
Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Experience
Qualifications
About Anyscale
Anyscale is at the forefront of democratizing distributed computing, offering innovative solutions that empower developers and data scientists. Our commitment to simplifying the complexity of machine learning applications sets us apart in the tech industry.
Similar jobs
Search for Infrastructure Software Engineer At Middesk San Francisco
11,588 results
About MiddeskMiddesk simplifies collaboration for businesses by transforming identity verification processes. Since our inception in 2018, we have replaced outdated, manual methods with an efficient platform that provides seamless access to comprehensive and current data. Our services enable companies across various sectors to confidently verify business identities, accelerate customer onboarding, and mitigate risks throughout the customer lifecycle.As a proud Y Combinator graduate, Middesk is supported by notable investors such as Sequoia Capital and Accel Partners. We have recently been featured on Forbes’ Fintech 50 List and recognized as a leader in business verification by the digital identity strategy firm, Liminal.About the Middesk Engineering Team:At Middesk, we prioritize delivering value to our customers through a concept we call 'Velocity'. This term embodies our commitment to achieving meaningful outcomes rather than merely focusing on code delivery speed. We believe that exceptional products arise from a blend of technical expertise and a profound understanding of our customers’ needs. Our engineering team is composed of humble, self-driven individuals who are dedicated to addressing even the most complex challenges faced by our clients. At Middesk Engineering, our mission is to put customers first.Your Role:We are seeking a talented Infrastructure Engineer to join our DevSecOps team. Your mission will be to empower engineering teams by providing secure, cost-effective, and scalable platform capabilities that enhance software delivery, improve developer experience, and ensure compliance with industry standards. You will be responsible for developing the tools and infrastructure necessary to scale our development and production systems. Your contributions will directly impact the entire Software Development Lifecycle and overall developer experience (DevEx). The systems you will support include Kubernetes, cloud infrastructure, observability, and local development environments.Our work environment is hybrid, requiring a presence in our San Francisco or New York City offices for 2 days each week. Candidates must reside within a reasonable commuting distance as we value in-person collaboration while also supporting flexible work arrangements.Key Responsibilities:Architect, build, and scale cloud infrastructure and orchestration systems (e.g., Kubernetes, Terraform, CI/CD).Take ownership of and enhance developer experience (DevEx) tools and workflows, spanning from local development to deployment.Develop observability systems that offer insights into performance, reliability, and usage metrics.
About MiddeskMiddesk is revolutionizing the way businesses collaborate by providing seamless business identity verification. Since our inception in 2018, we have replaced cumbersome manual processes with instant access to accurate and current data. Our platform empowers companies across various sectors to confidently verify business identities, accelerate customer onboarding, and mitigate risks throughout the customer journey.As a proud graduate of Y Combinator and backed by esteemed investors such as Sequoia Capital and Accel Partners, Middesk has been recognized in the Forbes Fintech 50 List and acknowledged as a leading authority in business verification by digital identity strategy firm, Liminal.About Middesk Engineering:At Middesk Engineering, we prioritize
Join middesk as a Software Engineer specializing in our Data Platform, where you will play a pivotal role in building robust systems that empower our data-driven initiatives. You will collaborate with cross-functional teams to design, implement, and optimize data solutions that enhance our products and services.
About MiddeskAt Middesk, we simplify the way businesses collaborate by streamlining business identity verification. Since our inception in 2018, we have been dedicated to transforming cumbersome, manual processes into efficient access to comprehensive, up-to-date data. Our innovative platform empowers companies across various sectors to confidently verify business identities, expedite customer onboarding, and minimize risk throughout the customer lifecycle.Originating from Y Combinator and backed by prestigious investors like Sequoia Capital and Accel Partners, Middesk has garnered recognition as an industry leader in business verification, being named to the Forbes Fintech 50 List and acknowledged by Liminal, a leading digital identity strategy firm.The Role:As a Senior Product Designer at Middesk, you will be instrumental in aiding businesses to maintain robust compliance standards and navigate risks with assurance. Your role involves crafting intuitive user experiences that equip customers to make informed compliance and risk decisions in highly regulated environments. Collaborating closely with cross-functional Product and Engineering teams, you will oversee the entire design process from initial discovery to final delivery, showcasing your expertise in design craftsmanship, systems thinking, and an inquisitive approach at every stage.We embrace a hybrid work model, anticipating 2 days per week in our San Francisco office. Ideal candidates should reside within a commutable distance, as we value the benefits of in-person collaboration and fostering strong team dynamics while accommodating flexibility whenever feasible.
About NumeralNumeral is revolutionizing the automation framework for online commerce, beginning with the often tedious task of sales tax compliance. We take care of everything from registration to remittance, providing an exceptional service that allows e-commerce businesses to concentrate on their core mission: expanding their products, customer base, and teams.As one of the rapidly growing companies from Y Combinator’s Winter 2023 cohort, we are backed by prestigious investors such as Benchmark Capital. Our team boasts extensive experience from the pioneering days at Stripe, Airbnb, Notion, and other leading firms, and we are poised to bring that same level of expertise, speed, and ambition to an industry ripe for transformation.Numeral may be small, but our impact is significant. Our growth is already approaching unmanageable levels, meaning each new hire will play a crucial role in shaping our company’s future. If you’re eager to join as an early team member and desire the kind of ownership that can define your career, we would love to connect with you.MissionOperating an online business today often requires juggling numerous responsibilities, many of which are not why founders launched their ventures. Our mission is to alleviate the administrative and accounting pressures that divert businesses from their passions.We have already assisted hundreds of merchants in sidestepping the headache of establishing large finance teams solely to manage tax compliance. Looking ahead, we aim to broaden our positive impact by developing the automation layer that enables online businesses to remain agile, compliant, and prepared for the future.About the RoleWe are seeking a foundational Software Engineer (Infrastructure) who excels at tackling complex distributed systems challenges at scale. In this role, you will design and implement core infrastructure, enhance service reliability and observability, and guarantee the platform’s scalability as we accommodate growing transaction volumes and integrations.This is a pivotal role with high leverage: your contributions will shape the architecture and technical direction of our infrastructure platform, directly influencing our customers’ experience and the trajectory of the company.ResponsibilitiesDesign and develop highly scalable, secure, and reliable infrastructure to support critical APIs, services, and data pipelines.Lead infrastructure architecture decisions focusing on performance, observability, and fault tolerance.
At Greptile, we are on a mission to develop intelligent agents that autonomously verify code modifications. Our current focus involves utilizing AI to analyze pull requests on GitHub, effectively identifying bugs and enforcing coding standards. With our technology, we review nearly 1 billion lines of code each month for over 3,000 companies.Challenges We Are Excited To TackleDeveloping agents that can learn coding standards through experience, similar to how new hires adapt.Determining customer-specific preferences for pull request feedback using sample-efficient reinforcement learning to enhance signal-to-noise ratios.Implementing automated deployments of feature branches and leveraging agents to stress-test the application for bug detection.Our Growth TrajectoryServing over 7,000 customers.Successfully raised $30 million from prominent investors including Benchmark, Y Combinator, Paul Graham, and Initialized.Our TeamWe have curated a highly skilled team that has successfully scaled vital functions at leading companies such as Stripe, Google, Figma, and others.Key ResponsibilitiesDesign and implement resilient infrastructure to accommodate Greptile's expanding user base.Collaborate with our largest enterprise clients to facilitate the deployment of Greptile within their environments.Streamline the on-premise deployment process to support smaller clients with minimal hands-on intervention.
About LightfieldLightfield is an innovative AI-driven CRM that seamlessly integrates your email, calendar, and meetings into an organized platform. It captures every interaction and transforms it into actionable insights, tasks, accounts, and follow-ups, ensuring that nothing is overlooked.We are revolutionizing the CRM landscape from the ground up. Rather than imposing rigid structures on teams, Lightfield adapts to how businesses naturally operate, automating processes and highlighting insights that foster growth. We are crafting the CRM platform of our dreams: fast, intelligent, and genuinely beneficial.Supported by esteemed investors like Greylock, Lightspeed, and Coatue, our team has a rich history of building impactful products, including Tome, a generative AI presentation tool embraced by over 25 million users. Our collective experience spans companies such as Llama, Instagram, Facebook Messenger, Pinterest, Google, and Salesforce.About the RoleWe are in search of a skilled, innovative, and adaptable engineer excited to take on the challenge of designing, developing, and scaling the foundational infrastructure and systems that power Lightfield's AI-enhanced CRM.At Lightfield, engineers take full ownership of projects from ideation to execution, collaborating across various functions and pushing the boundaries of what is achievable in applied AI and data systems. You will play a pivotal role in crafting and constructing the data models and infrastructure that support a next-generation CRM, ensuring optimal performance, scalability, and resilience.What You'll DoDevelop and sustain full-stack systems, ensuring reliability, scalability, and robust performance.Architect and enhance the data infrastructure and backend services that drive Lightfield's AI-powered CRM.Design and refine data models to facilitate CRM workflows, customer interactions, and AI-derived insights.Create observability and metrics frameworks to promote seamless operations, proactive issue detection, and ongoing enhancements.Guide colleagues, engage in code reviews, and contribute to shaping the engineering culture and best practices.Play a key role in cultivating a top-tier engineering team through recruitment, mentorship, and knowledge exchange.Who You ArePossess 3+ years of software development experience, with a solid foundation in infrastructure, data modeling, and backend systems.Demonstrate extensive experience in designing scalable, high-performance systems.
About Flow EngineeringAt Flow Engineering, we are pioneering an AI-native requirements platform that empowers cutting-edge engineering teams to collaborate seamlessly with AI agents. Our mission is to facilitate the design, validation, and evolution of complex systems with unmatched speed and precision. Following our successful Series A funding, we are on an exciting trajectory to scale our product from thousands to hundreds of thousands of users, all while upholding the highest standards of reliability and performance.About the RoleWe are seeking a passionate Infrastructure Software Engineer to join our dynamic team. In this role, you will be instrumental in constructing and expanding the core platform that underpins Flow. You will manage services and infrastructure that empower "agentic systems engineers" and product teams to leverage Flow in their daily tasks.You will become a key member of a small, senior team that prioritizes speed, ownership, and solid engineering principles—delivering version 1 products swiftly, learning, and iterating effectively.Your ResponsibilitiesDesign, develop, and maintain backend services and platform primitives that facilitate complex engineering workflows and large-scale collaboration.Enhance Flow’s capacity from thousands to hundreds of thousands of users, focusing on performance, reliability, observability, and security across the entire stack.Take ownership of CI/CD pipelines, testing infrastructure, and internal tools to enable rapid and safe product releases.Collaborate with frontend and AI engineers to establish robust APIs, data models, and integration points that are easy to adapt and evolve.Contribute to architectural decisions and the technical roadmap as our product and customer base expands.Your ProfileA minimum of 3 years of software engineering experience in building and maintaining production systems within a cloud environment (e.g., AWS or GCP).Deep understanding of systems design, distributed systems, and best practices for reliability, observability, and security.Proficiency with containerization and infrastructure-as-code tools (e.g., Docker, Terraform, etc.).Ability to take ownership of projects end-to-end in a fast-paced environment and make pragmatic decisions amidst ambiguity.A collaborative mindset with low ego, eager to work closely with product, design, and customer-facing teams.Our Technology StackUtilization of TypeScript, Node.js, and React for application development, with a strong emphasis on type safety.Employing Postgres and various managed cloud services for data persistence and messaging.
About Our TeamJoin our dynamic Infrastructure organization at OpenAI, where we are actively seeking talented software engineers to bolster our efforts across several high-impact teams. With a variety of focus areas available—including Core Distributed Systems, Databases, Observability, and Cloud Infrastructure—you'll have the opportunity to work on projects that fascinate you. Our teams operate with a high level of autonomy and foster a deeply collaborative environment, all dedicated to enhancing safety, reliability, and operational velocity across the organization.About the RoleAs a Software Engineer focused on Infrastructure Reliability, you will play a pivotal role in scaling and fortifying the infrastructure that supports some of the world’s most widely utilized AI systems. Your work will ensure that our systems maintain high reliability, observability, performance, and security—enabling researchers to iterate rapidly and allowing products like ChatGPT and the OpenAI API to effectively serve millions of users.This hands-on, impactful role is perfect for engineers who enjoy ownership, excel at solving complex technical challenges across the entire stack, and wish to contribute to systems that facilitate cutting-edge research deployed on a global scale. You will significantly influence technical direction, enhance system resilience, and collaborate closely with infrastructure, product, and research teams to transform intricate infrastructure into dependable platforms.Key ResponsibilitiesDesign, construct, and maintain reliable, high-performance systems utilized across engineering.Identify and resolve performance bottlenecks and inefficiencies, ensuring our infrastructure scales appropriately.Investigate and troubleshoot complex issues thoroughly.Enhance automation to minimize manual tasks and improve internal developer tools.Participate in incident response, postmortem analysis, and the development of best practices surrounding system reliability and scalability.Ideal Candidate ProfilePossess a deep understanding of distributed systems principles, with a proven track record in developing and managing scalable, reliable systems.Demonstrate a strong focus on performance and optimization, with the ability to maximize efficiency in complex, globally distributed systems.Have experience managing orchestration systems such as Kubernetes at scale and creating abstractions over cloud platforms.Be comfortable working within Linux environments and possess strong problem-solving skills.
About VibecodeAt Vibecode, we are revolutionizing the way software is created. Our innovative platform empowers anyone to articulate an idea and instantly transform it into a fully functional application—no coding skills required.We are tackling one of the most significant challenges in computing: aligning human intent with software execution. This endeavor necessitates groundbreaking advancements in AI reasoning, code generation, and user experience design.Our impressive seed funding comes from some of the top investors globally, including Alexis Ohanian (776), Arielle Zuckerberg, Cyan Banister (Long Journey), Ali Partovi, Suzanne Xie (Neo), and numerous esteemed angels from Google, Expo, OpenAI, and beyond.About the RoleAre you eager to be at the cutting edge of infrastructure design for a consumer product that will reach millions? If so, this opportunity is perfect for you.We seek an Infrastructure Engineer to develop the foundational systems that support millions of AI-generated applications. You will design a platform capable of securely hosting thousands of user-created applications concurrently while ensuring optimal performance and unwavering reliability.Your Responsibilities:Develop and implement secure sandbox environments for executing untrusted AI-generated code at scale.Create orchestration systems for stateless containers capable of launching over 10,000 applications simultaneously.Architect backend API services for real-time code generation, compilation, and deployment.Establish monitoring and observability systems for complex, multi-tenant application infrastructures.Design auto-scaling solutions to manage unpredictable traffic patterns from viral consumer applications.Build security-focused infrastructure that isolates user applications while preserving performance.This is not conventional infrastructure work. You will face unique challenges related to large-scale code execution, develop systems that are yet to be created, and establish infrastructure paradigms suited for the AI-native era.
OVERVIEWThis position is based out of our San Francisco office.At Modern Treasury, we are revolutionizing the way money moves. Our team is developing innovative products that empower customers to transfer funds seamlessly across traditional banking systems and emerging technologies, including stablecoins. In this role, you will play a critical part in designing, deploying, and managing the infrastructure that enables this transformation.This is a hands-on devops position that offers significant ownership of our infrastructure architecture and platform automation.ABOUT THE ROLEAs a key player at the crossroads of infrastructure, platform engineering, and security, you will define the strategies for scaling and managing our infrastructure while guiding teams to adopt cohesive architectural patterns.You will take the lead in building infrastructure systems that support:Multi-rail money movementReal-time payment flowsLedger-backed transaction processingHigh-availability payment orchestrationThis role is a senior individual contributor position that significantly influences our Product, Platform, and Infrastructure teams.WHAT YOU’LL DOLead the architecture for AWS infrastructure, container orchestration, and CI/CD processes to support scalable payment systems.Standardize automation and infrastructure-as-code practices for provisioning, scaling, and operating high-volume ledger and transaction platforms.Minimize operational overhead through enhanced tooling, optimized workflows, and elevated engineering standards.Enhance performance, reliability, and observability across our real-time and rails-specific infrastructure.Define service ownership and operational models, ensuring alignment between platform and product teams on scalable rail architecture.Contribute to the infrastructure and security strategy for new payment rails, including FBO structures, stablecoin integrations, and compliance-focused features.WHAT YOU SHOULD HAVERequired8+ years of experience in DevOps or infrastructure engineering.
About Candid HealthCandid Health is dedicated to transforming the healthcare landscape by addressing one of its most intricate and expensive challenges: the billing and revenue cycle management (RCM) process. The healthcare sector has been hampered by sluggish, inefficient workflows that squander precious resources, ultimately detracting from providers' ability to focus on patient care. Our innovative revenue cycle automation platform is set to revolutionize this domain with a smart, data-driven methodology that streamlines billing, enhances claims processing, and eradicates administrative inefficiencies.The OpportunityAs a Staff Engineer at Candid Health, you will have the opportunity to tackle our most challenging infrastructure issues, developing the essential frameworks needed for rapid scalability and to meet surging customer demand.In this position, you will take significant ownership of resolving technical bottlenecks related to scalability, performance, reliability, and observability across distributed systems. You will be empowered to make strategic decisions regarding build versus buy, guiding technical discussions and architecting solutions that enhance the reliability and performance of our core platform.Moreover, you will participate in leadership discussions, allowing you to advocate for your team, influence project prioritization on the roadmap, and stay informed about developments across the engineering organization.We seek an individual who has successfully led high-stakes, business-critical technical projects to swiftly scale an organization's infrastructure, significantly impacting business success. The ideal candidate will have experience in a startup environment where the product achieved maturity, yet the technology required rapid scaling to meet customer needs, and you were integral in facilitating that growth.
Clever is dedicated to bridging the gap in education by connecting every student globally to a vast array of learning opportunities. Our innovative identity platform supports 77% of U.S. schools and over 1 million K-12 students worldwide. As a trusted ally for educational institutions, we ensure secure and seamless access to digital learning resources, empowering students everywhere. Clever is proud to be a Kahoot! Company, headquartered in San Francisco, CA, with a far-reaching impact. Discover more about us at www.clever.com.The Infrastructure team at Clever is responsible for the engineering of our platform. This includes the development and enhancement of the internal developer platform utilized by our engineering teams to build, deploy, operate, and monitor Clever’s products. As a valued member of the Infrastructure team, you will be instrumental in creating self-service 'golden paths' and shared tools that enable teams to deliver and iterate on products swiftly and safely while ensuring adherence to reliability, security, and compliance standards. You will collaborate with engineering teams to identify pain points, enhance automation and workflows, and continuously elevate the developer experience and operational efficiency throughout the organization.
Join our dynamic team at leverdemo-8 as a Software Engineer specializing in Cloud Infrastructure. We are passionate about reimagining the hiring landscape and are looking for talented engineers to enhance our YugaByte DB for enterprise applications. Your expertise will contribute to optimizing orchestration support across major public clouds including AWS, Google Cloud, and Azure, as well as Kubernetes services and private data centers. You'll play a crucial role in the control and manageability plane of YugaByte and collaborate with tools such as Prometheus and Alert Manager to ensure seamless infrastructure management.Please note that this position is part of Lever's testing environment; we kindly ask you not to apply for this role.
About UsAt Hayden AI, we are dedicated to leveraging the capabilities of computer vision to revolutionize how transit systems and government agencies tackle pressing real-world issues.Our cutting-edge mobile perception system enhances bus lane enforcement, transportation optimization, and much more, empowering our clients to speed up transit, improve street safety, and work towards a sustainable future.About the RoleThe Infrastructure Engineering team plays a vital role in the success of Hayden AI products. We are responsible for the foundational infrastructure that links thousands of deployed devices to our extensive, multi-region cloud services and applications that utilize the data collected from these devices. If you consider yourself a force multiplier, you’ll thrive in our dynamic team!Key Responsibilities:Architect the Service Backbone: Lead the design and continuous improvement of our core services architecture, ensuring a robust, high-availability backbone for all cloud services and engineering teams.Drive Technical Roadmap: Define the long-term technical strategy and architectural vision for our cloud services, aligning with future business expansion and technological advancements.Strategic Decision Making: Lead pivotal architectural decisions, conduct in-depth code reviews, and assess technical trade-offs to maintain a sustainable and scalable service ecosystem.Scale Multi-Region Cloud: Design and oversee globally distributed, multi-region cloud deployments utilizing advanced Infrastructure as Code (IaC) for optimal scalability and performance.Automate Everything: Employ automation and modern orchestration tools to minimize manual tasks, enabling a self-service infrastructure model that allows developers to deploy code swiftly and safely.
Join Lever's innovative team as an Infrastructure Engineer, where we are redefining the future of talent acquisition. This role is crucial in building and maintaining the infrastructure that supports our cutting-edge hiring software used by renowned companies like Netflix, Shopify, and Spotify.Lever is a leader in the recruitment technology space, founded a decade ago to address the critical challenge of attracting and hiring top talent. Our company culture is rooted in a people-first approach, and we are proud to be recognized as the #1 workplace in San Francisco and a top employer in the U.S. We seek passionate individuals to join our team as we continue to scale.test
About Anyscale:At Anyscale, we are on a mission to democratize distributed computing, making it accessible for software developers across all skill levels. We are actively commercializing Ray, a prominent open-source project that's fostering an ecosystem of libraries designed for scalable machine learning. Leading companies such as OpenAI, Uber, Spotify, Instacart, Cruise, among others, have integrated Ray into their tech stacks to expedite the deployment of AI applications in real-world scenarios.At Anyscale, we are committed to creating the optimal environment for running Ray, enabling developers and data scientists to effortlessly scale machine learning applications from their laptops to large clusters without requiring expertise in distributed systems.We are proud to be backed by Andreessen Horowitz, NEA, and Addition, with over $250 million raised to date.About the RoleAnyscale is seeking a talented Software Engineer to join our Infrastructure team. Our goal is to deliver next-generation tools and infrastructure that simplify the development and execution of distributed AI applications in the cloud, making it as straightforward as local development. As a member of the Infra team, you will contribute to the creation of a scalable, secure, and resilient backbone that supports this vision.
At Composio, we are at the forefront of creating a seamless communication infrastructure that empowers agents to interact with essential work tools such as GitHub, Gmail, Notion, Salesforce, and more. Our dedicated team of engineers tackles challenges ranging from contextual understanding to search functionality, ensuring we build the most effective bridge between your agents and their tools.Recently, we secured a $25M Series A funding from Lightspeed, along with support from notable investors such as Guillermo Rauch (CEO of Vercel), Dharmesh Shah (CTO of HubSpot), and Gokul Rajaram. This year, we tripled our Annual Recurring Revenue (ARR), with clients ranging from fellow YC alumni to companies like Wabi, Glean, Zoom, and many others.We are always on the lookout for exceptional talent, regardless of the job board listings. If you believe you are a remarkable fit for our team, describe your dream job and outline what makes you exceptional. If we see a potential match, we will reach out!
Middesk
Join Middesk as a Machine Learning Engineer and contribute to cutting-edge projects that leverage machine learning to drive business insights. You will collaborate with a dedicated team of data scientists and engineers, developing algorithms and models that enhance our product offerings and improve user experience.
Join the Revolution at Retell AIRetell AI is pioneering the future of call centers through innovative voice AI, driven by first principles thinking.In just 18 months since our inception, we have empowered thousands of businesses with our AI voice agents, transforming how sales, support, and logistics calls are managed—previously requiring extensive human teams. Supported by prestigious investors such as Y Combinator and Alt Capital, we've rapidly scaled from $5M ARR to an impressive $36M ARR with a compact yet dynamic team of 20.Our ambition for 2026 is to create a revolutionary customer experience platform, where entire contact centers are powered by AI. Moving beyond basic automation, we aim to develop intelligent AI “workers” that serve as frontline agents, QA analysts, and managers, continuously enhancing customer interactions without the need for constant human oversight.As we expand, we are seeking passionate engineers who are eager to solve challenging technical problems, act swiftly, and make a significant impact in one of the fastest-growing voice AI startups. Let’s shape the future together.
Sign in to browse more jobs
Create account — see all 11,588 results

