Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Senior
Qualifications
Proven experience in platform engineering, with a strong focus on infrastructure and AI technologies. Expertise in cloud services (AWS, Azure, GCP) and containerization (Docker, Kubernetes). Strong programming skills in languages such as Python, Java, or Go. Excellent problem-solving abilities and a team-oriented mindset. Ability to design and implement scalable architecture.
About the job
We are seeking a talented Senior Platform Engineer to join our dynamic team at Blackbird Labs Inc. in New York. In this pivotal role, you will leverage your expertise in infrastructure and artificial intelligence to build and enhance our platforms, ensuring they are robust, scalable, and efficient.
You will work closely with cross-functional teams to architect solutions that drive innovation and enhance performance. This is an excellent opportunity for an experienced engineer looking to make a significant impact in a fast-paced environment.
About Blackbird Labs Inc.
Blackbird Labs Inc. is a leading technology company focusing on innovative solutions in the fields of infrastructure and artificial intelligence. Our mission is to empower businesses through advanced technological frameworks and to create a collaborative environment for our employees to thrive.
We are seeking a talented Senior Platform Engineer to join our dynamic team at Blackbird Labs Inc. in New York. In this pivotal role, you will leverage your expertise in infrastructure and artificial intelligence to build and enhance our platforms, ensuring they are robust, scalable, and efficient.You will work closely with cross-functional teams to architect solutions that drive innovation and enhance performance. This is an excellent opportunity for an experienced engineer looking to make a significant impact in a fast-paced environment.
Full-time|$170K/yr - $210K/yr|Remote|New York, NY. Remote US.
About Kalepa:In the trillion-dollar commercial insurance industry, many processes still rely on outdated tools like Microsoft Outlook. At Kalepa, we are revolutionizing this landscape.Kalepa offers an advanced AI Underwriting Platform designed to provide professional-grade AI capabilities in production. Our platform empowers leading insurers to streamline submission data, uncover critical risk insights, and enhance decision-making speed and accuracy. Clients experience significant enhancements in their operational efficiency and portfolio quality upon implementing Kalepa, often referring to it as a "truly an underwriter’s dream." Our company is backed by renowned investors such as IA Ventures and Inspired Capital, and our diverse team boasts expertise from industry giants including Facebook, Google, Amazon, Mastercard, and Uber.
Full-time|$200K/yr - $250K/yr|On-site|New York, NY
Join Fluidstack: Pioneering the Future of IntelligenceAt Fluidstack, we're transforming the landscape of artificial intelligence infrastructure. Collaborating with leading AI research labs, government entities, and major corporations—including Mistral, Poolside, Black Forest Labs, and Meta—we are dedicated to delivering computing solutions at unprecedented speeds. Our mission is to expedite the realization of Artificial General Intelligence (AGI), and we are seeking passionate individuals who thrive on purpose and excellence.We take immense pride in the systems we develop and the trust we build with our clients. If you are ready to roll up your sleeves and contribute to shaping the future of intelligence, we invite you to join our innovative team.Position OverviewFluidstack, a prominent player in the cloud services arena, is on the lookout for a Software Engineer specializing in Infrastructure Platform Development. In this role, you will be instrumental in constructing the foundational platforms that support our global infrastructure and data center operations. Your focus will be on developing robust internal tools across various domains, including Configuration Management Database (CMDB), asset management, Data Center Infrastructure Management (DCIM), monitoring, observability, security, and operational automation. Collaborating with cross-functional teams, you will craft scalable and user-friendly solutions that enhance our ability to provide top-tier infrastructure services.Key ResponsibilitiesInfrastructure Platform DevelopmentDesign and implement a next-generation CMDB system to serve as the definitive source of truth for infrastructure assets, network architecture, and configuration data.Develop DCIM platforms for managing rack operations, server/GPU deployments, operating system installations, quality assurance, and white-screen activities.Create comprehensive asset lifecycle management systems encompassing receiving, racking, inventory, break-fix, and decommissioning workflows.Build monitoring and observability platforms that integrate telemetry from Building Management Systems (BMS), Environmental Power Monitoring Systems (EPMS), and IT devices, featuring intelligent alerting and incident management capabilities.Develop self-service portals and automation tools for new region initialization, post-deployment operations, and fleet-scale management.Operational Excellence & AutomationMinimize manual tasks through workflow automation and self-service tools that empower our operations and engineering teams.Create workflow orchestration systems to streamline complex multi-step processes that encompass incident, problem, and change management.
Full-time|On-site|San Francisco, CA; Seattle, WA; New York, NY
Scale AI is seeking a Senior AI Infrastructure Engineer to help build and refine the company’s Training Platform. This position centers on designing, implementing, and improving infrastructure that supports machine learning teams as they train and deploy models. Role overview This engineer will work closely with colleagues across different functions to create solutions that make AI systems more efficient. The focus is on enabling faster, more reliable model training and deployment. Key responsibilities Design and build infrastructure for AI model training Implement and optimize systems to support machine learning workflows Collaborate with teams throughout the company to improve platform capabilities Locations This role is based in San Francisco, Seattle, or New York.
Role overview Braze, Inc. is hiring a Senior Staff Platform Infrastructure Engineer based in New York City. This position centers on designing and improving the core infrastructure that supports Braze’s products and services. The engineer will guide complex technical projects and partner with teams throughout the company. What you will do Design and enhance platform infrastructure to strengthen Braze’s offerings Take the lead on technical projects, managing them from initial planning through execution Work closely with colleagues across departments to deliver reliable and scalable solutions Share ideas and technical knowledge to help drive improvements and innovation in the platform
About NomicNomic AI is at the forefront of innovation, developing cutting-edge AI agents and robust developer tools that transform document intelligence. We empower enterprise teams in architecture, engineering, and construction to extract actionable insights from decades of drawings, specifications, and project files. Our platform integrates advanced embedding models, sophisticated document parsing, and intelligent autonomous agents capable of reasoning over real-world data and executing actions in dynamic environments.The RoleAs a Senior Platform Engineer at Nomic, you will play a pivotal role in advancing our infrastructure and ensuring the safe operation of our autonomous agents in live customer environments. You will be responsible for overseeing our multi-account AWS setup, Kubernetes clusters, and infrastructure-as-code implementations. Your primary focus will be on the critical challenge of executing autonomous agents securely, including sandbox execution, secure isolation, automated health checks, and meeting the reliability standards our enterprise clients demand.This senior individual contributor role offers significant ownership and influence over architectural decisions.Team: Platform & Infrastructure Reports to: CTOWhat You'll Work OnAgent infrastructure: Your top priority will be ensuring that our agents operate safely within customer cloud environments. This includes managing sandboxing, isolation, monitoring, and the scalable operation of workloads. Key tasks will involve establishing execution environments, security boundaries, automated QA processes, evaluation harnesses, and feedback loops to enhance agent reliability over time.Core infrastructure: Work with Kubernetes, multi-account AWS, CI/CD pipelines, deployment strategies, observability frameworks (traces, metrics, logs, alerting, SLOs), disaster recovery protocols, and cost management initiatives.Security posture: Implement access controls, manage secrets, ensure network security, conduct image scanning, perform dependency audits, and oversee compliance tasks (SOC2, enterprise security) as required by our clients.Infrastructure as code: Continuously define, provision, and evolve our infrastructure through code.What We're Looking ForYou have:A minimum of 5 years of experience in infrastructure, DevOps, or SRE roles, specifically managing cloud infrastructure in production environments.Proficiency in Kubernetes, including deploying workloads, troubleshooting issues, and optimizing performance.
Full-time|$236K/yr - $295K/yr|On-site|New York, NY
Role Overview Ridgeline is looking for a Senior Staff AI Research Engineer to join the AI Platform team in New York, NY. This group focuses on advancing AI research and building practical solutions for complex, real-world problems. The position centers on evaluating machine learning models and large language models (LLMs) to improve their performance and impact. What You Will Do Design and build frameworks, tools, and systems that use advanced AI technologies. Architect and implement end-to-end AI systems that address challenging business needs. Prototype multi-stage processing structures as part of the research-to-production workflow. Develop evaluation frameworks and benchmarks for model accuracy, consistency, and cost-effectiveness. Work closely with business teams to validate and refine AI-powered tools. Create reusable AI capabilities that can be applied across different business areas, with a focus on investment management operations. About the Team The AI Platform team at Ridgeline drives the adoption of state-of-the-art AI across the company. The group values innovative thinking and practical application, aiming to deliver solutions that shape the future of investment management.
About DaydreamDaydream is an innovative chat-based shopping assistant exclusively tailored for fashion. Our aim is to revolutionize the way individuals discover and explore fashion by delivering a personalized, conversational experience powered by cutting-edge AI and natural language understanding.Supported by leading investors such as Forerunner Ventures, Index Ventures, Google Ventures, and True Ventures, our dedicated team is passionate about redefining the future of shopping.About the RoleAs the Engineering Lead for Platform & AI, you will take ownership of several backend systems at Daydream. This includes managing the catalog data pipeline that drives our product index, overseeing the AI content pipeline that generates landing pages for ad campaigns, and developing the B2B integration layer that allows our search functionality on partner retail websites. Your work will primarily involve Python and Go, focusing heavily on LLM integration, with your contributions directly impacting our product.Key Responsibilities:Catalog Pipeline:Oversee the augmentation pipeline—a multi-step Polars system that executes ML models (image analysis, hero selection, swatches via Vertex AI), standardizes sizes, appends affiliate links, and writes to Delta Lake / GCS.Enhance catalog quality through OOS accuracy, hero image selection, and sizing coverage.Develop and maintain scrapers and mappers for over 100 fashion merchants across various platforms including Shopify and custom sites.Eventually, take charge of the enrichment layer, focusing on attribute extraction, taxonomy, and Elasticsearch indexing.Content Generation Pipeline:Lead the AI content pipeline—a multi-step system utilizing LLM for topic generation, product matching, quality scoring, and landing page assembly.Enhance prompt quality and evaluate LLM output.Advance towards personalized content for logged-in users, email content, and broadened ad formats.B2B Merchant Integration:Create and manage services that expose core Daydream capabilities to merchant partners.Deploy on Kubernetes utilizing Helm and ArgoCD.In All Areas:Collaborate across teams to ensure seamless integration and functionality of systems.
Our VisionAt General Intelligence Company (GIC), we are pioneers in developing advanced autonomous agents designed specifically for startups. Our mission is to empower solo entrepreneurs to build billion-dollar enterprises with our innovative technology. These agents not only streamline workflows but also transform operational processes. We are in search of an experienced infrastructure and platform engineer with profound systems engineering expertise to create and sustain the robust, scalable infrastructure that fuels our AI agent platform.Our infrastructure manages intricate distributed workloads, ranging from AI model inference to multi-tenant orchestration of agents. We seek a candidate who can design resilient systems, enhance performance at scale, and facilitate the seamless evolution of our platform from handling thousands to millions of agent operations.Your ResponsibilitiesCraft and implement scalable architectural designs for AI agent workloads.Develop and uphold cloud infrastructure utilizing modern Infrastructure as Code (IaC) tools such as Terraform, ensuring dependable deployments across various environments.Enhance system performance and cost-effectiveness for high-throughput workloads, focusing on resource management and auto-scaling capabilities.Establish robust monitoring, alerting, and observability frameworks to guarantee over 99.9% uptime for mission-critical agent functionalities.Collaborate closely with engineering teams to ensure smooth CI/CD pipelines and zero-downtime deployments.Construct a security-centric infrastructure with appropriate access controls, secrets management, and compliance protocols.Engage in rapid, iterative cycles—focusing on continuous enhancement and scaling without unnecessary backlogs.Align our foundational principles with infrastructure:Cycle time is essential. You deploy rapidly and iterate.Quality is paramount. Your infrastructure is self-sufficient.No backlogs. Develop what is necessary, when it is necessary.FITFO. You independently resolve challenges.Ownership mentality. You approach your work as if you own it.Your Profile4+ years of experience in building and scaling production infrastructure, ideally within high-growth startup environments.Familiarity with Site Reliability Engineering (SRE) monitoring and observability tools, such as Datadog, along with experience in constructing robust alerting systems.Proficient in cloud platforms and services, with a strong understanding of networking and security best practices.
Full-time|$282K/yr - $363K/yr|On-site|New York, NY
Supported by prominent investors from Silicon Valley, Peregrine Technologies empowers public safety organizations, state and local governments, federal agencies, and private-sector institutions to tackle societal challenges with unmatched speed and precision. Our AI-driven platform transforms isolated and fragmented data into actionable operational intelligence, instantly highlighting critical information that enables quicker, more informed decisions to enhance outcomes at every interaction. Currently, Peregrine serves hundreds of clients across over 30 states and two countries, impacting more than 125 million individuals as we broaden our reach into enterprise markets and international territories.Our TeamAt Peregrine, we hold a strong belief that empathy is key to enhancing our solutions. Observing how users interact with our product is a priority and essential to finding effective answers. Engineers are encouraged to collaborate closely with our team on-site to grasp the diverse use cases that Peregrine addresses.We are seeking an Engineering Manager to strengthen our core engineering teams. In this role, you will work collaboratively with design and product management to create robust, scalable, and user-focused systems. Our teams face various challenges, including enabling real-time user collaboration on detailed maps and constructing high-scale backend architecture to process billions of data points.Ownership and collaboration are highly valued— you will be entrusted with significant features and will work alongside fellow engineers to see them through to completion. We believe humility and empathy are crucial in crafting the right solutions; you will engage directly with our deployment team and users as we refine our approach to address their needs. Creativity and perseverance are essential in realizing our vision.RoleThis position is pivotal to Peregrine's platform strategy and execution. You will be responsible for defining how our core systems scale, perform, and evolve as Peregrine accelerates its growth and deepens its influence across public safety, government, and enterprise clientele.As a senior leader in platform engineering, you will not only manage systems but also set the technical trajectory, build a cohesive team, and establish the operational framework that empowers every product team at Peregrine to operate more efficiently, securely, and confidently. Your contributions will directly influence reliability, developer productivity, customer trust, and the company’s capacity for innovation at scale.
Full-time|$179K/yr - $224K/yr|On-site|New York, NY
Who We AreAddepar is a leading global data and AI platform that empowers investment professionals to transform complex financial data into actionable insights. By unifying portfolio, market, and client data, Addepar delivers AI-driven insights seamlessly integrated within investment and client workflows. Serving over 1,400 firms across nearly 60 countries, we manage and advise on assets totaling almost $9 trillion. Our open platform collaborates with approximately 650 software, data, and consulting partners to enhance investment operations for organizations of all sizes and complexities. With a presence in New York City, Salt Lake City, London, Edinburgh, Pune, Dubai, Geneva, and São Paulo, Addepar is committed to supporting clients worldwide.The RoleWe are on the lookout for a Senior AI Engineer to become a pivotal member of our rapidly expanding AI Platform team. This team is dedicated to creating innovative and transformative solutions leveraging AI technology across our product suite.In this high-impact role, you will architect, lead, and execute the core AI capabilities that power our platform. Ideal candidates will thrive at the cutting edge of applied AI, showcasing a constant drive for delivering results. You will manage the complete lifecycle of AI-native products, ensuring the integration of advanced AI systems into robust, scalable, and high-performing production environments.
Databricks builds the data and AI infrastructure that helps organizations solve complex problems, from modernizing transportation to accelerating medical research. The platform supports teams in uncovering insights and improving operations through advanced data and AI tools. Role Overview This Senior Software Engineer position sits within the New York City Engineering office. The team focuses on creating new products from the ground up, using Databricks’ strengths in data and AI to build specialized AI applications for both technical and business audiences. The environment combines the energy of a startup with the backing of Databricks’ established resources. What You Will Do Design and build advanced interfaces for Generative AI agents that handle complex workflows, ensuring transparency and human oversight. Work closely with product managers, designers, and engineers to deliver intuitive, scalable solutions that drive user and business growth. Contribute hands-on as a full-stack developer, delivering user-centered experiences. Develop features that support product-led growth, including seamless onboarding and sharing tools. Help define the direction of agent-based applications at Databricks, collaborating with cross-functional teams. Lead the design, development, and maintenance of end-to-end AI systems that are reliable, scalable, and efficient.
Full-time|$300K/yr - $350K/yr|On-site|New York, New York, United States, San Francisco, California, United States
Our MissionAt Alchemy, we are dedicated to democratizing web3 for a billion users by equipping developers with the essential tools to create outstanding on-chain products. As the premier developer platform, we deliver robust APIs, SDKs, and tools that empower the development and scaling of on-chain applications and rollups.Our infrastructure underpins 70% of elite web3 teams, over 90% of web2 companies venturing into web3, and serves over 100 million end users. Our esteemed clientele includes leading web3 innovators such as Polymarket, OpenSea, and Circle, alongside global giants like Shopify and Adobe.The Alchemy team brings decades of unparalleled experience in highly scalable infrastructure, AI, and blockchain, with leadership roles held at top-tier companies and prestigious institutions like Google, Microsoft, Facebook, Stanford, and MIT.We are proudly supported by world-class VCs and institutions, including Lightspeed, Silver Lake, a16z, Coatue, Pantera, Addition, Stanford University, Coinbase, and Charles Schwab, among others.The RoleAs the Director of Engineering for Infrastructure & Platform, you will spearhead the vision, strategy, and implementation of one of the most sophisticated high-throughput distributed systems in the blockchain landscape. You will lead multiple teams responsible for the core platform that drives Alchemy’s offerings, where reliability, performance, and cost-efficiency at scale are of utmost importance.Your role will define the long-term technical direction of our platform infrastructure, collaborating closely with Product and Executive Leadership to ensure Alchemy remains at the forefront of Web3 infrastructure. Your contributions will empower developers to innovate swiftly and confidently, paving the way for blockchain adoption by the next billion users.What You’ll Do:Lead and expand multiple engineering teams across Cloud Infrastructure, Platform, Data Platform, and Internal Tooling, including distributed and international teams.Define and manage the platform and infrastructure strategy, ensuring scalability, reliability, performance, and cost efficiency throughout the organization.Serve as a senior technical leader, influencing architectural and system design for large-scale, mission-critical distributed systems.Collaborate with Product, Security, Finance, and Executive Leadership to align platform investments with company objectives and customer requirements.Drive execution excellence by refining processes, establishing clear priorities, and ensuring predictable delivery as the organization scales.
Join Us at PerceptaAt Percepta, we are on a mission to revolutionize critical industries through the power of applied AI. Our commitment is to ensure that essential sectors such as healthcare, manufacturing, and energy leverage advanced technologies for their benefit.We partner with leading organizations to foster AI transformation, combining:Expertise in engineering, product development, and researchMosaic, our proprietary toolkit designed for the rapid deployment of intelligent architecturesStrategic collaborations with industry giants like Anthropic, McKinsey, AWS, and General CatalystOur team consists of a dynamic group of Applied AI Engineers, Embedded Product Managers, and Researchers, all eager to integrate AI innovations into tangible improvements in everyday life.Percepta is a proud partner of General Catalyst, an esteemed global transformation and investment firm.Role OverviewWe are seeking a Senior Platform Engineer to develop the foundational infrastructure that supports Percepta’s AI initiatives.This pivotal role bridges customer implementations and platform architecture. As part of a dedicated team, you will engage directly with key institutions, designing and constructing systems that enable AI agents to effectively interact with real-world tools, services, and environments.Your work will often commence in the field, crafting solutions for specific client challenges. Ultimately, your goal will be to identify and formalize effective abstractions to enhance our platform, facilitating quicker deployment of future systems.The challenges we face are novel, and the frameworks are still evolving, meaning the systems we create often lack predefined guidelines.Your ResponsibilitiesDevelop the core platform enabling AI agents to interact seamlessly with external tools, services, and environments.Design systems that manage complex multi-step workflows driven by cutting-edge models.Collaborate with Percepta teams embedded with clients to deliver robust production AI systems.Extract reusable abstractions and infrastructure from real-world deployments.Create reliable interfaces between AI systems and distributed execution environments.Rapidly prototype new architectures as we navigate emerging patterns in AI.
GovDash empowers businesses to secure and execute government contracts that align with American interests.Our cutting-edge AI platform provides a robust, secure, and workflow-driven solution for managing the entire contracting lifecycle—ranging from opportunity identification and capture to proposal execution, award management, and post-award operations.In 2025, our clients successfully secured over $5 billion in government contracts. With an investment of $42 million, we are accelerating our product development and expanding GovDash's reach across the nation.About the RoleWe are seeking a Senior Software Engineer (Infrastructure) to take charge of a diverse array of self-hosted deployments, spanning cloud environments to air-gapped on-premises systems, along with the software essential for their management. You will work closely with both customers and our product engineering teams to enhance the capabilities of our self-hosted control plane, which includes features such as licensing, observability, automated updates, and more.This position requires a professional adept in high-level architectural design and implementation, with excellent communication skills to engage with customers effectively. Our team values curiosity and a commitment to excellence, thus new engineers should anticipate a challenging yet rewarding environment of technical mastery and significant ownership over critical components of GovDash's infrastructure.This role offers a competitive base salary of $190,000, along with equity, and is positioned in New York City.
Role overview rowspace is looking for an Infrastructure Engineer based in New York City. This position centers on building and securing the core systems that power our AI data platform. The work involves designing infrastructure that processes large volumes of sensitive financial data, with particular attention to security and compliance. Integrating both public and private, tenant-specific customer data in real time and at scale is a key part of this role. What you will do Design and build scalable infrastructure for an AI knowledge engine that works with structured and unstructured financial data. Develop secure architectures for private cloud environments, ensuring alignment with financial services compliance standards. Create data ingestion pipelines for sources such as CapIQ feeds and internal SharePoint documents. Develop monitoring and alerting tools for our Bring Your Own Cloud (BYOC) platform. Set up access controls and audit trails to trace AI interactions back to original data sources. Collaborate with AI Research and Product teams to optimize infrastructure for large language model (LLM) inference, training, and agent development. Implement CI/CD workflows and infrastructure-as-code for reliable deployments across multiple cloud providers.
About UsAt Percepta, we are dedicated to revolutionizing vital sectors through the power of applied AI. Our goal is to ensure that key industries such as healthcare, manufacturing, and energy harness the benefits of cutting-edge technology.We partner with leading organizations to facilitate AI transformation, providing:Expertise in engineering, product development, and researchMosaic, our proprietary toolkit designed for the swift deployment of intelligent architecturesStrategic alliances with notable entities like Anthropic, McKinsey, AWS, and the General Catalyst portfolioOur team is a dynamic collective of Applied AI Engineers, Embedded Product Managers, and Researchers driven by the mission to integrate advanced AI into the systems that shape our world.Percepta is a proud partner of General Catalyst.Role OverviewWe are on the lookout for an AI Infrastructure Engineer who will be responsible for the infrastructure, deployment, and operational reliability that underpin Percepta's AI systems, including the autonomous agents driving our innovations.Your role will involve enhancing existing systems: refining our Terraform configurations, fortifying deployment pipelines, and implementing more robust management of infrastructure across various regions and providers. You will also be tasked with constructing missing components and exploring uncharted territories, defining what Site Reliability Engineering (SRE) means in the context of autonomous decision-making systems.The infrastructure paradigms for future autonomous systems are yet to be established, and you will play a crucial role in shaping them.What Sets This Role ApartYou will be working with autonomous systems, where the infrastructure dynamics shift significantly when workloads have agency.Observability entails understanding the rationale behind an agent's decisions, not merely checking the health of a pod.There is a tangible gap between research and production in our environment. Our teams transition optimization algorithms and AI systems from research settings to production, and you will be integral to this process. While MLOps experience is not mandatory, you will be closer to this boundary than most infrastructure roles.Join a small team with significant ownership. You will make foundational decisions rather than inherit pre-existing ones.Your ResponsibilitiesDesign infrastructure patterns for multi-agent systems that are observable, controllable, and recoverable in innovative ways.
Join The New York Times Company as a Senior Software Engineer specializing in AI platforms and products. In this role, you will be at the forefront of developing innovative AI solutions that enhance our digital offerings and engage millions of readers. Collaborate with cross-functional teams to design, implement, and maintain high-performance AI systems that drive our content and product strategy.
About YouAs a Senior Platform Engineer at Copia Automation, you are a problem-solver who thrives on tackling challenging, unique, and impactful projects alongside passionate team members. Your expertise spans Site Reliability Engineering, DevOps, and Infrastructure Engineering, allowing you to take ownership of technical projects and provide invaluable thought leadership.In this role, you will make critical architectural decisions, mentor fellow engineers, and influence the technical trajectory of our products. Working in the industrial automation sector, you will delve into a range of topics that are rarely encountered in conventional software engineering roles. About UsCopia Automation is at the forefront of providing cutting-edge disaster recovery technology for the industrial automation landscape. Our innovative product suite introduces modern developer tools to the operational technology (OT) space, including a unique Git-based source control solution tailored for automation professionals. We are a well-funded startup with a rapidly expanding customer base within the industrial sector. Why Choose Industrial Automation?The manufacturing industry is heavily reliant on industrial automation, utilizing computerized systems and robotics operating on Programmable Logic Controllers (PLCs). This domain employs a distinctive graphical language that does not align with standard developer tools like GitLab or GitHub, leading automation professionals to depend on outdated storage methods and partial solutions. With significant downtime costs, there is a pressing demand for improved development tools. Engineering Culture at CopiaOur engineering team thrives on collaboration, experimentation, and a sense of ownership. We tackle complex challenges together, testing our assumptions and valuing practical solutions that balance immediate needs with long-term objectives. We maintain an optimistic outlook, celebrating our achievements and supporting one another through unforeseen obstacles.
Full-time|$125K/yr - $150K/yr|On-site|New York, New York, United States
CVector is dedicated to revolutionizing economic optimization and AI-driven predictions across energy and manufacturing sectors.Our technology seamlessly integrates critical decision-making factors that impact cost, reliability, and profit margins, providing a unified decision layer that forecasts future scenarios, simulates potential outcomes, and optimizes operational strategies. This allows industrial plants to operate closer to their maximum economic potential daily.This position requires in-office attendance at our New York City headquarters four days a week. CVector serves clients nationwide and operates in challenging industrial environments.Role OverviewAs a Senior Software Engineer specializing in Backend and AI Infrastructure, you will be instrumental in advancing CVector’s backend platform. Your focus will be on developing time-series data systems, AI-enhanced analytics, cloud infrastructure, and data ingestion pipelines that underpin our client-facing applications and internal modeling tools.This role is ideal for engineers who thrive on working closely with data and infrastructure, possess excellent architectural judgment, and are eager to engage with AI systems, databases, and distributed backend services. You will take ownership of complex systems, lead significant technical migrations, and influence the integration of intelligence into industrial energy workflows.You will work in close collaboration with product, modeling, and frontend engineers, significantly impacting the platform's direction, reliability, and scalability for the long term.Key ResponsibilitiesAs a Senior Software Engineer, your contributions will span various interconnected domains:Intelligent SystemsTranslate customer domains and operational workflows into effective prompts and AI system interfaces.Design, implement, and refine evaluations for AI outputs.Incorporate customer feedback and reinforcement signals to enhance system performance.Optimize context selection, retrieval, and trace collection to elevate output quality.Fine-tune smaller models using collected traces to enhance speed while maintaining performance standards.Evaluate and assimilate new AI platforms and models as they emerge.Support the training and deployment of large, time-series-specific models.Backend Platform and Data InfrastructureOversee migrations and enhancements of our time-series data schemas and storage systems.Update and manage PostgreSQL and associated database infrastructure.Develop and sustain data connectors for industrial and external systems.Lead improvements to MQTT-based data ingestion pipelines.Transition PostgREST to a GraphQL-based framework and evolve our API architecture.
We are seeking a talented Senior Platform Engineer to join our dynamic team at Blackbird Labs Inc. in New York. In this pivotal role, you will leverage your expertise in infrastructure and artificial intelligence to build and enhance our platforms, ensuring they are robust, scalable, and efficient.You will work closely with cross-functional teams to architect solutions that drive innovation and enhance performance. This is an excellent opportunity for an experienced engineer looking to make a significant impact in a fast-paced environment.
Full-time|$170K/yr - $210K/yr|Remote|New York, NY. Remote US.
About Kalepa:In the trillion-dollar commercial insurance industry, many processes still rely on outdated tools like Microsoft Outlook. At Kalepa, we are revolutionizing this landscape.Kalepa offers an advanced AI Underwriting Platform designed to provide professional-grade AI capabilities in production. Our platform empowers leading insurers to streamline submission data, uncover critical risk insights, and enhance decision-making speed and accuracy. Clients experience significant enhancements in their operational efficiency and portfolio quality upon implementing Kalepa, often referring to it as a "truly an underwriter’s dream." Our company is backed by renowned investors such as IA Ventures and Inspired Capital, and our diverse team boasts expertise from industry giants including Facebook, Google, Amazon, Mastercard, and Uber.
Full-time|$200K/yr - $250K/yr|On-site|New York, NY
Join Fluidstack: Pioneering the Future of IntelligenceAt Fluidstack, we're transforming the landscape of artificial intelligence infrastructure. Collaborating with leading AI research labs, government entities, and major corporations—including Mistral, Poolside, Black Forest Labs, and Meta—we are dedicated to delivering computing solutions at unprecedented speeds. Our mission is to expedite the realization of Artificial General Intelligence (AGI), and we are seeking passionate individuals who thrive on purpose and excellence.We take immense pride in the systems we develop and the trust we build with our clients. If you are ready to roll up your sleeves and contribute to shaping the future of intelligence, we invite you to join our innovative team.Position OverviewFluidstack, a prominent player in the cloud services arena, is on the lookout for a Software Engineer specializing in Infrastructure Platform Development. In this role, you will be instrumental in constructing the foundational platforms that support our global infrastructure and data center operations. Your focus will be on developing robust internal tools across various domains, including Configuration Management Database (CMDB), asset management, Data Center Infrastructure Management (DCIM), monitoring, observability, security, and operational automation. Collaborating with cross-functional teams, you will craft scalable and user-friendly solutions that enhance our ability to provide top-tier infrastructure services.Key ResponsibilitiesInfrastructure Platform DevelopmentDesign and implement a next-generation CMDB system to serve as the definitive source of truth for infrastructure assets, network architecture, and configuration data.Develop DCIM platforms for managing rack operations, server/GPU deployments, operating system installations, quality assurance, and white-screen activities.Create comprehensive asset lifecycle management systems encompassing receiving, racking, inventory, break-fix, and decommissioning workflows.Build monitoring and observability platforms that integrate telemetry from Building Management Systems (BMS), Environmental Power Monitoring Systems (EPMS), and IT devices, featuring intelligent alerting and incident management capabilities.Develop self-service portals and automation tools for new region initialization, post-deployment operations, and fleet-scale management.Operational Excellence & AutomationMinimize manual tasks through workflow automation and self-service tools that empower our operations and engineering teams.Create workflow orchestration systems to streamline complex multi-step processes that encompass incident, problem, and change management.
Full-time|On-site|San Francisco, CA; Seattle, WA; New York, NY
Scale AI is seeking a Senior AI Infrastructure Engineer to help build and refine the company’s Training Platform. This position centers on designing, implementing, and improving infrastructure that supports machine learning teams as they train and deploy models. Role overview This engineer will work closely with colleagues across different functions to create solutions that make AI systems more efficient. The focus is on enabling faster, more reliable model training and deployment. Key responsibilities Design and build infrastructure for AI model training Implement and optimize systems to support machine learning workflows Collaborate with teams throughout the company to improve platform capabilities Locations This role is based in San Francisco, Seattle, or New York.
Role overview Braze, Inc. is hiring a Senior Staff Platform Infrastructure Engineer based in New York City. This position centers on designing and improving the core infrastructure that supports Braze’s products and services. The engineer will guide complex technical projects and partner with teams throughout the company. What you will do Design and enhance platform infrastructure to strengthen Braze’s offerings Take the lead on technical projects, managing them from initial planning through execution Work closely with colleagues across departments to deliver reliable and scalable solutions Share ideas and technical knowledge to help drive improvements and innovation in the platform
About NomicNomic AI is at the forefront of innovation, developing cutting-edge AI agents and robust developer tools that transform document intelligence. We empower enterprise teams in architecture, engineering, and construction to extract actionable insights from decades of drawings, specifications, and project files. Our platform integrates advanced embedding models, sophisticated document parsing, and intelligent autonomous agents capable of reasoning over real-world data and executing actions in dynamic environments.The RoleAs a Senior Platform Engineer at Nomic, you will play a pivotal role in advancing our infrastructure and ensuring the safe operation of our autonomous agents in live customer environments. You will be responsible for overseeing our multi-account AWS setup, Kubernetes clusters, and infrastructure-as-code implementations. Your primary focus will be on the critical challenge of executing autonomous agents securely, including sandbox execution, secure isolation, automated health checks, and meeting the reliability standards our enterprise clients demand.This senior individual contributor role offers significant ownership and influence over architectural decisions.Team: Platform & Infrastructure Reports to: CTOWhat You'll Work OnAgent infrastructure: Your top priority will be ensuring that our agents operate safely within customer cloud environments. This includes managing sandboxing, isolation, monitoring, and the scalable operation of workloads. Key tasks will involve establishing execution environments, security boundaries, automated QA processes, evaluation harnesses, and feedback loops to enhance agent reliability over time.Core infrastructure: Work with Kubernetes, multi-account AWS, CI/CD pipelines, deployment strategies, observability frameworks (traces, metrics, logs, alerting, SLOs), disaster recovery protocols, and cost management initiatives.Security posture: Implement access controls, manage secrets, ensure network security, conduct image scanning, perform dependency audits, and oversee compliance tasks (SOC2, enterprise security) as required by our clients.Infrastructure as code: Continuously define, provision, and evolve our infrastructure through code.What We're Looking ForYou have:A minimum of 5 years of experience in infrastructure, DevOps, or SRE roles, specifically managing cloud infrastructure in production environments.Proficiency in Kubernetes, including deploying workloads, troubleshooting issues, and optimizing performance.
Full-time|$236K/yr - $295K/yr|On-site|New York, NY
Role Overview Ridgeline is looking for a Senior Staff AI Research Engineer to join the AI Platform team in New York, NY. This group focuses on advancing AI research and building practical solutions for complex, real-world problems. The position centers on evaluating machine learning models and large language models (LLMs) to improve their performance and impact. What You Will Do Design and build frameworks, tools, and systems that use advanced AI technologies. Architect and implement end-to-end AI systems that address challenging business needs. Prototype multi-stage processing structures as part of the research-to-production workflow. Develop evaluation frameworks and benchmarks for model accuracy, consistency, and cost-effectiveness. Work closely with business teams to validate and refine AI-powered tools. Create reusable AI capabilities that can be applied across different business areas, with a focus on investment management operations. About the Team The AI Platform team at Ridgeline drives the adoption of state-of-the-art AI across the company. The group values innovative thinking and practical application, aiming to deliver solutions that shape the future of investment management.
About DaydreamDaydream is an innovative chat-based shopping assistant exclusively tailored for fashion. Our aim is to revolutionize the way individuals discover and explore fashion by delivering a personalized, conversational experience powered by cutting-edge AI and natural language understanding.Supported by leading investors such as Forerunner Ventures, Index Ventures, Google Ventures, and True Ventures, our dedicated team is passionate about redefining the future of shopping.About the RoleAs the Engineering Lead for Platform & AI, you will take ownership of several backend systems at Daydream. This includes managing the catalog data pipeline that drives our product index, overseeing the AI content pipeline that generates landing pages for ad campaigns, and developing the B2B integration layer that allows our search functionality on partner retail websites. Your work will primarily involve Python and Go, focusing heavily on LLM integration, with your contributions directly impacting our product.Key Responsibilities:Catalog Pipeline:Oversee the augmentation pipeline—a multi-step Polars system that executes ML models (image analysis, hero selection, swatches via Vertex AI), standardizes sizes, appends affiliate links, and writes to Delta Lake / GCS.Enhance catalog quality through OOS accuracy, hero image selection, and sizing coverage.Develop and maintain scrapers and mappers for over 100 fashion merchants across various platforms including Shopify and custom sites.Eventually, take charge of the enrichment layer, focusing on attribute extraction, taxonomy, and Elasticsearch indexing.Content Generation Pipeline:Lead the AI content pipeline—a multi-step system utilizing LLM for topic generation, product matching, quality scoring, and landing page assembly.Enhance prompt quality and evaluate LLM output.Advance towards personalized content for logged-in users, email content, and broadened ad formats.B2B Merchant Integration:Create and manage services that expose core Daydream capabilities to merchant partners.Deploy on Kubernetes utilizing Helm and ArgoCD.In All Areas:Collaborate across teams to ensure seamless integration and functionality of systems.
Our VisionAt General Intelligence Company (GIC), we are pioneers in developing advanced autonomous agents designed specifically for startups. Our mission is to empower solo entrepreneurs to build billion-dollar enterprises with our innovative technology. These agents not only streamline workflows but also transform operational processes. We are in search of an experienced infrastructure and platform engineer with profound systems engineering expertise to create and sustain the robust, scalable infrastructure that fuels our AI agent platform.Our infrastructure manages intricate distributed workloads, ranging from AI model inference to multi-tenant orchestration of agents. We seek a candidate who can design resilient systems, enhance performance at scale, and facilitate the seamless evolution of our platform from handling thousands to millions of agent operations.Your ResponsibilitiesCraft and implement scalable architectural designs for AI agent workloads.Develop and uphold cloud infrastructure utilizing modern Infrastructure as Code (IaC) tools such as Terraform, ensuring dependable deployments across various environments.Enhance system performance and cost-effectiveness for high-throughput workloads, focusing on resource management and auto-scaling capabilities.Establish robust monitoring, alerting, and observability frameworks to guarantee over 99.9% uptime for mission-critical agent functionalities.Collaborate closely with engineering teams to ensure smooth CI/CD pipelines and zero-downtime deployments.Construct a security-centric infrastructure with appropriate access controls, secrets management, and compliance protocols.Engage in rapid, iterative cycles—focusing on continuous enhancement and scaling without unnecessary backlogs.Align our foundational principles with infrastructure:Cycle time is essential. You deploy rapidly and iterate.Quality is paramount. Your infrastructure is self-sufficient.No backlogs. Develop what is necessary, when it is necessary.FITFO. You independently resolve challenges.Ownership mentality. You approach your work as if you own it.Your Profile4+ years of experience in building and scaling production infrastructure, ideally within high-growth startup environments.Familiarity with Site Reliability Engineering (SRE) monitoring and observability tools, such as Datadog, along with experience in constructing robust alerting systems.Proficient in cloud platforms and services, with a strong understanding of networking and security best practices.
Full-time|$282K/yr - $363K/yr|On-site|New York, NY
Supported by prominent investors from Silicon Valley, Peregrine Technologies empowers public safety organizations, state and local governments, federal agencies, and private-sector institutions to tackle societal challenges with unmatched speed and precision. Our AI-driven platform transforms isolated and fragmented data into actionable operational intelligence, instantly highlighting critical information that enables quicker, more informed decisions to enhance outcomes at every interaction. Currently, Peregrine serves hundreds of clients across over 30 states and two countries, impacting more than 125 million individuals as we broaden our reach into enterprise markets and international territories.Our TeamAt Peregrine, we hold a strong belief that empathy is key to enhancing our solutions. Observing how users interact with our product is a priority and essential to finding effective answers. Engineers are encouraged to collaborate closely with our team on-site to grasp the diverse use cases that Peregrine addresses.We are seeking an Engineering Manager to strengthen our core engineering teams. In this role, you will work collaboratively with design and product management to create robust, scalable, and user-focused systems. Our teams face various challenges, including enabling real-time user collaboration on detailed maps and constructing high-scale backend architecture to process billions of data points.Ownership and collaboration are highly valued— you will be entrusted with significant features and will work alongside fellow engineers to see them through to completion. We believe humility and empathy are crucial in crafting the right solutions; you will engage directly with our deployment team and users as we refine our approach to address their needs. Creativity and perseverance are essential in realizing our vision.RoleThis position is pivotal to Peregrine's platform strategy and execution. You will be responsible for defining how our core systems scale, perform, and evolve as Peregrine accelerates its growth and deepens its influence across public safety, government, and enterprise clientele.As a senior leader in platform engineering, you will not only manage systems but also set the technical trajectory, build a cohesive team, and establish the operational framework that empowers every product team at Peregrine to operate more efficiently, securely, and confidently. Your contributions will directly influence reliability, developer productivity, customer trust, and the company’s capacity for innovation at scale.
Full-time|$179K/yr - $224K/yr|On-site|New York, NY
Who We AreAddepar is a leading global data and AI platform that empowers investment professionals to transform complex financial data into actionable insights. By unifying portfolio, market, and client data, Addepar delivers AI-driven insights seamlessly integrated within investment and client workflows. Serving over 1,400 firms across nearly 60 countries, we manage and advise on assets totaling almost $9 trillion. Our open platform collaborates with approximately 650 software, data, and consulting partners to enhance investment operations for organizations of all sizes and complexities. With a presence in New York City, Salt Lake City, London, Edinburgh, Pune, Dubai, Geneva, and São Paulo, Addepar is committed to supporting clients worldwide.The RoleWe are on the lookout for a Senior AI Engineer to become a pivotal member of our rapidly expanding AI Platform team. This team is dedicated to creating innovative and transformative solutions leveraging AI technology across our product suite.In this high-impact role, you will architect, lead, and execute the core AI capabilities that power our platform. Ideal candidates will thrive at the cutting edge of applied AI, showcasing a constant drive for delivering results. You will manage the complete lifecycle of AI-native products, ensuring the integration of advanced AI systems into robust, scalable, and high-performing production environments.
Databricks builds the data and AI infrastructure that helps organizations solve complex problems, from modernizing transportation to accelerating medical research. The platform supports teams in uncovering insights and improving operations through advanced data and AI tools. Role Overview This Senior Software Engineer position sits within the New York City Engineering office. The team focuses on creating new products from the ground up, using Databricks’ strengths in data and AI to build specialized AI applications for both technical and business audiences. The environment combines the energy of a startup with the backing of Databricks’ established resources. What You Will Do Design and build advanced interfaces for Generative AI agents that handle complex workflows, ensuring transparency and human oversight. Work closely with product managers, designers, and engineers to deliver intuitive, scalable solutions that drive user and business growth. Contribute hands-on as a full-stack developer, delivering user-centered experiences. Develop features that support product-led growth, including seamless onboarding and sharing tools. Help define the direction of agent-based applications at Databricks, collaborating with cross-functional teams. Lead the design, development, and maintenance of end-to-end AI systems that are reliable, scalable, and efficient.
Full-time|$300K/yr - $350K/yr|On-site|New York, New York, United States, San Francisco, California, United States
Our MissionAt Alchemy, we are dedicated to democratizing web3 for a billion users by equipping developers with the essential tools to create outstanding on-chain products. As the premier developer platform, we deliver robust APIs, SDKs, and tools that empower the development and scaling of on-chain applications and rollups.Our infrastructure underpins 70% of elite web3 teams, over 90% of web2 companies venturing into web3, and serves over 100 million end users. Our esteemed clientele includes leading web3 innovators such as Polymarket, OpenSea, and Circle, alongside global giants like Shopify and Adobe.The Alchemy team brings decades of unparalleled experience in highly scalable infrastructure, AI, and blockchain, with leadership roles held at top-tier companies and prestigious institutions like Google, Microsoft, Facebook, Stanford, and MIT.We are proudly supported by world-class VCs and institutions, including Lightspeed, Silver Lake, a16z, Coatue, Pantera, Addition, Stanford University, Coinbase, and Charles Schwab, among others.The RoleAs the Director of Engineering for Infrastructure & Platform, you will spearhead the vision, strategy, and implementation of one of the most sophisticated high-throughput distributed systems in the blockchain landscape. You will lead multiple teams responsible for the core platform that drives Alchemy’s offerings, where reliability, performance, and cost-efficiency at scale are of utmost importance.Your role will define the long-term technical direction of our platform infrastructure, collaborating closely with Product and Executive Leadership to ensure Alchemy remains at the forefront of Web3 infrastructure. Your contributions will empower developers to innovate swiftly and confidently, paving the way for blockchain adoption by the next billion users.What You’ll Do:Lead and expand multiple engineering teams across Cloud Infrastructure, Platform, Data Platform, and Internal Tooling, including distributed and international teams.Define and manage the platform and infrastructure strategy, ensuring scalability, reliability, performance, and cost efficiency throughout the organization.Serve as a senior technical leader, influencing architectural and system design for large-scale, mission-critical distributed systems.Collaborate with Product, Security, Finance, and Executive Leadership to align platform investments with company objectives and customer requirements.Drive execution excellence by refining processes, establishing clear priorities, and ensuring predictable delivery as the organization scales.
Join Us at PerceptaAt Percepta, we are on a mission to revolutionize critical industries through the power of applied AI. Our commitment is to ensure that essential sectors such as healthcare, manufacturing, and energy leverage advanced technologies for their benefit.We partner with leading organizations to foster AI transformation, combining:Expertise in engineering, product development, and researchMosaic, our proprietary toolkit designed for the rapid deployment of intelligent architecturesStrategic collaborations with industry giants like Anthropic, McKinsey, AWS, and General CatalystOur team consists of a dynamic group of Applied AI Engineers, Embedded Product Managers, and Researchers, all eager to integrate AI innovations into tangible improvements in everyday life.Percepta is a proud partner of General Catalyst, an esteemed global transformation and investment firm.Role OverviewWe are seeking a Senior Platform Engineer to develop the foundational infrastructure that supports Percepta’s AI initiatives.This pivotal role bridges customer implementations and platform architecture. As part of a dedicated team, you will engage directly with key institutions, designing and constructing systems that enable AI agents to effectively interact with real-world tools, services, and environments.Your work will often commence in the field, crafting solutions for specific client challenges. Ultimately, your goal will be to identify and formalize effective abstractions to enhance our platform, facilitating quicker deployment of future systems.The challenges we face are novel, and the frameworks are still evolving, meaning the systems we create often lack predefined guidelines.Your ResponsibilitiesDevelop the core platform enabling AI agents to interact seamlessly with external tools, services, and environments.Design systems that manage complex multi-step workflows driven by cutting-edge models.Collaborate with Percepta teams embedded with clients to deliver robust production AI systems.Extract reusable abstractions and infrastructure from real-world deployments.Create reliable interfaces between AI systems and distributed execution environments.Rapidly prototype new architectures as we navigate emerging patterns in AI.
GovDash empowers businesses to secure and execute government contracts that align with American interests.Our cutting-edge AI platform provides a robust, secure, and workflow-driven solution for managing the entire contracting lifecycle—ranging from opportunity identification and capture to proposal execution, award management, and post-award operations.In 2025, our clients successfully secured over $5 billion in government contracts. With an investment of $42 million, we are accelerating our product development and expanding GovDash's reach across the nation.About the RoleWe are seeking a Senior Software Engineer (Infrastructure) to take charge of a diverse array of self-hosted deployments, spanning cloud environments to air-gapped on-premises systems, along with the software essential for their management. You will work closely with both customers and our product engineering teams to enhance the capabilities of our self-hosted control plane, which includes features such as licensing, observability, automated updates, and more.This position requires a professional adept in high-level architectural design and implementation, with excellent communication skills to engage with customers effectively. Our team values curiosity and a commitment to excellence, thus new engineers should anticipate a challenging yet rewarding environment of technical mastery and significant ownership over critical components of GovDash's infrastructure.This role offers a competitive base salary of $190,000, along with equity, and is positioned in New York City.
Role overview rowspace is looking for an Infrastructure Engineer based in New York City. This position centers on building and securing the core systems that power our AI data platform. The work involves designing infrastructure that processes large volumes of sensitive financial data, with particular attention to security and compliance. Integrating both public and private, tenant-specific customer data in real time and at scale is a key part of this role. What you will do Design and build scalable infrastructure for an AI knowledge engine that works with structured and unstructured financial data. Develop secure architectures for private cloud environments, ensuring alignment with financial services compliance standards. Create data ingestion pipelines for sources such as CapIQ feeds and internal SharePoint documents. Develop monitoring and alerting tools for our Bring Your Own Cloud (BYOC) platform. Set up access controls and audit trails to trace AI interactions back to original data sources. Collaborate with AI Research and Product teams to optimize infrastructure for large language model (LLM) inference, training, and agent development. Implement CI/CD workflows and infrastructure-as-code for reliable deployments across multiple cloud providers.
About UsAt Percepta, we are dedicated to revolutionizing vital sectors through the power of applied AI. Our goal is to ensure that key industries such as healthcare, manufacturing, and energy harness the benefits of cutting-edge technology.We partner with leading organizations to facilitate AI transformation, providing:Expertise in engineering, product development, and researchMosaic, our proprietary toolkit designed for the swift deployment of intelligent architecturesStrategic alliances with notable entities like Anthropic, McKinsey, AWS, and the General Catalyst portfolioOur team is a dynamic collective of Applied AI Engineers, Embedded Product Managers, and Researchers driven by the mission to integrate advanced AI into the systems that shape our world.Percepta is a proud partner of General Catalyst.Role OverviewWe are on the lookout for an AI Infrastructure Engineer who will be responsible for the infrastructure, deployment, and operational reliability that underpin Percepta's AI systems, including the autonomous agents driving our innovations.Your role will involve enhancing existing systems: refining our Terraform configurations, fortifying deployment pipelines, and implementing more robust management of infrastructure across various regions and providers. You will also be tasked with constructing missing components and exploring uncharted territories, defining what Site Reliability Engineering (SRE) means in the context of autonomous decision-making systems.The infrastructure paradigms for future autonomous systems are yet to be established, and you will play a crucial role in shaping them.What Sets This Role ApartYou will be working with autonomous systems, where the infrastructure dynamics shift significantly when workloads have agency.Observability entails understanding the rationale behind an agent's decisions, not merely checking the health of a pod.There is a tangible gap between research and production in our environment. Our teams transition optimization algorithms and AI systems from research settings to production, and you will be integral to this process. While MLOps experience is not mandatory, you will be closer to this boundary than most infrastructure roles.Join a small team with significant ownership. You will make foundational decisions rather than inherit pre-existing ones.Your ResponsibilitiesDesign infrastructure patterns for multi-agent systems that are observable, controllable, and recoverable in innovative ways.
Join The New York Times Company as a Senior Software Engineer specializing in AI platforms and products. In this role, you will be at the forefront of developing innovative AI solutions that enhance our digital offerings and engage millions of readers. Collaborate with cross-functional teams to design, implement, and maintain high-performance AI systems that drive our content and product strategy.
About YouAs a Senior Platform Engineer at Copia Automation, you are a problem-solver who thrives on tackling challenging, unique, and impactful projects alongside passionate team members. Your expertise spans Site Reliability Engineering, DevOps, and Infrastructure Engineering, allowing you to take ownership of technical projects and provide invaluable thought leadership.In this role, you will make critical architectural decisions, mentor fellow engineers, and influence the technical trajectory of our products. Working in the industrial automation sector, you will delve into a range of topics that are rarely encountered in conventional software engineering roles. About UsCopia Automation is at the forefront of providing cutting-edge disaster recovery technology for the industrial automation landscape. Our innovative product suite introduces modern developer tools to the operational technology (OT) space, including a unique Git-based source control solution tailored for automation professionals. We are a well-funded startup with a rapidly expanding customer base within the industrial sector. Why Choose Industrial Automation?The manufacturing industry is heavily reliant on industrial automation, utilizing computerized systems and robotics operating on Programmable Logic Controllers (PLCs). This domain employs a distinctive graphical language that does not align with standard developer tools like GitLab or GitHub, leading automation professionals to depend on outdated storage methods and partial solutions. With significant downtime costs, there is a pressing demand for improved development tools. Engineering Culture at CopiaOur engineering team thrives on collaboration, experimentation, and a sense of ownership. We tackle complex challenges together, testing our assumptions and valuing practical solutions that balance immediate needs with long-term objectives. We maintain an optimistic outlook, celebrating our achievements and supporting one another through unforeseen obstacles.
Full-time|$125K/yr - $150K/yr|On-site|New York, New York, United States
CVector is dedicated to revolutionizing economic optimization and AI-driven predictions across energy and manufacturing sectors.Our technology seamlessly integrates critical decision-making factors that impact cost, reliability, and profit margins, providing a unified decision layer that forecasts future scenarios, simulates potential outcomes, and optimizes operational strategies. This allows industrial plants to operate closer to their maximum economic potential daily.This position requires in-office attendance at our New York City headquarters four days a week. CVector serves clients nationwide and operates in challenging industrial environments.Role OverviewAs a Senior Software Engineer specializing in Backend and AI Infrastructure, you will be instrumental in advancing CVector’s backend platform. Your focus will be on developing time-series data systems, AI-enhanced analytics, cloud infrastructure, and data ingestion pipelines that underpin our client-facing applications and internal modeling tools.This role is ideal for engineers who thrive on working closely with data and infrastructure, possess excellent architectural judgment, and are eager to engage with AI systems, databases, and distributed backend services. You will take ownership of complex systems, lead significant technical migrations, and influence the integration of intelligence into industrial energy workflows.You will work in close collaboration with product, modeling, and frontend engineers, significantly impacting the platform's direction, reliability, and scalability for the long term.Key ResponsibilitiesAs a Senior Software Engineer, your contributions will span various interconnected domains:Intelligent SystemsTranslate customer domains and operational workflows into effective prompts and AI system interfaces.Design, implement, and refine evaluations for AI outputs.Incorporate customer feedback and reinforcement signals to enhance system performance.Optimize context selection, retrieval, and trace collection to elevate output quality.Fine-tune smaller models using collected traces to enhance speed while maintaining performance standards.Evaluate and assimilate new AI platforms and models as they emerge.Support the training and deployment of large, time-series-specific models.Backend Platform and Data InfrastructureOversee migrations and enhancements of our time-series data schemas and storage systems.Update and manage PostgreSQL and associated database infrastructure.Develop and sustain data connectors for industrial and external systems.Lead improvements to MQTT-based data ingestion pipelines.Transition PostgREST to a GraphQL-based framework and evolve our API architecture.
Feb 2, 2026
Sign in to browse more jobs
Create account — see all 7,404 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.