Site Reliability Engineer at Cloaked | New York, NY
About Cloaked
Cloaked is on a mission to redefine the relationship between users and their personal data. With our cutting-edge technology, we empower users to make informed choices about their privacy, ensuring a safer and more intuitive online experience.
Similar jobs
Join Fluidstack as a Network Engineer!
At Fluidstack, we are at the forefront of building cutting-edge infrastructure designed for abundant intelligence. Collaborating with leading AI labs, government entities, and major enterprises such as Mistral, Poolside, and Meta, we strive to deliver compute capabilities at unprecedented speeds.
Our mission is to accelerate the realization of Artificial General Intelligence (AGI). We are urgently seeking passionate individuals who are committed to delivering exceptional infrastructure. At Fluidstack, we take pride in our work, treating our customer outcomes as our own. If you are driven by purpose and excellence, and ready to put in the effort necessary to shape the future of intelligence, we invite you to join us!
Position Overview
Fluidstack is on the lookout for a Network Engineer specializing in Reliability & Observability. In this pivotal role, you will act as a reliability engineer, leading the charge in developing processes, collecting data, and establishing reliability metrics aimed at enhancing the quality and dependability of AI networks throughout all operational phases.
Your primary focus will be on creating systems, tools, and data pipelines to boost network quality, while also automating metrics reporting (24/7) and generating periodic reliability assessments for both internal teams and customers.
This position is perfect for seasoned network operators who possess a deep passion for reliability and have experience in designing and implementing full lifecycle software, including conducting Quality Assurance audits and analyzing failure rates. A strong interest in hardware (both electronics and optics) and software development is essential, alongside a commitment to leveraging data for informed decision-making in deployment and operations.
We encourage experienced Site Reliability Engineers (SREs) with a strong networking background to apply!
Key Responsibilities
Quality Assurance Ownership: Design and implement QA processes tailored for network hardware and networks.
Data Pipelines: Develop and deploy both serverless and manually triggered workflows to generate network quality and reliability observability for our clients.
Deployment and Operations Assistance: Collaborate with various teams to support full lifecycle data collection, analysis, and process enhancements aimed at meeting service level agreements (SLAs) and objectives (SLOs).
Process Engineering: Innovate and implement process improvements to streamline deployment and operational workflows.
Join Fluidstack as a Senior Electrical Engineer
At Fluidstack, we are pioneering the infrastructure needed for abundant intelligence. Collaborating with leading AI labs, governments, and enterprises such as Mistral, Poolside, Black Forest Labs, and Meta, we are striving to enable compute capabilities at unprecedented speeds.
Our mission is to make Artificial General Intelligence (AGI) a reality, and we are motivated to create world-class infrastructure. We take great pride in the systems we build and the trust we establish with our clients. If you are driven by a sense of purpose, passionate about excellence, and ready to work diligently towards accelerating the future of intelligence, we invite you to join us in shaping what’s next.
Key Responsibilities
Review design and equipment submissions for new Data Centers within your designated area.
Diagnose issues, perform Root Cause Analysis (RCA), and draft Corrective Action (CA) documentation for site and equipment failures.
Provide direct support for operational challenges through ad-hoc training and comprehensive reviews of complex operating procedures.
Lead the design for upgrades and solutions in existing data centers to enhance capacity, reliability, and efficiency.
Collaborate with internal teams including design engineering, server hardware, and environmental health and safety to uphold consistent service standards.
Manage multiple projects across various geographical locations simultaneously.
Initiate and conduct engineering audits at Fluidstack data centers, generating reports that detail risks along with recommended mitigations.
Act as the resident engineer during new construction projects, supporting construction, commissioning, and project turnover.
Your Qualifications
Bachelor’s Degree in Electrical Engineering or equivalent experience.
Demonstrated strong engineering judgment with the ability to make informed recommendations in uncertain situations.
Proven experience in developing engineered solutions to solve complex problems.
Experience managing engineering projects and consultants effectively.
Ability to build trust and foster relationships with various stakeholders including Operations, Controls, Construction, Design, Commissioning, and Product Managers.
Adaptability and willingness to engage in fieldwork as needed.
About Fluidstack
At Fluidstack, we are pioneering the infrastructure for abundant intelligence. Collaborating with leading AI laboratories, governmental bodies, and major enterprises, including Mistral, Poolside, and Meta, we strive to unlock compute capabilities at unprecedented speeds.
With a mission to actualize AGI, our team operates with a sense of urgency and commitment to excellence. We prioritize our customers' outcomes as our own and take pride in the systems we create and the trust we establish. If you are driven by purpose, passionate about excellence, and eager to contribute to the future of intelligence, we invite you to join us in shaping what comes next.
About the Role
We are on the lookout for exceptional talent scouts capable of identifying and securing the top 0.01% of candidates worldwide. In this role, you will play a vital part in building Fluidstack's team by sourcing outstanding talent, evaluating candidates against a high technical standard, and executing swiftly in a startup environment. Collaborating closely with hiring managers and leadership, you will craft innovative sourcing strategies to uncover hidden gems, design thorough evaluation frameworks that distinguish the good from the extraordinary, and successfully engage candidates who have numerous opportunities at their fingertips. Our team is deeply aligned with Fluidstack's commitment to constructing the backbone of AI and supporting the most essential AI companies globally, starting with exceptional talent.
What You’ll Do
Source the Unsourceable: Develop innovative pipelines to engage top 0.01% talent before they are discovered by others, utilizing unconventional channels, specialized technical communities, and personalized outreach.
Assess at the Highest Technical Bar: Collaborate with recruiters and hiring managers to create and implement rigorous sourcing and evaluation processes that identify exceptional technical capabilities beyond just resume qualifications.
Build Scalable Talent Infrastructure: Design sourcing playbooks, evaluation frameworks, and data systems that uphold quality while we scale rapidly.
Focus/What We Value
Exceptional Sourcing Skill: Profound expertise in identifying passive candidates across technical communities, GitHub, research publications, conference presentations, and other non-traditional channels.
Technical Fluency: Competency in evaluating highly technical candidates, discerning nuanced skill differences, and engaging in credible discussions with engineers.
About Fluidstack
At Fluidstack, we are pioneering the infrastructure that powers advanced artificial intelligence. Collaborating with leading AI laboratories, government entities, and major corporations, including Mistral, Poolside, Black Forest Labs, and Meta, we aim to deliver compute capabilities at unparalleled speeds.
Our mission is to expedite the realization of Artificial General Intelligence (AGI). Our team is driven by a sense of urgency and is dedicated to providing top-tier infrastructure. We view our clients' success as our own and take pride in the robust systems we create and the trust we cultivate. If you are inspired by meaningful work, strive for excellence, and are prepared to exert yourself to advance the future of intelligence, we invite you to join us in shaping what lies ahead.
About the Role
In the capacity of a System Engineer for our GPU Fleet, you will oversee, operate, and optimize our large-scale GPU compute infrastructure, which is essential for AI/ML training and inference processes. Your role will ensure the high availability, performance, and reliability of our GPU server fleet through automation, monitoring, troubleshooting, and collaboration with hardware engineering, platform teams, and data center operations.
Key Responsibilities
Maintain and operate a vast GPU server fleet (H100, B200, GB200) catering to AI/ML workloads; continuously monitor system health, performance, and utilization to ensure maximum uptime and adherence to SLA.
Conduct hands-on troubleshooting and root cause analysis for complex hardware, firmware, operating system, and application issues across GPU clusters; collaborate with vendors and hardware teams to rectify systemic failures.
Create and sustain automation scripts for efficient provisioning, configuration management, monitoring, and remediation on a large scale.
Enhance tools for GPU health assessments, performance diagnostics, driver validation, and automated recovery processes.
Implement server provisioning, configuration, firmware updates, and OS installations utilizing automation frameworks; manage lifecycle operations encompassing deployment, maintenance, and decommissioning.
Engage in 24x7 on-call rotation; respond to production incidents and coordinate resolution efforts with cross-functional teams, including data center operations, network engineering, and application teams.
Lead post-incident reviews, document root causes, and spearhead continuous improvement initiatives focused on automation, reliability, monitoring, and operational efficiency.
About Fluidstack
At Fluidstack, we are pioneering the infrastructure that powers intelligent systems. Collaborating with leading AI labs, government entities, and industry giants, including Mistral, Poolside, Black Forest Labs, and Meta, we are dedicated to delivering computational capabilities at unprecedented speeds.
Our mission is to transform Artificial General Intelligence (AGI) from concept to reality. Our team is driven by a shared commitment to building world-class infrastructure, where we prioritize our customers' success as if it were our own. If you are passionate about purpose, dedicated to excellence, and ready to contribute to shaping the future of intelligence, we invite you to join us in this exciting journey.
Role Overview
As a Transaction Manager, you will spearhead the commercial processes to secure data center capacity and associated infrastructure, guiding projects from initial market outreach to negotiation and contract finalization. You will collaborate closely with legal, finance, engineering, networking, and delivery teams to convert technical requirements into practical deal frameworks and achieve timely contract execution.
About Fluidstack
At Fluidstack, we are at the forefront of building infrastructure that empowers cutting-edge intelligence. We collaborate with leading AI laboratories, government entities, and enterprises, including Mistral, Poolside, Black Forest Labs, and Meta, to facilitate computing capabilities at unprecedented speeds.
Our mission is to accelerate the realization of Artificial General Intelligence (AGI). We are a highly motivated team dedicated to delivering premier infrastructure solutions. We treat our customers’ success as our own, taking immense pride in the systems we create and the trust we establish. If you are driven by purpose, committed to excellence, and prepared to work diligently to shape the future of intelligence, we invite you to join us in pioneering what’s next.
About the Role
As a Technical Program Manager for Deployments, you will spearhead the comprehensive program delivery necessary to activate data center infrastructure and AI clusters. Operating in a dynamic and often ambiguous environment, you will coordinate efforts across construction, infrastructure, networking, and operations teams to guarantee deployment readiness and successful launches.
This role necessitates an ownership mentality. You will be expected to advance programs using sometimes partial information, identify and resolve gaps, and ensure momentum across teams responsible for construction, networking, hardware, and operations. While the physical installation and construction execution are handled by partner teams, you will hold accountability for overall delivery outcomes.
Focus Areas
Oversee program delivery for data hall and AI cluster activation from planning to operational handoff.
Facilitate cross-functional collaboration among teams responsible for construction, infrastructure, network, hardware, and operations.
Convert hyperscale AI/GPU and DLC infrastructure requirements into actionable deployment strategies.
Anticipate downstream effects of technical sequencing decisions on capacity, scheduling, and performance.
Drive alignment among teams operating at varied speeds, scopes, and time zones.
Effectively navigate ambiguity, making progress with incomplete information and evolving requirements.
Maintain clear program documentation and communicate status, risks, and decisions to stakeholders.
Keep momentum through changing requirements, tight timelines, and cross-time-zone execution.
About You
5+ years of experience in managing complex data center or infrastructure programs in fast-paced environments.
About Fluidstack
At Fluidstack, we are at the forefront of developing the infrastructure that powers abundant intelligence. Partnering with leading AI laboratories, governmental bodies, and enterprises, including Mistral, Poolside, Black Forest Labs, and Meta, we strive to unlock computing capabilities at unprecedented speeds.
Our mission is urgent: to make Artificial General Intelligence (AGI) a reality. Our team is driven, passionate, and dedicated to delivering exceptional infrastructure solutions. We view our customers' success as our own, taking immense pride in the systems we create and the trust we cultivate. If you are purpose-driven, committed to excellence, and eager to work hard to shape the future of intelligence, we invite you to join us on this exciting journey.
About the Role
As an Infrastructure Deployment Engineer, you will serve as a Subject Matter Expert (SME) in fiber optics, structured cabling, and data center physical infrastructure. You will oversee the complete lifecycle of physical installation projects, coordinating Low Voltage Contractor (LVC) schedules, conducting thorough Quality Assurance (QA) and Quality Control (QC) checks on contractor work, and ensuring final installations comply with all design standards, safety regulations, and labeling protocols.
Key Responsibilities
1. Project Oversight & Contractor Management
Act as the primary on-site technical liaison for the Low Voltage Contractor (LVC) and our internal engineering teams.
Oversee the LVC's deployment schedule, ensuring compliance with project timelines while reporting progress and any delays to project management.
Coordinate access, material delivery, and staging logistics with data center facility operations.
Function as a remote hands expert for remote engineering teams during deployment and validation phases.
2. Quality Assurance (QA) & Quality Control (QC)
Conduct comprehensive, multi-stage QA inspections on all LVC work, including rack placement, server/device racking, cable routing, and labeling.
Ensure all infrastructure components, including racks, servers, network devices, and storage units, are installed and grounded in accordance with the specified Bill of Materials (BoM) and design documentation.
Fiber Optic and Cabling Validation: Conduct or supervise optical loss testing (OLTS), OTDR analysis, and insertion/return loss evaluations on all installed fiber links.
Join Fluidstack as a Security Lead for Data Centers
At Fluidstack, we are pioneering the infrastructure for abundant intelligence. Collaborating with prestigious AI laboratories, governmental bodies, and leading enterprises such as Mistral, Poolside, and Meta, we aim to deliver computing capabilities at extraordinary speeds.
Our mission is to make Artificial General Intelligence (AGI) a reality, and we are committed to creating world-class infrastructure. We prioritize our customers' success and take pride in the systems we develop and the trust we cultivate. If you are driven by a strong sense of purpose, passionate about excellence, and eager to work diligently to shape the future of intelligence, we invite you to be a part of our journey.
Role Summary
As the Security Lead for Data Centers, you will be at the forefront of designing and implementing robust physical and logical security frameworks across our distributed data center network. Your responsibilities will encompass setting security standards for facility developments, spearheading the design and implementation of secure network infrastructure, managing vendor and contractor security protocols, and ensuring adherence to regulatory and customer requirements. This role will involve collaboration with engineering, construction, and operations teams to integrate security throughout every layer of Fluidstack's GPU cloud infrastructure, safeguarding our AI infrastructure from advanced threats.
Key Responsibilities
Develop security processes and policies that evolve with Fluidstack's rapid infrastructure expansion.
Lead network security engineering initiatives, providing both consultative and implementation support to infrastructure teams.
Design and uphold comprehensive security frameworks across Fluidstack's distributed infrastructure.
Assist in the development and enforcement of security system standards compliant with US DoD and international standards.
Guide cross-functional data center architecture and operations, focusing on infrastructure, operations, and support.
Foster strategic alignment across various security functions that integrate into all aspects of data center operations.
Who You Are
10+ years of experience in designing and implementing security systems for hyperscale or mission-critical environments.
In-depth understanding of the threat landscape at the convergence of IT and OT.
Proven experience in leading cross-functional teams and initiatives.
Join Fluidstack as an IT Engineer - Data Center
At Fluidstack, we are at the forefront of developing the infrastructure that powers advanced artificial intelligence. We collaborate with leading AI laboratories, government entities, and major corporations such as Mistral, Poolside, Black Forest Labs, and Meta to deliver computational power at unprecedented speeds.
Our mission is to turn artificial general intelligence into reality. To achieve this, we have assembled a team of passionate individuals who are dedicated to creating world-class infrastructure. We regard our clients' success as our own, taking immense pride in the systems we construct and the trust we build. If you are driven by a strong sense of purpose, possess a relentless pursuit of excellence, and are eager to work tirelessly to shape the future of intelligence, we invite you to join us in crafting what comes next.
About the Role
As an IT Engineer for our Data Center, you will serve as the essential on-site IT support for our operations. You will manage the entire device lifecycle, from provisioning macOS, Windows, and iOS devices for new hires, to ensuring secure and compliant disposal of equipment. You will be the primary resource for troubleshooting hardware, software, and networking challenges within a dynamic AI infrastructure, working alongside remote IT and Security teams to ensure thorough documentation, escalation, and resolution of issues. Beyond routine maintenance, you will contribute significantly to our security compliance efforts, supporting SOC 2 and ISO 27001 standards, gathering audit evidence, and enforcing IT procedures that safeguard our data centers and user environments.
Key Responsibilities
Act as the main on-site contact for hardware and software issues, diagnosing and resolving problems across a variety of devices including laptops, desktops, mobile devices, and networking equipment, while escalating issues with detailed documentation when necessary.
Prepare, configure, and deploy end-user devices (macOS, Windows, iOS/iPadOS) for new employees and contractors, ensuring Mobile Device Management (MDM) enrollment and identity provider account setups are completed before their start date.
Maintain an accurate inventory of hardware assets, tracking serial numbers, assigned users, locations, and lifecycle status from acquisition to secure retirement and disposal (ITAM/ITAD).
Collect, wipe, and manage returns of equipment from offboarding employees and contractors, adhering to data destruction and chain-of-custody protocols.
Assist in meeting security compliance requirements (SOC 2, ISO 27001, or similar) by following and enforcing established IT procedures, contributing to the collection of audit evidence, and notifying the Security team of any policy gaps.
Join Fluidstack and Elevate Your Career
At Fluidstack, we are pioneering the infrastructure for abundant intelligence, collaborating with leading AI labs, government entities, and enterprises such as Mistral, Poolside, Black Forest Labs, and Meta. Our mission is to enable compute capabilities at unprecedented speeds.
We are on an urgent quest to realize Artificial General Intelligence (AGI). Our team is driven by passion and commitment to deliver first-rate infrastructure. We consider our customers' success as our own, taking immense pride in the systems we construct and the trust we cultivate. If you are fueled by purpose, strive for excellence, and are prepared to work diligently to accelerate the future of intelligence, we invite you to join us in shaping what lies ahead.
About Our Team
The Data Center Operation team plays a crucial role in supporting our rapid growth by managing the deployment and operation of hyperscale data centers. Our team is responsible for the comprehensive onsite management of each facility, overseeing the entire lifecycle of our hardware fleet and delivering scalable, reliable infrastructure solutions and services.
Your Responsibilities
We seek candidates with a robust technical infrastructure background, a passion for engineering excellence, and a commitment to providing effective and sustainable solutions designed for the future. You should have a keen understanding of the challenges related to deploying infrastructure at scale and possess experience in designing systems and platforms that set global benchmarks for performance, security, availability, and cost. Your contributions will have a global impact on our processes.
Act as a Subject Matter Expert for the DC Operations teams and manage escalations related to critical infrastructure and facilities.
Assess and manage design constraints and risks within the data center, regularly conducting audits and collaborating with our colocation partners or service providers to evaluate their maintenance programs.
Lead the change management process for high-risk maintenance activities in our data centers.
Serve as the first responder onsite for incidents, coordinating support from SMEs in data center engineering and infrastructure.
Provide onsite services to support various initiatives while fostering positive and collaborative relationships with customers, partner teams, vendors, and internal stakeholders.
Sigma Computing
Role Overview
Sigma Computing is growing its engineering team in New York City. The Senior Software Engineer - Observability and Reliability will help build technology that makes data accessible for all. This role focuses on improving how systems are monitored, measured, and maintained for reliability.
What You Will Do
Design and build observability tools and platforms, including metrics collection, logging, distributed tracing, dashboarding, alerting, and application performance management.
Work with technologies such as Go, OpenTelemetry, and Kubernetes.
Take part in on-call rotations to help maintain high service uptime.
Develop runtime tools and processes that support cloud triaging and help minimize downtime.
Define and promote best practices for monitoring and measuring systems and services.
Collaborate with engineers and stakeholders through design and code reviews, with a strong emphasis on hands-on coding.
About Fluidstack
At Fluidstack, we are pioneering the infrastructure for abundant intelligence. Collaborating with leading AI laboratories, government entities, and enterprises including Mistral, Poolside, Black Forest Labs, and Meta, we are committed to unlocking computational power at unprecedented speeds.
Our urgent mission is to transform the vision of AGI into reality. We are a highly driven team dedicated to delivering exceptional infrastructure. We regard the success of our clients as our own, taking pride in the systems we create and the trust we build. If you are driven by purpose, passionate about excellence, and eager to contribute to accelerating the future of intelligence, we invite you to join us in shaping what comes next.
About the Role
In this pivotal role, you will focus on substation and interconnection design through the medium voltage (MV) distribution backbone of our data center campuses and protection schemes.
You will serve as a technical leader on projects related to utility-scale interconnections at 138kV and above, working directly with utilities, Independent System Operators (ISOs), third-party engineering firms, and equipment manufacturers. You will also support the design of medium voltage distribution for data centers, owning the technical quality of our HV/MV designs and helping to establish engineering standards that can be replicated across multiple sites.
Fluidstack
About Fluidstack
Fluidstack is at the forefront of building infrastructure that powers abundant intelligence. We collaborate with leading AI laboratories, governmental entities, and enterprises such as Mistral, Poolside, Black Forest Labs, and Meta, aiming to unlock compute capabilities at unparalleled speeds.
Our mission is urgent: to transform AGI from concept to reality. Our team is driven, dedicated, and focused on delivering top-tier infrastructure. We take our customers' outcomes personally, valuing the systems we create and the trust we build. If you are passionate about purpose, strive for excellence, and are ready to work diligently to expedite the future of intelligence, we welcome you to join us in shaping what’s next.
About the Role
We are looking for a Lead, Network Connectivity & Strategy who will take complete ownership of our external connectivity strategy, overseeing everything from developing carrier relationships and procuring circuits to supporting site selection, forecasting capacity, and collaborating on backbone fiber optic topology. You will ensure that every Fluidstack datacenter, whether new or existing, has diverse, resilient, and timely connectivity, all while ensuring our backbone network scales in anticipation of future demand.
This position intersects network engineering, infrastructure planning, and commercial execution. You will work closely with deployment teams to secure the essential connectivity foundations they require, collaborate with ICT and outside plant teams on fiber path designs, and engage directly with carriers and dark fiber providers to negotiate and deliver services within tight timelines.
Your success will mean that Fluidstack is never held back by connectivity. Each time a new datacenter build is ready for service, diverse fiber paths will already be operational. When leadership inquires about potential new locations, you will have the connectivity landscape mapped out within days. When the backbone team needs a new Point of Presence (POP), you'll have identified candidate locations and provider options in advance.
About Fluidstack
Fluidstack builds infrastructure for artificial superintelligence, working closely with organizations like Anthropic, Google, Meta, AMI Labs, and Black Forest Labs. The company is investing tens of billions in U.S. infrastructure, targeting 1GW deployed by 2026 and 10GW by 2027. The team values ownership, high standards, and customer focus. Every team member’s contribution matters, and no task is too small.
Role Overview: Entry-Level Data Center Technician
This Entry-Level (L1) Data Center Technician position is based in Buffalo, NY. The role is well suited for recent graduates or those starting a career in data center operations. The main focus is maintaining the physical infrastructure of GPU supercomputers to support AI clients. Fast response to on-call requests is essential. Fluidstack offers strong growth potential for technicians who show initiative and deliver solid results.
Main Responsibilities
Organize and manage cables, including reseating and swapping network, power, and data cables.
Swap server trays and replace hardware following established protocols.
Respond promptly to on-call requests and meet strict SLA requirements.
Document all maintenance work and hardware changes in tracking systems.
Perform visual inspections to help maintain data center integrity.
Cloaked is an innovative privacy startup committed to restoring consumer confidence in the handling of personal data. Our mission is to build an internet that prioritizes user needs, placing individual privacy and opt-in choices at the forefront. Our flagship product serves as a virtual 'cloak' for users, enabling them to navigate websites like Facebook and Amazon while controlling the sharing of their private information according to their preferences.
Palantir Technologies
Join a Revolutionary Company
At Palantir, we create cutting-edge software that transforms data into actionable insights, enabling our partners to make critical decisions. Our platforms support initiatives such as developing lifesaving medications, predicting supply chain issues, and finding missing children.
About the Position
As a Product Reliability Engineer (PRE), you will be instrumental in ensuring the health, performance, and reliability of our services. You will oversee the complete lifecycle of service reliability, addressing outages, refining codebases, and implementing sustainable solutions.
Your responsibilities will include solving complex issues for key clients, enhancing observability in intricate systems, tackling technical debt in vital codebases, and guiding strategic investments in our core offerings. We seek engineers who are passionate about in-depth troubleshooting, possess a strong sense of ownership, and understand the urgency associated with customer-impacting outages.
PREs primarily focus on proactive product initiatives, which involve infrastructure migrations, stability enhancements, and codebase improvements aimed at boosting resilience. During scheduled on-call shifts, you will respond to automated alerts, investigate customer-reported issues, and collaborate with adjacent product teams by sharing your technical insights.
Regardless of the technical challenge you face, you will play a pivotal role in resolving it permanently, rather than just implementing temporary fixes. We offer comprehensive onboarding support with experienced mentors to help new team members thrive in their roles.
At Block, we are more than just a company; we are a collective of diverse teams united by a common mission of economic empowerment. Our foundational teams (including People, Finance, Counsel, Hardware, Information Security, and Platform Infrastructure Engineering) collaborate across various business sectors and global time zones to create inclusive policies, provide financial forecasting, deliver legal support, secure our systems, and nurture innovative initiatives. Every challenge we face opens new opportunities, and we value diverse perspectives to uncover them. We invite you to bring yours to Block.
The Role
As a vital member of our Site Reliability Engineering (SRE) team, you will take on the dual responsibility of proactively enhancing and reactively managing the reliability of Block's platform and critical infrastructure. You are driven by metrics, possess a systems-oriented mindset, and are dedicated to building distributed platforms that facilitate safe, scalable product development. You will utilize and continuously refine AI-driven tools and automation to boost observability, expedite incident detection and response, and minimize operational toil. This includes applying AI techniques to incident analysis, alert tuning, and operational workflows. Your role will also involve primary platform on-call duties (12 hours a day, one week every few weeks, depending on team size), supporting Block's most critical (Tier 0) services. In this capacity, you will lead incident command, coordinate mitigation efforts, and ensure effective escalation during high-severity incidents.
You Will
Build and extend platforms to enhance system reliability.
Collaborate on team objectives that prioritize reliability across the entire company.
Standardize reliability tools across multiple platforms and departments.
Triage, coordinate, and lead stabilization efforts for severity 0–1 incidents.
Serve as the primary on-call engineer, maintaining clear escalation paths and demonstrating leadership during escalations.
Drive improvements in platform-wide reliability, shared operational tools, and safe deployment patterns.
Leverage AI-driven systems to enhance signal detection, reduce noise, and accelerate root cause analysis.
Design and implement safe deployment strategies (including progressive delivery, automated rollback, and guardrails).
You Have
A strong inclination towards identifying root causes in complex systems and implementing necessary fixes.
Proven technical initiative and leadership on prior projects, particularly those focused on backend/platform.
Experience with AI-driven tools for observability, incident analysis, or automation.
A mindset that naturally re-evaluates existing processes to drive continual improvement.
Join Fluidstack as a Network Engineer
At Fluidstack, we are pioneering the infrastructure for the next era of intelligence. By collaborating with prominent AI laboratories, governmental bodies, and leading enterprises like Mistral, Poolside, Black Forest Labs, and Meta, we strive to deliver computing capabilities at unprecedented speeds.
Our mission to realize Artificial General Intelligence (AGI) drives our team to work with passion and dedication. We prioritize our clients' success as our own and take immense pride in the infrastructure we build and the trust we cultivate. If you are purpose-driven, committed to excellence, and excited to contribute to the future of intelligence, we invite you to join us on this innovative journey.
Role Overview
We are seeking a skilled Network Engineer to become an integral part of our Deployment & Integration team. In this hands-on role, you will be responsible for constructing and validating large-scale AI data center network infrastructures. Your responsibilities will include deploying modern data center fabrics, configuring switches, validating physical layer connections, collaborating with cross-functional teams to address challenges, and ensuring a seamless transition to operations.
This position is perfect for engineers who thrive in dynamic environments and wish to gain extensive experience in large-scale data center deployments. You will collaborate closely with senior engineers who will provide you with technical mentorship and structured onboarding as you hone your skills in AI fabric deployment. Achieving success in this role means independently managing pod deployments, becoming the go-to expert for field execution, and progressing into leadership roles as our organization expands.
Key Responsibilities
Deployment Execution: Oversee the deployment and validation of data center network infrastructure, including front-end and back-end fabrics, Building Management Systems (BMS), and management networks. You will configure switches, install and validate optics, coordinate fiber and cabling work, and guide deployments to completion, taking ownership of transforming designs into operational networks.
Physical Layer Validation: Ensure physical connectivity adheres to production standards. Coordinate with structured cabling teams for fiber remediation, validate insertion loss and Optical Time-Domain Reflectometer (OTDR) traces, troubleshoot optical layer issues, and document physical infrastructure as-built. You will develop expertise in diagnosing and resolving physical layer challenges that impede deployments.
Hardware Lifecycle Management: Manage hardware logistics such as device staging, rack and stack coordination, Return Merchandise Authorization (RMA) processes, and Data Center Infrastructure Management (DCIM) updates. Monitor hardware inventory, coordinate vendor shipments, and ensure device readiness.
Join Fluidstack as a Lead Network Technician
At Fluidstack, we are passionate about creating the infrastructure that powers the future of artificial intelligence. Our collaborations with leading AI labs, government entities, and renowned enterprises (such as Mistral, Poolside, Black Forest Labs, and Meta) enable us to deliver computational capabilities at unprecedented speeds.
As we strive to make Artificial General Intelligence a reality, our team is composed of highly driven individuals dedicated to building world-class infrastructure. We take immense pride in our work, treating our customers' outcomes as if they were our own. If you are motivated by a meaningful purpose and possess a relentless pursuit of excellence, we invite you to join us in shaping the future of intelligence.
Your Role
We are looking for a skilled Datacenter Operations Technician to be the primary on-site expert for network and structured cabling matters within our AI datacenter infrastructure. As the Tier 3 escalation point for network and ICT-related incidents, you will be responsible for diagnosing physical layer faults, validating repairs, and ensuring rapid issue resolution that directly impacts customer workloads.
This position is part of the Datacenter Operations team and is designated for a specific site. You will collaborate closely with Tier 1 and 2 technicians, as well as Network Operations and cross-functional teams, to uphold the reliability and performance of our high-density AI infrastructure. When issues arise that exceed on-site capabilities, you will escalate them with comprehensive documentation to Network Operations for further investigation.
The ideal candidate will possess a robust background in structured cabling and fiber infrastructure, coupled with sufficient network knowledge to interpret device-level diagnostics and confirm that physical repairs restore connectivity. You should be comfortable taking full ownership of issues until they are resolved, serving as the sole network/ICT escalation resource on-site.
About Fluidstack
At Fluidstack, we are on a mission to create the infrastructure for abundant intelligence. Collaborating with leading AI labs, government entities, and top-tier enterprises (including Mistral, Poolside, and Black Forest Labs), we aim to deliver compute solutions at unprecedented speeds.
We are driven by a sense of urgency to turn Artificial General Intelligence (AGI) into a reality. Our team is filled with highly motivated individuals dedicated to building world-class infrastructure. We treat our customers' success as our own and take pride in the systems we create and the trust we establish. If you are passionate about making a difference, committed to excellence, and prepared to work diligently to advance the future of intelligence, we welcome you to join us in shaping what comes next.
About the Role
As the Accounting Manager at Fluidstack, you will play an essential role within our Finance department, overseeing the core accounting functions that ensure the accuracy, integrity, and scalability of our financial records during our rapid growth phase. You will manage daily general ledger operations, lead the month-end and year-end closing processes, and facilitate the preparation of timely and reliable financial reports. In this position, you will collaborate cross-functionally with various teams and work closely with senior finance leadership and external auditors to uphold strong internal controls and audit readiness. This is a high-impact role for someone who thrives in a fast-paced environment and is eager to develop and refine top-tier accounting processes and infrastructure as our company expands.

