About the job
Join Trexquant as a dedicated Linux Systems Engineer, where you will play a pivotal role in constructing, managing, and scaling our on-premise infrastructure. This hands-on position emphasizes bare-metal Linux systems and offers significant ownership of server provisioning, storage management, and overall system reliability.
The ideal candidate will be meticulous, highly disciplined, and adept at thriving in a challenging, production-critical environment. This role is specifically tailored for individuals with extensive experience in physical infrastructure, Linux internals, and automation within a high-performance context.
Key Responsibilities:
- Construct, provision, and manage bare-metal Linux servers, overseeing OS installation, configuration, and lifecycle management.
- Take full ownership of server infrastructure from hardware through to the operating system and core services, ensuring optimal stability and performance.
- Configure and manage storage systems, including ZFS and enterprise storage platforms (e.g., NetApp, Dell, or similar).
- Monitor system health and performance, troubleshoot issues, and implement sustainable long-term solutions.
- Develop and maintain automation scripts using Ansible and Bash to standardize provisioning, configuration, and operational procedures.
- Conduct system patching, upgrades, and capacity planning across an expanding server fleet.
- Engage in incident response and root cause analysis, focusing on the continuous improvement of system reliability.
- Collaborate with engineering and infrastructure teams to optimize application performance on Linux systems.
- Contribute to documentation, runbooks, and operational best practices.
- Support data center operations as required, including hardware troubleshooting, racking, cabling, and server replacements.
- Occasionally travel to data centers for maintenance, expansions, and problem resolution.

