Production Reliability Engineer (SWE/SRE)
Chicago, IL (hybrid, 2-3 days in office)
An elite High Freqeuncy Trading Firm is currently in search for a SWE/SRE.
In this cross functional role, you'll spend 75-80% coding -- work on building infrastructure from scratch and work on distributed systems and working on reliability, optimization and performance the other time (which is important as they're trying to maintain/create that low latency environment) .
What you will do:
- Work with traders, back-office teams, exchanges, and developers to optimize the trading environment and investigate and solve system issues.
- Own the production environment, driving performance, reliability, and operability through continuous improvement
- Proactively monitor and troubleshoot large-scale trading systems and exchange connectivity
- Leverage firm-wide metrics to improve scalability and system performance
- Collaborate across the technology organization to analyze and troubleshoot complex system problems
- Interact directly with traders to communicate and drive technology changes, manage incidents, and troubleshoot problems
Skills you will need:
- Experience in python and shell scripting
- Familiarity with C++ helpful but not required
- A rigorous, detail-oriented approach to operations
- Strong understanding of the linux operating system, including network and system configuration, kernel internals, scheduling, performance tuning
- Strong understanding of networking concepts such as routing, multicast, LLDP, VLAN tagging, ethernet
- Reliable and predictable availability