An elite hedge fund is seeking a Site Reliability Engineer (SRE) to join its team. As an SRE, you will work closely with the firm's trading and technology teams to ensure the reliability and stability of its trading platform. You will play a critical role in maintaining and improving the platform's performance, security, and scalability.
Responsibilities:
- Monitor the performance and availability of the trading platform
- Troubleshoot and resolve incidents and outages in a timely manner
- Develop and maintain monitoring, logging, and alerting systems
- Collaborate with software engineers to design and implement scalable, fault-tolerant systems
- Continuously improve the platform's reliability, performance, and security
Requirements:
- Bachelor's degree in Computer Science, Engineering, or a related field
- Strong understanding of Linux systems and command line tools
- Knowledge of at least one programming language (e.g. Python, Bash, or Perl)
- Familiarity with containerisation and orchestration technologies (e.g. Docker, Kubernetes)
- Understanding of networking concepts (TCP/IP, DNS, HTTP)
This role is based in the firm's Chicago, NYC, or Austin office on a hybrid schedule of 3 days in the office per week.
