Lead SRE
Our client, one of the world's leading hedge funds, is seeking a Lead Site Reliability Engineer to join their London office. This position offers unrivalled career progression and top of the range compensation, providing the opportunity to work within an elite engineering group at the heart of a globally recognised trading organisation. The successful candidate will lead production engineers, provide deep technical mentorship, and contribute directly to the design and build of mission‑critical infrastructure. They will also collaborate closely with quant trading, research, and data engineering teams to architect systems that are scalable, resilient, and observable from the ground up.
The ideal candidate will have demonstrable experience leading high‑performance engineering teams, along with a strong background in cloud based infrastructure, distributed systems, and large‑scale production environments. They should be a capable coder with excellent Linux and Kubernetes expertise. Candidates with experience operating in high‑level, low‑latency environments within top‑tier Financial Services or Big Tech, coupled with outstanding academic credentials, will be particularly competitive.
Responsibilities
- Lead a team of production engineers, providing mentorship, technical direction, and performance feedback.
- Build, operate, and maintain critical production infrastructure that supports trading and research platforms.
- Troubleshoot complex technical issues across the entire stack applications, systems, networking, and storage under time‑sensitive and high‑pressure conditions.
- Continuously monitor system performance, reliability, and capacity, proactively identifying opportunities for optimisation, automation, and strengthening resilience.
Qualifications
- Strong leadership experience with a proven track record of developing and scaling high‑performing engineering teams.
- Expertise with Linux systems and shell scripting.
- Deep experience with Kubernetes, cloud infrastructure, distributed systems, and large‑scale infrastructure operations.
- Experience with Java, including JVM tuning and heap management.
- Hands‑on experience with monitoring, logging, observability, and automation frameworks.
FAQs
Congratulations, we understand that taking the time to apply is a big step. When you apply, your details go directly to the consultant who is sourcing talent. Due to demand, we may not get back to all applicants that have applied. However, we always keep your resume and details on file so when we see similar roles or see skillsets that drive growth in organizations, we will always reach out to discuss opportunities.
Yes. Even if this role isn’t a perfect match, applying allows us to understand your expertise and ambitions, ensuring you're on our radar for the right opportunity when it arises.
We also work in several ways, firstly we advertise our roles available on our site, however, often due to confidentiality we may not post all. We also work with clients who are more focused on skills and understanding what is required to future-proof their business.
That's why we recommend registering your resume so you can be considered for roles that have yet to be created.
Yes, we help with resume and interview preparation. From customized support on how to optimize your resume to interview preparation and compensation negotiations, we advocate for you throughout your next career move.