Site Reliability Engineer
London, UK
5d ago


Mimecast is looking for a Site Reliability Engineer to join the Secrets Team. The team designs, implements and operates the next-generation secrets management service and the next-generation authentication and authorization framework for Mimecast services, and is building a large-scale, high-throughput key management service.

Ideal candidates share a passion for tackling complexity with simplicity and for building platforms that scale massively, and are not afraid to explore unconventional solutions.

As a Site Reliability Engineer, you will be passionate about keeping our services running optimally at all times and about detecting and addressing any anomalous behaviour with minimal delay.

When necessary, you will also handle incidents and, most importantly, protect customers by minimising the impact on them through swift, effective communication and quick solutions in high-pressure situations.


What we’ll expect from you:

You will be responsible for:

  • Monitoring, alerting, SLO tracking (Grafana, Graphite).
  • Detection / early warning / self-heal.
  • Facilitate emergency / incident response.
  • Change management and delivery pipeline into production.
  • Provisioning / automating deployment infrastructure.
  • Ensure safety, predictability, repeatability and auditability of all build and deploy processes.
  • Create, maintain and test for resilience & recovery.
  • Efficiency and cost management.
  • Performance and scalability of the services.
  • Provide a level of audit and control to security personnel.
  • Collaborate with the team to come up with technical designs and solutions with adequate documentation.
  • Work closely with the team and Architects to discuss solutions and implementation strategy.
  • Help maintain the platform by fixing bugs and issues coming from the operational pipeline.
  • Champion best practices and enable and support more junior members of the team.
Qualifications

    The ideal candidate would have knowledge / experience in the following:

    Essential:

  • Distributed Systems Architecture and best practices.
  • Strong background in traditional ops areas of expertise: Linux, disk I/O, networking.
  • Knowledge of encryption protocols, public-private key infrastructure, symmetric vs asymmetric keys, current cryptographic hashing algorithms and their use in HTTPS / SSH and other protocols.
  • Continuous delivery: principles and pragmatics of dealing with build pipelines, artifact repositories, zero-downtime deployment and modern software engineering best practices.
  • Proving resilience via failure injection (Chaos Monkey) and DR testing, and scalability via load and stress testing.
  • Familiarity with Docker and the container ecosystem.
  • Sensitivity to (but also boldness to influence) culture and behaviour across an organisation.
  • Using, developing and debugging on Linux.
  • Developing and debugging in a distributed environment.
  • Familiarity with distributed version control systems such as Git.
  • Experience with software engineering collaboration tools such as Jira, Confluence etc.
  • Experience working in an agile environment using approaches such as Scrum and Kanban.
    Desirable (experience in one or more of the following):

  • Microservice architecture.
  • Experience with OAuth, OpenID Connect, SAML and preferably LDAP, ADFS, WS-Federation.
  • Working knowledge of Public Key Infrastructure, envelope encryption, key management, and Identity and Access Management.
  • Knowledge of network programming and protocols, including REST and HTTP.
  • Hands-on experience in configuration management of server farms (using tools such as Puppet, Chef, Ansible etc.).
  • Hands-on experience with open source monitoring and graphing solutions such as Nagios, Zabbix, Sensu, Graphite, Grafana, etc.
  • Extensive experience with at least one scripting language apart from Bash (Ruby, Perl, Python, Go).
  • Ability to read / debug Java code, experience with Java tuning (GC especially).
  • Ability to analyze network behaviour, performance and application issues using standard tools.
Benefits and Rewards

    We offer a highly competitive rewards and benefits package, including:

  • Highly competitive pay.
  • Employee Shares Purchase Plan (ESPP).
  • Season ticket travel loan.
  • Pension Scheme.
  • Private medical insurance for you & family including eye care.
  • Employee assistance programme, life cover and a gym subsidy.
  • 25 days annual leave + family leave and long service sabbaticals.
Office Features

    In our hyper-modern and vibrant office space we provide breakfasts, snacks, fresh fruit, soft drinks, good tea and decent coffee all day long.

    The dress code is whatever you're comfortable in. We provide MacBooks, iPhones, multi-screen setups and any other tech you need. We also offer a Pet Projects innovation competition, patent support and expertise, study assistance, social and charity events, volunteering time off and an in-house Uni.

    Check us out here too

    StackOverflow : https : / / / jobs / companies / mimecast

    Mimecast Engineering blog : https : / / / mimecast-engineering

    Our YouTube Channel : https : / / / user / mimecast

    LinkedIn Life page : https : / / / company / mimecast / life?trk nav type life
