Site Reliability Engineer
Infiniti Research Ltd
London, England, GB
2d ago

Experience : 3 5 years Key Skills : Linux Admin, Multi Cloud (AWS , Azure, GCP etc.), Scripting Language e.g. : Shell, Perl, Python etc.

Roles & Responsibilities :

  • Develop automation tools and framework to automate operational tasks, deployment of code, applications, services and machines which are distributed across multiple cloud and data center environments
  • Build metrics and define SLOs / SLIs in collaboration with product management / engineering teams to improve customer experience
  • Manage multiple product stack environments with a cross-functional integrity to the engineering functions
  • Identify and drive opportunities to improve automation for code deployment, management and visibility of application services
  • Directs root cause analysis of critical business and production issues
  • Represent SRE in design reviews and work cross-functionally with Engineering teams on operational readiness
  • Document and manage software platform matrix and operational processes
  • Requirement

  • 3+ years of web deployment experience handling Site-Reliability / Production part of IT operations / DevOps.
  • Hands-on experience in deploying software in Private clouds and Public clouds (AWS, Azure, GCP, Linode, Digital Ocean, VMware, Proxmox)
  • Experience with setting up a whole network infrastructure, configuring and troubleshooting networking issues.
  • Understanding of Networking (load balancers, Autoscaling, etc).
  • Hands-on experience in Linux / Unix environment and scripting languages : Shell, Perl, Python, etc.
  • Hands-on experience with Databases Administration(SQL / NoSQL) : MSSQL, MySQL , PostgreSQL, Cassandra, MongoDB
  • Hands on experience with Messaging system deployment and maintenance(Kafka+Zookeeper, MQTT, RabbitMQ, ActiveMQ)
  • Must have worked with security standards for Web Applications and Services
  • Experience in monitoring tools like Nagios, ELK, Grafana, Prometheus, StackDriver, Zabbix, PagerDuty.
  • Strong experience in setup and managing Test (Pre-Production) and Production environments
  • Strong communications, analytical and troubleshooting skills
  • Ability to adapt to time critical deadlines and changing priorities in fast pace environment
  • Desired

  • Experience with containerization and containerized deployment orchestration(Kubernetes-GKE,AKS) using Terraform, Ansible, Chef, Puppet
  • Strong experience with Apache / Tomcat / JBoss based web applications and services (SOAP, REST, etc.)
  • Strong experience in implementing continuous integration using Maven, Jenkins, Bamboo, Azure Devops, etc.
  • Strong experience in GIT and parallel development , branching strategies and methodologies
  • Strong in writing deployment automation / orchestration scripts
  • Interested applicants can share their resumes to recruitment

    Add to favorites
    Remove from favorites
    My Email
    By clicking on "Continue", I give neuvoo consent to process my data and to send me email alerts, as detailed in neuvoo's Privacy Policy . I may withdraw my consent or unsubscribe at any time.
    Application form