Project Overview:

A unique, Sharia-compliant digital banking and wealth proposition in the heart of London. We are not just another digital bank offering seamless banking experience with certainty, security and simplicity but more. Our mission is to help people sustain and grow their wealth for future generations. We do this by solving problems people have instead of just selling boring banking products. We achieve this in a socially responsible way, driven by innovation and backed by a leading financial institution who are in it for the long run.

Role Description:

As a Site Reliability Engineer your primary responsibility is to keep our user and customer facing services running as smoothly as possible.

As part of your role, you will support and deliver several key practices across the organisation covering availability, performance, monitoring, and incident response. You will be experienced in working with and supporting cloud technologies coupled with multiple third-party integrations to ensure that the applications and platforms remains highly available, performant and able to scale as we continue to grow.

The successful candidate will need to have a strong background in infrastructure and software development, enjoy working in a fast paced, hands-on environment, and have extensive experience supporting, monitoring, and improving production systems. You’ll have an enthusiastic, can-do attitude and a willingness to work and communicate collaboratively.

Рекрутерка
Анастасія Штанопруд
Responsibilities:
  • Ensure the stable running of our production environments through the implementation of effective monitoring solutions;
  • Improve our operational processes (i.e. upgrades) to make them as efficient as possible;
  • Measure and optimise system performance, with a desire to drive our observability capabilities forward to support user and customer needs;
  • Provide operational support to our engineering teams dealing with customer incidents;
  • Implement good working practices to support the upkeep of documentation and runbooks across the teams;
  • Gather and analyse metrics across our platforms and applications to assist in performance tuning and fault finding to prevent incidents from occurring;
  • Contribute to our SDLC and look at ways to optimise this to support the reliability of the services we provide to our customers;
  • Conduct post-incident reviews to identify what and where improvements can be made;
  • Create sustainable systems and services through the use of automation.
Requirements:
  • Good working knowledge of AWS, including Lambda, DynamoDB, API Gateway, S3 and WAF;
  • High level of experience using cloud log management and monitoring data platforms (CloudWatch, Sumo Logic, Datadog);
  • Experience working with Infrastructure as Code and Containerisation tools (Terraform, Kubernetes);
  • Good engineering practices covering availability, reliability, and scalability;
  • Familiarity with Gitlab CI/CD to support the deployment of applications across cloud infrastructure;
  • Good working knowledge of JIRA and Confluence;
  • Experience working with both SQL and NoSQL databases;
  • Proficient working with a variety of languages (Python, Go, Typescript);
  • The ability and appetite to learn and use a wide variety of open-source technologies and tools.

Тебе також можуть зацікавити

Чому варто приєднатись до команди INTELLIAS

У нас ти знайдеш доброзичливе середовище та можливості навчатися й зростати щодня.

Можливості релокації в INTELLIAS

Отримуй новий досвід та відкривай нові горизонти, знаходячись лише в декількох годинах подорожі…

Підтримка здоров’я та спорту

Ми докладаємо максимум зусиль, щоб забезпечити комфортні умови для консультантів компанії, та піклуємося…

Як стати частиною команди INTELLIAS

Ми робимо все можливе, щоб спростити та прискорити твій шлях до нашої команди. Будемо раді бачити тебе...