Monitoring Engineer / Cloud DevOps

з/п не указана

Требуемый опыт работы: 3–6 лет

Полная занятость, сменный график

We are looking for a Monitoring Engineer / DevOps to join our Customer Solutions department and be a part of the GridGain Nebula project (database as a service).

Who we are

GridGain Systems is a major contributor to the Apache Ignite™ — a TOP-5 Apache SF project, an open-source distributed database, in-memory computing platform which is used in companies all over the world for fast and fault-tolerant access to their data.

Among GridGain’s users there are such companies as Microsoft, IBM, American Express, Barclays, CITI, ING, American Airlines, UPS, DreamWorks, Sberbank, MTS, Huawei and many others.

What is the GridGain Nebula project

GridGain Nebula is a Managed Services offering for Apache Ignite and GridGain. It helps our customer's organization to focus on developing applications based on GridGain or Apache Ignite without the need for an internal software engineering team to manage their in-memory computing / distributed database environment. The GridGain Nebula team applies our best practices and tools backed by decades of experience to maximize the availability of your in-memory computing applications.

What you will do

  • Work as a part of the L2/L3 support team for AWS cloud infrastructure.

  • Monitor health of GridGain clusters running there.

  • Respond to customers (always engineers) in a timely and effective manner.

  • Constantly improve monitoring and deployment machinery.

What we expect

  • Be capable of SSHing into Linux hosts running a distributed database in order to collect troubleshooting information.

  • Be proficient in the analysis of metrics coming from a distributed application.

  • Have an English level good enough to prepare technically and grammatically correct replies to customers (Intermediate or above).

Nice to have (a successful candidate would need at least “some” of those)

  • Understanding of what Java GC logs are, experience with Java tooling (JMX, JFR etc).

  • Database monitoring and/or administration experience, could be combined with ELK, Grafana, Prometheus, Zabbix.

  • AWS (EC2, EKS) experience (or similar public clouds: Azure, Google Cloud, Yandex.Cloud).

  • Basic understanding of distributed systems design and operation principles: how networks affect performance and stability, what could happen when a subset of servers is down etc.

What is cool about this job

  • Working in a team of very skilled engineers on a bleeding-edge product — the opportunity to learn from highly professional colleagues and intensive self-development are guaranteed.

  • Being a part of the open-source community: you can participate in the development and discussions, become an Apache Software Foundation committer or try yourself as a speaker.

  • Benefits package: stock options, medical insurance, English classes, gym membership compensation.

  • We encourage our engineers to speak on conferences, webinars and meetups or write blogs about Apache Ignite™ and GridGain.

  • Work shifts schedule with a hybrid work mode: from office and remotely.

  • Friendly atmosphere, team work and corporate events. We welcome the comprehensive development, love of music and other types of arts.

Welcome to GridGain Team!

Ключевые навыки

Английский — B1 — Средний
Linux
database monitoring
troubleshooting
distributed systems

Вакансия опубликована 3 декабря 2021 в Москве

Похожие вакансии