Site Reliability Engineer

Location

Remote

Headquarters

Vilnius, Lithuania

Deadline

May 1, 2024 at 8:00:00 AM

Salary

€2200+

Job Type

Full-time

Why Gegidze?

Gegidze is a digital agency with a Georgian character, built to develop and grow great brands. We help them innovate and outperform in the modern world.

Since the agency’s foundation in 2017, we have successfully completed more than 80 projects, earned $20M+ for our customers, and built remote teams of 200+ developers, marketers, and designers in Georgia for European startups and SMBs. With 5 locations in Tbilisi, Berlin, Dublin, Warsaw, and Tallinn, we work with great passion every day to inspire our customers and solve problems in marketing, design, tech, and HR.

About the project

Currently, we are looking for a top Site Reliability Engineer (SRE) for our key client, a fintech company based in Vilnius, Lithuania. They are an EU-licensed e-money institution that provides fast, convenient, and affordable financial services globally. Their services range from a payment gateway for e-shops and a finance management app to money transfers worldwide. With over one million app installs and growing, our client aims to push the boundaries and become an industry-leading super app that provides financial and lifestyle services across the globe. Their 500-person international team is spread across 15 cities worldwide.
In this role, you will join our client's team to ensure the availability, performance, and security of its IT infrastructure. Collaborating with development teams and system administrators, you'll guide the design and deployment of applications to meet our client's reliability standards.

Your duties

As a Site Reliability Engineer (SRE), you will be responsible for:

- Defining and supporting Service Level Indicators (SLIs) and Objectives (SLOs) for existing critical components within their mixed on-premises environment
- Making informed decisions on database cluster optimization and web services configuration, and introducing improvements in these areas
- Enhancing the instrumentation and efficiency of daily operations tasks performed by the operations teams
- Driving improvements in change and release processes, transitioning from unregulated CI/CD practices to a more structured change management framework that fosters reliable CI/CD processes
- Operating in an incident-prone environment, working proactively to reduce the frequency and impact of critical incidents
- Taking part in incident management, contributing to the development of a common operations knowledge base, maintenance operation procedures (MOPs), runbooks, and enhancing monitoring and observability
- Collaborating closely with operations and development teams to enhance the reliability of our infrastructure and software, through education and shared best practices
- Documenting and categorizing knowledge effectively, and training team members to ensure continuity and efficiency of operations
- Communicating effectively with team members and stakeholders, ensuring clear and concise information exchange
- Participating in on-call rotations
- Performing routine daily tasks with ChatGPT or a similar tool to enhance efficiency and productivity

Requirements

- At least 3 years of experience in Site Reliability Engineering, System Administration, Incident management, or a closely related field
- Bachelor’s/Master’s degree in Computer Science, Engineering, or a related field
- Demonstrated experience in designing and managing the reliability of large-scale systems
- Strong proficiency in monitoring tools and methodologies: ELK, Grafana, New Relic, Datadog, and Zabbix
- Strong experience with containerization technologies such as Docker and Kubernetes
- Familiarity with modern infrastructure technologies and deployment processes
- Strong problem-solving skills with a proactive approach to issue resolution
- Excellent communication skills, with the ability to explain complex technical issues to non-technical stakeholders
- Proven experience with AI tools such as ChatGPT, and the ability to integrate them seamlessly into daily tasks
- Upper-intermediate (B2) English is a must

Nice to have:

- Advanced knowledge of PHP (Symfony)
- Skill in using Doctrine ORM for database management
- Experience with interfacing applications with Redis, RabbitMQ, Elasticsearch, and Sentry
- Experience in setting up and maintaining high-availability systems using tools like Keepalived, Heartbeat, Corosync, and Pacemaker
- Understanding of static and dynamic routing protocols
- Familiarity with all layers of the OSI network model
- Proficiency in managing Nginx and PHP-FPM setups
- Experience managing MariaDB clusters, using MaxScale for database proxying
- Strong understanding of cluster management for Redis (with Sentinel), RabbitMQ, and Elasticsearch
- Experience with ClickHouse for analytics and data warehousing
- Knowledge of infrastructure as code (IaC) and tools like Ansible and Helm
- Proficiency in continuous integration and deployment using GitLab CI/CD
- Expertise in setting up and using monitoring tools like Zabbix, Grafana, Prometheus, and InfluxDB
- Experience with application performance monitoring (APM), alerting, and log management tools like New Relic, PagerDuty, and Graylog
- Understanding of different software architectures including monolithic, service-oriented, and microservice architectures

Benefits

Join us

If that sounds just like you, simply send your CV to talent@gegidze.com or press the “Apply Now” button.

Our hiring process:

After you hit the “Apply Now” button and upload your resume, our HR team will review your profile.
If the skills and experience in your resume match the requirements, you will have:
1. A quick introduction call with our HR team
2. A technical and soft skills interview with the client
3. An introduction call with the end client

After receiving positive feedback from the client, we will send you the job offer.
We wish you good luck and hope to see you on our incredible team of top digital talents!