Site Reliability Engineer II

4 giorni fa

WorkFromHome, Italia Agile Lab A tempo pieno

Agile Lab is a company founded in 2014 with the mission to create value for its customers in data‑intensive environments through customisable solutions that establish performance‑driven processes, sustainable architectures and automated platforms based on data governance best practices. Having delivered over 100 successful Elite Data Engineering initiatives, we have used this experience to create Witboost: a modular, technology‑agnostic platform that enables modern organisations to discover, value and produce their data in both traditional environments and fully compliant Data Mesh architectures. With a highly skilled team of over 260 data engineers based in Europe, Agile Lab helps organisations with their data‑driven transformation. Take a look at our handbook to discover our core values and processes. Opportunity We are looking for a Site Reliability Engineer II (SRE II) to join our growing team. You will play a key role in maintaining the reliability, observability, and operational efficiency of enterprise‑level distributed systems. In this role, you’ll coordinate a small technical team (3–4 people) in managing microservices in complex production environments. You will be involved in monitoring, incident management, release coordination, and performance tuning, with a strong focus on OpenShift platforms. You’ll also work closely with multiple cross‑functional teams to ensure high availability and performance of our cloud‑native services. This role includes on‑call availability. Salary: 38.5K-48.5K Responsibilities Ensure high reliability of microservices running in OpenShift environments Lead and coordinate a technical team of 3–4 engineers for operational excellence Manage incident resolution and ticketing workflows via ServiceNow Collaborate with development teams to drive performance optimization and tuning Design, configure and maintain monitoring dashboards (Grafana, Prometheus, etc.) Coordinate with Service Control Room to maintain effective alerting and response Oversee release processes of new features, hotfixes, and updates in production Requirements Degree in Computer Engineering, Computer Science, or a related field Proven experience in Application Maintenance Services (AMS): minimum 2 years In‑depth knowledge of OpenShift and microservices in cloud‑native environments Ability to technically and operationally lead a team of 3–4 people Experience in release management, monitoring, and incident resolution Excellent communication and cross‑functional coordination skills Strong initiative, operational autonomy, and results‑oriented mindset Fluency in Italian (mandatory requirement) Monitoring & Observability: Grafana, Prometheus, Kibana, Jaeger, Datadog, OpenTelemetry Cloud/DevOps: OpenShift, GitLab, Jenkins Data & Messaging: Kafka, MongoDB, Ignite Ticketing & ITSM: ServiceNow Benefits Full Remote or hybrid working in our offices: Milan, Turin, Padua, Bologna, Catania and Rende Real work life balance Training monthly budget (time and money) Support of a buddy in the first week of work Benefits and corporate welfare programs: company prizes and welcome pack with all the equipment you need to work Agile Nomads Experience: opportunity to work for 2 weeks abroad Referral bonus, if you bring people as talented as you The opportunity to attend one conference per year A company rated 4.8 out of 5 for employee satisfaction on Glassdoor and certified as a Great Place to Work Inclusive environment where you can be who you really are Stimulating environment oriented to growth, both professional and personal How we work We don't like hierarchies: we work as a team We don't like bureaucracies, we prefer sense of responsibility We like data, certainly, so anything that is measurable We want to make a positive change in our industry Empathy, humility, collaboration, and willingness to challenge ourselves are the basis of our work Please note Only candidates based in European time zones (CEST or similar) will be considered for this position. #J-18808-Ljbffr

Site Reliability Engineer

4 settimane fa

WorkFromHome, Italia Blackfluo.ai A tempo pieno

About the job Site Reliability Engineer (SRE) Job Description Location: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English required We are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the...
Senior Site Reliability

4 settimane fa

WorkFromHome, Italia Canonical A tempo pieno

Senior Site Reliability / Gitops Engineer Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Senior Site Reliability / Gitops Engineer 1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features....
Senior Site Reliability Engineer

7 giorni fa

WorkFromHome, Italia Canonical A tempo pieno

Senior Site Reliability Engineer Join Canonical’s leading open source software and operating systems platform as a Senior Site Reliability Engineer. Overview Canonical is a global provider of open source software and the platform for AI, IoT and the cloud. Our team runs hundreds of private cloud, Kubernetes and application clusters for customers across the...
Senior Site Reliability Engineer

4 settimane fa

WorkFromHome, Italia Canonical A tempo pieno

Overview Join to apply for the Senior Site Reliability Engineer role at Canonical . Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT....
Senior Site Reliability

4 settimane fa

WorkFromHome, Italia Canonical A tempo pieno

Senior Site Reliability / Gitops Engineer Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Senior Site Reliability / Gitops Engineer 1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Get AI-powered advice on this job and more exclusive features....
Senior SRE II

4 giorni fa

WorkFromHome, Italia Agile Lab A tempo pieno

A technology firm in Milan is seeking a Site Reliability Engineer II (SRE II) to maintain the reliability and efficiency of distributed systems. The role involves leading a small technical team to manage microservices and ensure high performance on cloud-native services, especially with OpenShift. Candidates should have a degree in a related field and at...
Remote Site Reliability Engineer — Build High-Availability Systems

4 settimane fa

WorkFromHome, Italia Immobiliare.It A tempo pieno

Una compagnia tecnologica per il settore immobiliare cerca un Site Reliability Engineer per garantire l'efficienza dei progetti e monitorare l'infrastruttura. È richiesta esperienza nella gestione di sistemi Linux, troubleshooting avanzato, e automazione tramite strumenti come Terraform e Ansible. Offriamo un ambiente dinamico per sviluppatori appassionati...
Remote Site Reliability Engineer — Build High-Availability Systems

3 settimane fa

WorkFromHome, Italia Immobiliare.It A tempo pieno

Un'azienda tecnologica leader cerca un Site Reliability Engineer per collaborare con diversi team e garantire l'affidabilità dei sistemi. Il candidato ideale ha esperienza nella gestione di sistemi GNU/Linux, abilità nel troubleshooting e nella scrittura di script. Offriamo opportunità di crescita continua e un ambiente innovativo. È possibile lavorare...
Site Reliability Engineer

4 giorni fa

WorkFromHome, Italia Canonical A tempo pieno

Overview Join to apply for the Site Reliability Engineer role at Canonical . Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our...
Remote Site Reliability Engineer — Build High-Availability Systems

3 settimane fa

WorkFromHome, Italia Immobiliare.It A tempo pieno

Un'azienda digitale italiana, leader nel mercato immobiliare, ricerca un Site Reliability Engineer per garantire l'efficienza e l'affidabilità dei progetti. Il candidato ideale ha esperienza nella gestione di sistemi GNU/Linux e nella progettazione di soluzioni di automazione. È richiesta competenza in Infrastructure as Code e sistemi di monitoring....

Americhe

Europa

Asia / Oceania

Africa

Site Reliability Engineer II