Lead Site Reliability Engineer
6 giorni fa
Job Description Location: Fully remote EU timezone (CET ±2h) Start date: ASAP Languages: Fluent English is mandatory Industry: Cloud Computing We are hiring at Pragmatike to expand our team and drive the growth of our internal projects.Our focus is on developing cutting-edge solutions in Cloud Computing, while fostering a culture of collaboration and innovation.Joining us means being part of a passionate team where your ideas and skills directly contribute to shaping tomorrows technologies.If you're excited about working on ambitious projects in a dynamic and flexible environment, we'd love to hear from youResponsibilities Operate and maintain Linux-based infrastructure (Debian/Ubuntu).Deploy, manage, and scale Kubernetes clusters across bare-metal, virtualized, and on-prem environments.Oversee full cluster lifecycle: upgrades, node pools, networking, storage, and security hardening.Implement automation for provisioning and operations using Ansible, Bash/Python, and GitOps workflows.Design and maintain networking architecture including VLANs, L2/L3 routing, VPNs, and multi-site connectivity.Build automated deployment workflows (PXE boot, Preseed, cloud-init).Deploy and maintain observability stacks (Prometheus/Grafana, Loki, ELK, Graylog).Lead incident response and escalation activities across the platform.Improve system availability and reduce latency at all levels.Define and implement SLOs/SLIs at multiple infrastructure levels (physical network/hardware, platform virtualization, software services).Optimize alerting and monitoring pipelines to provide actionable insights.Establish and maintain on-call schedules to ensure coverage across timezones.Develop Standard Operating Procedures (SOPs) for repeatable operations and maintenance tasks.Coordinate physical maintenance for Policlouds (periodic maintenance, hardware issues, DC-Ops).Manage virtualization and orchestration layers (OpenStack, Proxmox, VMware).Help develop and maintain overall architecture across all products.Plan resources for future initiatives, accounting for demand and growth projections.Work with development teams to improve overall quality and optimize resource utilization.Collaborate with cross-functional stakeholders (Hivenet, Policloud, Customer Success teams).Requirements Expert-level, hands-on experience operating Kubernetes in production environments.Strong network engineering skills (VLANs, L2/L3 routing, VPNs, multi-site connectivity) - this is essential for the role.Strong proficiency with Linux systems administration (Debian/Ubuntu).Solid understanding of networking fundamentals and ability to design complex network architectures.Experience building and maintaining automation workflows (Ansible, Bash/Python, Git-based).Experience with observability stacks such as Prometheus, Grafana, ELK, Loki, or Graylog.Background with virtualization technologies (OpenStack, Proxmox, VMware).Experience with bare-metal provisioning and MAAS (Metal as a Service).Strong understanding of distributed systems and container orchestration.Process-oriented mindset with ability to develop SOPs and operational procedures from scratch.Experience with incident response, escalation procedures, and on-call rotations.Ability to work autonomously in a fast-paced, engineering-driven environment.Strong technical skills combined with alignment to team values.Nice To Have Experience with service mesh (Istio, Linkerd) or advanced CNI implementations.Knowledge of Cloudflare APIs, DNS automation, or tunnel configurations.Experience with GPU infrastructure, node preparation, or resource scheduling.Familiarity with security best practices (RBAC, firewalls, network policies).Exposure to IT asset management or license tracking workflows.Experience working in multi-timezone environments and coordinating across distributed teams.Background establishing reliability practices and SRE frameworks in growing organizations.Why Join Us: 100% remote work with flexible hours High-impact role with autonomy and ownership Collaborative and international engineering team Cutting-edge tech stack with strong focus on reliability and automation.
-
Site Reliability Engineer
3 settimane fa
Milano, Italia Blackfluo.ai A tempo pienoAbout the job Site Reliability Engineer (SRE) Job Description Location: Full remote, EU timezone (CET /- 2 hours) Start Date: As soon as possible Languages: English required We are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring...
-
Site Reliability Engineer
6 giorni fa
Milano, Italia Altro A tempo pienoAbout the job Site Reliability Engineer (SRE)Job Description Location:Full remote, EU timezone (CET +/- 2 hours)Start Date:As soon as possibleLanguages:English requiredWe are looking for a skilledSite Reliability Engineer (SRE)with deep expertise inAWSto help us scale and secure our infrastructure.As an SRE, you will be instrumental in ensuring the...
-
Remote Lead Sre: Cloud Reliability
5 giorni fa
Milano, Italia Pragmatike A tempo pienoA leading tech firm is seeking a Lead Site Reliability Engineer to join a dynamic team fully committed to cloud computing solutions.This role offers the opportunity to operate and maintain complex infrastructures, primarily using Kubernetes and Linux systems, while fostering a culture of innovation and collaboration.You will be responsible for deploying and...
-
Senior Site Reliability
6 giorni fa
Milano, Italia Altro A tempo pienoSenior Site Reliability / Gitops EngineerJoin to apply for theSenior Site Reliability / Gitops Engineerrole atCanonicalSenior Site Reliability / Gitops Engineer1 day ago Be among the first 25 applicantsJoin to apply for theSenior Site Reliability / Gitops Engineerrole atCanonicalGet AI-powered advice on this job and more exclusive features.Canonical is a...
-
Senior Site Reliability Engineer
3 settimane fa
Milano, Italia Canonical A tempo pienoOverview Join to apply for the Senior Site Reliability Engineer role at Canonical . Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT....
-
Site Reliability Engineer
3 settimane fa
Milano, Italia Moltiply Group A tempo pienoMoltiply è leader europeo nell'innovazione dei Servizi in ambito Finanziario. Tecnologia ed eccellenza operativa ci permettono di portare avanti la nostra Mission, ovvero aumentare l'efficienza e migliorare la competitività dei nostri clienti agendo come partner forti e affidabili.Attualmente, per il nostro ufficio IT, siamo alla ricerca di un*Site...
-
Site Reliability Engineer
3 giorni fa
Milano, Italia Altro A tempo pienoMoltiply è leader europeo nell'innovazione dei Servizi in ambito Finanziario.Tecnologia ed eccellenza operativa ci permettono di portare avanti la nostra Mission, ovvero aumentare l'efficienza e migliorare la competitività dei nostri clienti agendo come partner forti e affidabili.Attualmente, per il nostro ufficio IT, siamo alla ricerca di un* Site...
-
Site Reliability Engineer
3 settimane fa
Milano, Italia Moltiply Group A tempo pienoMoltiply è leader europeo nell'innovazione dei Servizi in ambito Finanziario. Tecnologia ed eccellenza operativa ci permettono di portare avanti la nostra Mission, ovvero aumentare l'efficienza e migliorare la competitività dei nostri clienti agendo come partner forti e affidabili.Attualmente, per il nostro ufficio IT, siamo alla ricerca di un*Site...
-
Site Reliability Engineer
13 ore fa
Milano, Italia Moltiply Group A tempo pienoInMoltiplyaffrontiamo e trasformiamo i processi più complessi dei nostri clienti -dal customer care alla digitalizzazione -unendo tecnologie avanzate e il talento di oltre ***** professionisti in Italia e nel mondo.La nostra missione è aiutare le aziende amoltiplicareil proprio valore, ridisegnando esemplificandomodelli operativi con l'obiettivo di...
-
Site Reliability Engineer
2 giorni fa
Milano, Lombardia, Italia Moltiply Group A tempo pienoMoltiply èleader europeonell'innovazione dei Servizi in ambito Finanziario. Tecnologia ed eccellenza operativa ci permettono di portare avanti la nostra Mission, ovvero aumentare l'efficienza e migliorare la competitività dei nostri clienti agendo come partner forti e affidabili.Attualmente, per il nostro ufficio IT, siamo alla ricerca di un*Site...