Senior Site Reliability Engineer
3 settimane fa
Location LATAM, ERUOPE CloudDevs works with fast-moving, venture-backed startups across the US. We’re building a pool of world-class Site Reliability Engineers for current roles and for upcoming opportunities. You will either be placed directly into one of our partner startups or added to our vetted SRE network for future projects. This role is ideal for engineers who care about reliability, metrics, performance, and building simple, scalable systems. If you enjoy designing for scale and improving how teams ship software, you’ll fit right in. Key Responsibilities Work as a hands-on engineer focused on system reliability, performance, and observability. Define and track SLIs, SLOs, and error budgets. Optimize monitoring cost and signal quality across metrics, logs, and traces. Improve deployment safety, canary rollouts, and UAT pipelines. Build tools for automated and local performance testing and track benchmarks. Lead resilience work like failover drills, chaos tests, and redundancy checks. Partner with engineering teams to improve scaling patterns and architecture as the product grows. Support incident response processes and help reduce operational noise. Write clean, maintainable code in Go, Python, or Node.js. Contribute to CI/CD improvements and automation efforts. Collaborate with engineers across teams to raise reliability standards. Requirements 5+ years in SRE, DevOps, or Platform Engineering roles. Strong experience with cloud infrastructure (AWS preferred), Terraform, and Kubernetes. Deep knowledge of observability tools like DataDog, Prometheus, or OpenTelemetry. Strong debugging skills across services, networking, and data layers. Hands‑on experience designing and monitoring SLIs/SLOs. Experience with CI/CD tools such as GitHub Actions, Jenkins, or ArgoCD. Ability to write production‑grade code in Go, Python, or Node.js. Comfort working independently in fast‑paced environments. Nice to Have Experience tuning observability costs and optimizing data ingestion. Exposure to chaos engineering and progressive deployments. Background with high‑throughput or latency‑sensitive systems. AWS at scale (EKS, Lambda, DynamoDB, S3). Experience in regulated industries like fintech, payments, or SOC2 environments. Performance testing pipelines or load‑testing automation. Experience handling systems processing tens of millions of API calls. Open Pool for SREs Even if you don’t meet every requirement or aren’t a fit for the current role, strong SRE with real production experience are welcome to join our talent pool. We regularly place engineers with different strengths across reliability, DevOps, platform, observability, backend, and infrastructure engineering. #J-18808-Ljbffr
-
Site Reliability Engineer
1 settimana fa
Italia Reply A tempo pieno 40.000 € - 80.000 € all'anoIl mondo del Cloud è la tua passione? Ti piacerebbe diventare un esperto di Cloud Computing, DevOps e Automation all'interno di un team che affronta ogni giorno nuove sfide? In Cloud9, startup del gruppo Reply, stiamo ricercando un Site Reliability Engineer per supportare i nostri Clienti nella gestione ed evoluzione di architetture Hybrid & Multicloud di...
-
Senior Site Reliability Engineer
1 settimana fa
Italia Ahold Delhaize A tempo pieno 80.000 € - 120.000 € all'anoAhold Delhaize USA, a division of global food retailer Ahold Delhaize, is part of the U.S. family of brands, which includes five leading omnichannel grocery brands – Food Lion, Giant Food, The GIANT Company, Hannaford and Stop & Shop. Our associates support the brands with a wide range of services, including Finance, Legal, Sustainability, Commercial,...
-
Senior Site Reliability Engineer
3 settimane fa
Italia Remotely A tempo pienoLocation LATAM, ERUOPE CloudDevs works with fast-moving, venture-backed startups across the US. We're building a pool of world-class Site Reliability Engineers for current roles and for upcoming opportunities. You will either be placed directly into one of our partner startups or added to our vetted SRE network for future projects. This role is ideal for...
-
Principal Site Reliability Engineer
1 settimana fa
Italia Ahold Delhaize A tempo pieno 146.960 € - 220.440 € all'anoAhold Delhaize USA, a division of global food retailer Ahold Delhaize, is part of the U.S. family of brands, which includes five leading omnichannel grocery brands – Food Lion, Giant Food, The GIANT Company, Hannaford and Stop & Shop. Our associates support the brands with a wide range of services, including Finance, Legal, Sustainability, Commercial,...
-
Site Reliability Engineer
3 settimane fa
Italia Immobiliare.it A tempo pienoImmobiliare.it S.p.A. è un gruppo italiano composto da società specializzate in servizi Digital Tech per la compravendita e l’affitto di immobili, rivolti a privati, professionisti del real estate, istituti bancari e operatori del settore finanziario. Fondata nel 2005 Immobiliare.it, il portale immobiliare N.1 in Italia, ha ampliato la propria offerta...
-
Senior SRE: Scale Reliability
3 settimane fa
Italia Remotely A tempo pienoA global startup recruitment firm is seeking experienced Site Reliability Engineers to enhance system reliability and performance. Ideal candidates have 5+ years in SRE roles and expertise in cloud infrastructure and observability tools. This position offers the chance to work across various startups and influence the reliability standards of the tech...
-
Site reliability engineer
2 settimane fa
Italia Meridionale Immobiliare.it A tempo pienoImmobiliare.it S.p. A. è un gruppo italiano composto da società specializzate in servizi Digital Tech per la compravendita e l'affitto di immobili, rivolti a privati, professionisti del real estate, istituti bancari e operatori del settore finanziario. Fondata nel 2005 Immobiliare.it, il portale immobiliare N.1 in Italia, ha ampliato la propria offerta con...
-
Site Reliability Engineer
17 ore fa
Italia Meridionale Immobiliare.it A tempo pienoImmobiliare.it S.p.A. è un gruppo italiano composto da società specializzate in servizi Digital Tech per la compravendita e l'affitto di immobili, rivolti a privati, professionisti del real estate, istituti bancari e operatori del settore finanziario. Fondata nel 2005 Immobiliare.it, il portale immobiliare N.1 in Italia, ha ampliato la propria offerta con...
-
Site reliability engineer
2 settimane fa
Italia Immobiliare.it A tempo pienoImmobiliare.it S.p. A. è un gruppo italiano composto da società specializzate in servizi Digital Tech per la compravendita e l'affitto di immobili, rivolti a privati, professionisti del real estate, istituti bancari e operatori del settore finanziario. (…)Immobiliare.it Insights, la proptech della società, offre servizi digitali di advisory, insights e...
-
Principal Site Reliability Engineer
3 settimane fa
Italia SaaS Industry A tempo pienoPrincipal Site Reliability Engineer - Azure Red Hat OpenShift in Madrid or RemoteThe Red Hat Site Reliability Engineering (SRE) team is looking for a Principal Site Reliability Engineer to join us. In this role, you will develop, scale, and operate our OpenShift managed cloud services. OpenShift is Red Hat’s enterprise Kubernetes distribution. As an SRE...