Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training

2 giorni fa


Turin, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

Sr. Software Engineer- AI/ML, AWS Neuron Distributed TrainingAnnapurna Labs designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.AWS Neuron is the complete software stack for the AWS Trainium (Trn1/Trn2) and Inferentia (Inf1/Inf2) cloud‑scale Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distributed Training team for AWS Neuron, responsible for development, enablement, and performance tuning of a wide variety of ML model families, including large‑scale LLMs such as GPT and Llama, as well as Stable Diffusion and Vision Transformers (ViT).The ML Distributed Training team works side by side with chip architects, compiler engineers, and runtime engineers to create, build, and tune distributed training solutions with Trainium instances. Experience training these large models using Python is a must. FSDP (Fully‑Sharded Data Parallel), Deepspeed, Nemo, and other distributed training libraries are central to this work; extending them for the Neuron‑based system is key.Key job responsibilitiesLead efforts to build distributed training support into PyTorch and JAX using XLA, the Neuron compiler, and runtime stacks.Optimize models to achieve peak performance and maximize efficiency on AWS custom silicon, including Trainium and Inferentia, as well as Trn1, Trn2, Inf1, and Inf2 servers.Apply strong software development skills, deep dive into complex problems, and work effectively with cross‑functional teams.Build a solid foundation in Machine Learning to deliver high‑quality solutions.About the teamAnnapurna Labs was a startup company acquired by AWS in 2015 and is now fully integrated. The team operates across silicon engineering, hardware design and verification, software, and operations, supporting AWS Neuron, Inferentia, and Trainium ML Accelerators. We foster a collaborative environment with mentorship, thorough code reviews, and opportunities for career growth.Basic QualificationsBachelor’s degree in computer science or equivalent.5+ years of non‑internship professional software development experience.5+ years of programming with at least one software programming language experience.5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems.5+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations.Experience as a mentor, tech lead, or leading an engineering team.Experience in machine learning, data mining, information retrieval, statistics, or natural language processing.Preferred QualificationsMaster’s degree in computer science or equivalent.Experience in computer architecture.Previous software engineering expertise with PyTorch, Jax/TensorFlow, distributed libraries and frameworks, and end‑to‑end model training.Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.#J-18808-Ljbffr



  • Turin, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to innovative infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it possible.AWS Neuron is the SDK that optimizes the performance of complex neural net models executed on AWS Inferentia...


  • Turin, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    About Amazon Annapurna Labs Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware...


  • Turin, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    About Amazon Annapurna LabsAmazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware...

  • ML Engineer AWS

    15 ore fa


    Turin, Italia Altro A tempo pieno

    Proxima è un gruppo di aziende con skill funzionali e tecnologiche innovative, che mettono a fattor comune la propria esperienza in un’offerta congiunta di sviluppo di soluzioni all’interno della nostra Software Factory. Offriamo consulenza e servizi di Application Management e Quality Assurance.Chi cerchiamo? Data Scientist / ML Engineer AWS Dove e...


  • Turin, Piemonte, Italia Canonical - Jobs A tempo pieno

    We are seeking a software engineer with a passion for building and validating resilient distributed systems. At Canonical you can build a career and drive the success of those leveraging Canonical's Ubuntu and Juju to build multi-cloud deployable cloud applications.We see quality engineering as a first class engineering practice and are looking for people...

  • Remote ML Engineer

    2 giorni fa


    Turin, Italia Altro A tempo pieno

    Un gruppo di aziende innovative cerca un Data Scientist / ML Engineer specializzato in AWS. Sei responsabile dello sviluppo di modelli di machine learning e ottimizzazione, collaborando con team multidisciplinari. Offriamo una RAL adeguata alla seniority, buoni pasto, assicurazione sanitaria e programmi di formazione. È richiesta passione per l'analisi e...

  • AI / ML Engineer

    6 giorni fa


    Turin, Piemonte, Italia AccessiWay A tempo pieno

    AI/ML EngineerYour MissionWe're looking for a passionate AI/ML Engineer to join our growing team and take full ownership of data preparation and model development initiatives.In this role, you'll work at the intersection of Data Engineering and Machine Learning, collaborating closely with Product and Software Development teams.You'll design and optimize data...


  • Turin, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    Sr Software Engineer, Graviton Software, Annapurna LabsThis opportunity is open for Austin, Cupertino and Seattle. The AWS Graviton Software team is seeking Software Engineers to optimize performance for AWS Graviton. Graviton delivers the best price/performance in AWS data centers. For the past 2 years, Graviton has powered the majority of new EC2 capacity...


  • Turin, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    Sr Software Engineer, Graviton Software, Annapurna Labs This opportunity is open for Austin, Cupertino and Seattle. The AWS Graviton Software team is seeking Software Engineers to optimize performance for AWS Graviton. Graviton delivers the best price/performance in AWS data centers. For the past 2 years, Graviton has powered the majority of new EC2 capacity...


  • Turin, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    Overview AWS Utility Computing (UC) provides product innovations … from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support...