Sr. Software Engineer - AI/ML, AWS Neuron Distributed Training

12 hours ago


Lazio, Italy | Amazon | Full-time

Annapurna Labs designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago, even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world.

AWS Neuron is the complete software stack for AWS Trainium (Trn1/Trn2) and Inferentia (Inf1/Inf2), our cloud-scale Machine Learning accelerators. This role is for a Senior Machine Learning Engineer in the Distributed Training team for AWS Neuron, responsible for development, enablement and performance tuning of a wide variety of ML model families, including massive-scale Large Language Models (LLMs) such as GPT and Llama, as well as Stable Diffusion, Vision Transformers (ViT) and many more.

The ML Distributed Training team works side by side with chip architects, compiler engineers and runtime engineers to create, build and tune distributed training solutions with Trainium instances. Experience with training these large models using Python is a must. FSDP (Fully Sharded Data Parallel), DeepSpeed, NeMo and other distributed training libraries are central to this work, and extending them to Neuron-based systems is key.

Key job responsibilities
  • Build distributed training support into PyTorch and JAX using XLA, the Neuron compiler, and runtime stacks.
  • Optimize models to achieve peak performance and maximize efficiency on AWS custom silicon, including Trainium and Inferentia, as well as Trn2, Trn1, Inf1, and Inf2 servers.
  • Apply strong software development skills, deep-dive abilities, cross-functional team collaboration, and a solid foundation in Machine Learning.

About the team
Annapurna Labs was a startup company acquired by AWS in ****, and is now fully integrated. If AWS is an infrastructure company, then think of Annapurna Labs as the infrastructure provider of AWS. Our org covers multiple disciplines including silicon engineering, hardware design and verification, software, and operations. AWS Nitro, ENA, EFA, Graviton and F1 EC2 Instances, AWS Neuron, Inferentia and Trainium ML Accelerators, and scalable NVMe storage are some of the products we have delivered over the last few years.

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help you develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Basic Qualifications
  • Bachelor's degree in computer science or equivalent
  • 5+ years of non-internship professional software development experience
  • 5+ years of programming experience with at least one software programming language
  • 5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
  • 5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
  • Experience as a mentor, tech lead or leading an engineering team
  • Experience in machine learning, data mining, information retrieval, statistics or natural language processing

Preferred Qualifications
  • Master's degree in computer science or equivalent
  • Experience in computer architecture
  • Previous software engineering expertise with PyTorch/JAX/TensorFlow, distributed libraries and frameworks, and end-to-end model training

Compensation and Benefits
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $151,300/year in our lowest geographic market up to $261,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit . This position will remain posted until filled. Applicants should apply via our internal or external career site.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.



  • Lazio, Italy | Amazon | Full-time

    A leading tech company is hiring a Senior Software Engineer specialized in AI/ML for its AWS Neuron team in Torino. The ideal candidate will have over 5 years of software development experience, strong skills in Python, and a solid background in machine learning. Responsibilities include developing distributed training solutions and optimizing models for...


  • Lazio, Italy | Retail and Wholesale Import-Export | Full-time

    Sr. Software Engineer - AI/ML, AWS Neuron Distributed Training. Annapurna Labs designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago, even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that...


  • Lazio, Italy | Retail and Wholesale Import-Export | Full-time

    A leading cloud services provider is seeking a Senior Software Engineer specializing in AI/ML to join their AWS Neuron team. This role involves developing and optimizing distributed training solutions aimed at enhancing performance for large-scale machine learning models. The ideal candidate will have over 5 years of software development experience, expertise...


  • Lazio, Italy | Retail and Wholesale Import-Export | Full-time

    Software Development Engineer AI/ML, Inference Serving, AWS Neuron. AWS Neuron is the software stack powering AWS Inferentia and Trainium machine learning accelerators, designed to deliver high-performance, low-cost inference at scale. The Neuron Serving team develops infrastructure to serve modern machine learning models, including large language models...


  • Lazio, Italy | Amazon | Full-time

    Software Development Manager – ML Compiler, AWS Neuron, Annapurna Labs. Our product: AWS Machine Learning accelerators are at the forefront of AWS innovation. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in the cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops of compute...

  • Sr. Machine Learning

    2 weeks ago


    Lazio, Italy | Retail and Wholesale Import-Export | Full-time

    Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and enable the building of Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost. Trainium offers unparalleled ML training performance with the most teraflops (TFLOPS) of compute power in the cloud. The AWS Neuron Software...


  • Lazio, Italy | Amazon | Full-time

    ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs. The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing...


  • Lazio, Italy | Retail and Wholesale Import-Export | Full-time

    Sr. Software Development Manager, Generative AI for AWS Neuron. Job ID: ******* | Amazon Development Center U.S., Inc. You will join a dynamic team building and applying AI agents to simplify and accelerate customer adoption of Neuron, the software stack supporting AWS's Machine Learning silicon: Trainium and Inferentia. You will work with external and internal...

  • ML Engineer AWS

    3 hours ago


    Lazio, Italy | Other | Full-time

    ML Engineer AWS – Proxima Group. Join to apply for the ML Engineer AWS role at Proxima Group. Location: Full Remote. Proxima is a group of companies with innovative functional and technological skills that pool their experience into a joint offering of solution development within our Software Factory. We offer consulting and...

  • ML Engineer AWS

    5 days ago


    Lazio, Italy | Proxima Group | Full-time

    ML Engineer AWS – Proxima Group. Join to apply for the ML Engineer AWS role at Proxima Group. Location: Full Remote. Proxima is a group of companies with innovative functional and technological skills that pool their experience into a joint offering of solution development within our Software Factory. We offer consulting and...