Ml Kernel Performance Engineer, Aws Neuron, Annapurna Labs

4 giorni fa


Torino, Italia Amazon A tempo pieno

OverviewThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The Acceleration Kernel Library team focuses on maximizing performance for AWS's custom ML accelerators.Engineers here craft high-performance kernels for ML functions, combining deep hardware knowledge with ML expertise to push the boundaries of AI acceleration.The AWS Neuron SDK includes an ML compiler, runtime, and application framework that integrates with popular ML frameworks like PyTorch to deliver high performance for inference and training on AWS accelerators.This role sits at the intersection of machine learning, high-performance computing, and distributed architectures, helping shape the future of AI acceleration technology.This is an opportunity to work on cutting-edge products, architect and implement business-critical features, publish cutting-edge research, and mentor a team of engineers in a fast, innovative culture.The team collaborates closely with customers on model enablement, providing direct support and optimization expertise to ensure workloads run optimally on AWS ML accelerators.Explore the product and our history: ; /; ; Key job responsibilitiesDesign and implement high-performance compute kernels for ML operations, leveraging the Neuron architecture and programming modelsAnalyze and optimize kernel-level performance across multiple generations of Neuron hardwareConduct detailed performance analysis using profiling tools to identify and resolve bottlenecksImplement compiler optimizations such as fusion, sharding, tiling, and schedulingWork directly with customers to enable and optimize their ML models on AWS acceleratorsCollaborate across teams to develop innovative kernel optimization techniquesA day in the lifeBuild high-impact solutions to deliver to our large customer baseParticipate in design discussions, code review, and communicate with internal and external stakeholdersWork cross-functionally to help drive business decisions with your technical inputWork in a startup-like development environment, focusing on the most important tasksAbout the team1) Diverse Experiences: AWS values diverse experiences.We encourage candidates to apply even if you do not meet all listed qualifications.2) Why AWS: AWS is the world's most comprehensive and broadly adopted cloud platform.We pioneered cloud computing and continue to innovate.3) Inclusive Team Culture: We embrace differences and maintain an inclusive culture with multiple affinity groups and ongoing learning experiences, guided by AWS Leadership Principles.4) Work/Life Balance: We value balance and offer flexible working hours.5) Mentorship & Career Growth: We support new members with mentorship and project assignments that foster growth.Bottom line: Our inclusive culture empowers Amazonians to deliver the best results for customers.If you need accommodations during the application process, visit for more information.Basic Qualifications- 3+ years of non-internship professional software development experience- 2+ years of non-internship design or architecture experience (design patterns, reliability and scaling) of new and existing systems- Experience programming with at least one software programming languagePreferred Qualifications- 3+ years of full SDLC experience including coding standards, code reviews, source control, build processes, testing, and operations- Bachelor's degree in computer science or equivalentAmazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.Los Angeles County applicants: job duties include working safely, communicating effectively, and following laws and policies.Criminal history may affect employment opportunities.We will consider qualified applicants with arrest and conviction records per the Los Angeles County Fair Chance Ordinance.Our inclusive culture empowers Amazonians to deliver the best results for customers.If you need a workplace accommodation or adjustment during the application process, please visit for more information.If the country/region you're applying in isn't listed, contact your Recruiting Partner.Our compensation reflects labor costs across US markets.The base pay for this position ranges from $129,300/year to $223,600/year, depending on location and experience.Amazon is a total compensation company with potential equity, sign-on, and other benefits.For more information, visit .This position will remain posted until filled.Applicants should apply via our internal or external career site.Posted:January 24, **** (Updated about 6 hours ago)Share this jobImportant FAQs for current Government employeesBefore proceeding, please review the following FAQsAmazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.#J-*****-Ljbffr



  • torino, Italia Amazon A tempo pieno

    ML Kernel Performance Engineer, AWS Neuron, Annapurna LabsThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing...


  • torino, Italia Amazon A tempo pieno

    ML Kernel Performance Engineer, AWS Neuron, Annapurna LabsThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing...


  • Torino, Italia Amazon A tempo pieno

    ML Kernel Performance Engineer, AWS Neuron, Annapurna LabsThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The Acceleration Kernel Library team is at the forefront of maximizing...


  • Torino, Italia Amazon A tempo pieno

    Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team focuses on maximizing performance for AWS's custom ML accelerators. Engineers here...


  • Torino, Italia Amazon A tempo pieno

    OverviewThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team focuses on maximizing performance for AWS's custom ML accelerators. Engineers here...


  • Torino, Italia Amazon A tempo pieno

    Overview The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team focuses on maximizing performance for AWS's custom ML accelerators. Engineers here...


  • Sant'Ambrogio di Torino, Italia Amazon A tempo pieno

    OverviewThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team focuses on maximizing performance for AWS's custom ML accelerators. Engineers here...


  • torino, Italia Amazon A tempo pieno

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...


  • Sant'Ambrogio di Torino, Italia Amazon A tempo pieno

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium.The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...


  • sant'ambrogio di torino, Italia Amazon A tempo pieno

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium.The Product: The AWS Machine Learning accelerators (Inferentia/Trainium) offer unparalleled ML inference and training...