ML Compiler Engineer II

4 settimane fa


Asti, Italia Amazon A tempo pieno

The AWS Neuron Compiler team is seeking skilled compiler engineers to develop a state-of-the-art deep learning compiler stack. This stack optimizes application models across diverse domains, including Large Language and Vision, originating from leading frameworks such as PyTorch, TensorFlow, and JAX. Your role will involve working closely with our custom-built Machine Learning accelerators, including Inferentia/Trainium, which represent the forefront of AWS innovation for advanced ML capabilities, powering solutions like Generative AI. Key Job Responsibilities Develop and maintain tooling for best-in-class technology for raising the bar of the Neuron Compiler's accuracy and reliability. Help lead the efforts building fuzzers and specification synthesis tooling for our LLVM-based compiler. Work in a team with a science focus, and strive to push what we do to the edge of what is known, to best deliver our customers. Strong software development skills using C++/Python are critical to this role. A science background in compiler development is strongly preferred. A background in Machine Learning and AI accelerators is preferred, but not required. Basic Qualifications 3+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience 2+ years of experience in developing compiler features and optimizations Proficiency in C++ and Python programming, applied to compiler or verification projects Familiarity with LLVM, including knowledge of abstract interpretation and polyhedral domains Demonstrated scientific approach to software engineering problems Preferred Qualifications Masters degree or PhD in computer science or equivalent Experience with deep learning frameworks like TensorFlow or PyTorch Understanding of large language model (LLM) training processes Knowledge of CUDA programming for GPU acceleration Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, please visit for more information. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. #J-18808-Ljbffr


  • ML Compiler Engineer II

    4 settimane fa


    Asti, Italia Amazon A tempo pieno

    The AWS Neuron Compiler team is seeking skilled compiler engineers to develop a state-of-the-art deep learning compiler stack. This stack optimizes application models across diverse domains, including Large Language and Vision, originating from leading frameworks such as PyTorch, TensorFlow, and JAX. Your role will involve working closely with our...


  • Asti, Italia Amazon A tempo pieno

    A leading technology company in Italy is seeking an Applied Scientist II to enhance causal modeling solutions in advertising effectiveness. This role involves partnership with cross-disciplinary teams to design and implement large-scale causal ML models. The ideal candidate should have a strong background in causal inference, programming, and machine...


  • Asti, Italia Amazon A tempo pieno

    A leading technology company in Italy is seeking an Applied Scientist II to enhance causal modeling solutions in advertising effectiveness. This role involves partnership with cross-disciplinary teams to design and implement large-scale causal ML models. The ideal candidate should have a strong background in causal inference, programming, and machine...


  • Asti, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud‑scale machine learning accelerators and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron.This role is responsible for development, enablement and performance tuning of a wide...


  • Asti, Italia Amazon A tempo pieno

    AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud‑scale machine learning accelerators and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is responsible for development, enablement and performance tuning of a wide...


  • Asti, Italia Amazon A tempo pieno

    AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud‑scale machine learning accelerators and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is responsible for development, enablement and performance tuning of a wide...

  • Sr. Machine Learning

    1 settimana fa


    Asti, Italia Vendita Al Dettaglio E All'Ingrosso Import-Export A tempo pieno

    The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of compute...


  • Asti (AT), Italia Amazon A tempo pieno

    A leading technology company in Italy is seeking an Applied Scientist II to enhance causal modeling solutions in advertising effectiveness. This role involves partnership with cross-disciplinary teams to design and implement large-scale causal ML models. The ideal candidate should have a strong background in causal inference, programming, and machine...

  • Sr. Machine Learning

    4 settimane fa


    Asti, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of compute...


  • Asti, Italia Vendita al dettaglio e all'ingrosso Import-export A tempo pieno

    Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training - Performance Optimization AWS Utility Computing (UC) provides product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Platform, and Productivity...