Machine Learning Compiler Engineer, Annapurna Labs

14 hours ago


Torino, Italy · Amazon · Full-time

The AWS Neuron Compiler team is actively seeking skilled compiler engineers to join our efforts in developing a state-of-the-art deep learning compiler stack. This stack is designed to optimize application models across diverse domains, including Large Language and Vision, originating from leading frameworks such as PyTorch, TensorFlow, and JAX. Your role will involve working closely with our custom-built Machine Learning accelerators, including Inferentia/Trainium, which represent the forefront of AWS innovation for advanced ML capabilities, powering solutions like Generative AI.

As part of the Backend team, you will play a significant role in designing and developing various aspects of our system, including instruction scheduling, memory allocation, data transfer optimization, graph partitioning, parallel programming, code generation, Instruction Set Architectures, new hardware bring-up, and hardware-software co-design.

Key Job Responsibilities

  • Solve challenging technical problems, often ones not solved before, at every layer of the stack.
  • Design, implement, test, deploy and maintain innovative software solutions to transform service performance, durability, cost, and security.
  • Research implementations that deliver the best possible experiences for customers.

A Day in the Life

  • Build high-impact solutions to deliver to our large customer base.
  • Participate in design discussions, code review, and communicate with internal and external stakeholders.
  • Work cross-functionally to help drive business decisions with your technical input.
  • Work in a startup-like development environment, where you're always working on the most important stuff.

Basic Qualifications

  • B.S. or M.S. in computer science or related field
  • Proficiency with 1 or more of the following programming languages: C++ (preferred), Python
  • 3+ years of non-internship professional software development experience
  • 2+ years of experience developing compiler optimization, graph-theory, hardware bring-up, FPGA placement and routing algorithms, or hardware resource management

Preferred Qualifications

  • M.S. or Ph.D. in computer science or related field
  • Strong knowledge in one or more of the areas of: compiler design, instruction scheduling, memory allocation, data transfer optimization, graph partitioning, parallel programming, code generation, Instruction Set Architectures, new hardware bring-up, and hardware-software co-design
  • Experience with LLVM and/or MLIR
  • Experience with developing algorithms for simulation tools
  • Experience with TensorFlow, PyTorch, and/or JAX
  • Experience in LLM, Vision or other deep-learning models

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.



  • Torino, Italy · Amazon · Full-time

    The AWS Neuron Compiler team is actively seeking skilled compiler engineers to join our efforts in developing a state-of-the-art deep learning compiler stack. This stack is designed to optimize application models across diverse domains, including Large Language and Vision, originating from leading frameworks such as PyTorch, TensorFlow, and JAX. Your role...


  • Torino, Italy · Amazon · Full-time

    Overview The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops (TFLOPS) of...


  • Torino, Italy · Amazon · Full-time

    Overview The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best‑in‑class ML inference performance at the lowest cost in cloud. Trainium will deliver the best‑in‑class ML training performance with the most teraflops...


  • Sant'Ambrogio di Torino, Italy · Amazon · Full-time

    The AWS Neuron Compiler team is actively seeking skilled compiler engineers to join our efforts in developing a state-of-the-art deep learning compiler stack. This stack is designed to optimize application models across diverse domains, including Large Language and Vision, originating from leading frameworks such as PyTorch, TensorFlow, and JAX. Your role...


  • Sant'Ambrogio di Torino, Italy · Amazon · Full-time

    Overview The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best‑in‑class ML inference performance at the lowest cost in cloud. Trainium will deliver the best‑in‑class ML training performance with the most teraflops...


  • Sant'Ambrogio di Torino (TO), Italy · Amazon · Full-time

    Overview The Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best‐in‐class ML inference performance at the lowest cost in cloud. Trainium will deliver the best‐in‐class ML training performance with the most teraflops...