Senior Deep Learning Software Engineer, Inference

1 settimana fa


Lazio, Italia Altro A tempo pieno

Senior Deep Learning Software Engineer, Inference Join to apply for theSenior Deep Learning Software Engineer, Inferencerole atNVIDIANVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team.As a key contributor, you will design, build, and optimize GPU-accelerated software that powers today's most sophisticated AI applications.Our team develops and maintains high-performance deep learning frameworks, including SGLang and vLLM, at the forefront of efficient large-scale model serving and inference.You will improve these platforms, facilitate deployment and serving of groundbreaking language models, and implement the latest algorithms for public release in frameworks like SGLang, vLLM, and other DL frameworks.What You'll Be DoingPerformance optimization, analysis, and tuning of DL models in domains like LLM, multimodal, and generative AI.Scale performance of DL models across different NVIDIA accelerators.Contribute features and code to NVIDIA's inference libraries, vLLM, SGLang, FlashInfer, and LLM software solutions.Collaborate with cross-framework teams across NVIDIA libraries and inference optimization solutions.What We Need To SeeMasters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI).5+ years of relevant software development experience.Excellent C/C++ programming and software design skills.SW Agile skills are helpful and Python experience is a plus.Prior experience with training, deploying or optimizing the inference of DL models in production is a plus.Prior background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus.Ways To Stand Out From The CrowdContribute to Deep Learning Software projects, such as PyTorch, vLLM, and SGLang to drive advancements in the field.Experience with Multi-GPU Communications (NCCL, NVSHMEM).Experience building and shipping products to enterprise customers.GPU programming experience (CUDA, OAI TRITON or CUTLASS).NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization.As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.For Poland: The base salary range is 213,750 PLN - 370,500 PLN for Level 3, and 281,250 PLN - 487,500 PLN for Level 4.JR*******Seniority levelMid-Senior levelEmployment typeFull-timeJob functionComputer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing#J-*****-Ljbffr



  • Lazio, Italia Datapizza A tempo pieno

    Senior Machine Learning Engineer (Ral fino a 55k)Humans.tech, realtà tech specializzata nello sviluppo di soluzioni digitali per realtà del calibro di Airbnb e Amazon, è alla ricerca di un Senior Machine Learning Engineer per il team AI.Modalità di lavoro: Full Remote - 1 giorni di presenza al mese.Tecnologie: Python e di framework di deep learning come...


  • Lazio, Italia Amazon A tempo pieno

    Software Development Engineer - AI/ML, AWS Neuron, Multimodal InferenceThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The AWS Neuron SDK, developed by the Annapurna Labs team at...


  • Lazio, Italia Akamai A tempo pieno

    Overview Do you thrive on solving complex technical challenges in AI infrastructure?Are you ready to architect the future of AI at the edge?Join the Akamai Inference Cloud TeamThe Akamai Inference Cloud team is part of Akamai's Cloud Technology Group.We build AI platforms for efficient, compliant, and high-performing applications.These platforms support...


  • Lazio, Italia Akamai A tempo pieno

    OverviewDo you thrive on solving complex technical challenges in AI infrastructure?Are you ready to architect the future of AI at the edge?Join the Akamai Inference Cloud TeamThe Akamai Inference Cloud team is part of Akamai's Cloud Technology Group.We build AI platforms for efficient, compliant, and high-performing applications.These platforms support...


  • Lazio, Italia Datapizza A tempo pieno

    Senior Machine Learning Engineer [Ral fino a 55k] Humans.tech, realtà tech specializzata nello sviluppo di soluzioni digitali per realtà del calibro di Airbnb e Amazon, è alla ricerca di un Senior Machine Learning Engineer per il team AI.Modalità di lavoro: Full Remote - 1 giorni di presenza al mese.Tecnologie: Python e di framework di deep learning come...


  • Lazio, Italia Altro A tempo pieno

    Senior Machine Learning Engineer (Ral fino a 55k)Humans.tech, realtà tech specializzata nello sviluppo di soluzioni digitali per realtà del calibro di Airbnb e Amazon, è alla ricerca di un Senior Machine Learning Engineer per il team AI.Modalità di lavoro: Full Remote - 1 giorni di presenza al mese.Tecnologie: Python e di framework di deep learning come...


  • Lazio, Italia Datapizza A tempo pieno

    Senior Machine Learning Engineer [Ral fino a 55k] Humans.tech, realtà tech specializzata nello sviluppo di soluzioni digitali per realtà del calibro di Airbnb e Amazon, è alla ricerca di un Senior Machine Learning Engineer per il team AI.?? Modalità di lavoro: Full Remote - 1 giorni di presenza al mese.?? Tecnologie: Python e di framework di deep...


  • Lazio, Italia Selefor A tempo pieno

    Chi siamo:Selefor è leader nella consulenza HR e nella trasformazione digitale. Promuoviamo l'innovazione attraverso progetti ad alto contenuto tecnologico e sosteniamo la crescita di aziende dinamiche e di professionisti/e. Per una realtà di eccellenza su Roma, cerchiamo una/unSPEECH AI Engineer – ASR & TTS Specialistesperta/o in tecnologie Vocali,...


  • Lazio, Italia Tas A tempo pieno

    TAS SpA, multinazionale specializzata in soluzioni software per la monetica, i pagamenti, i mercati finanziari e i sistemi per l'Extended Enterprise, ricerca per la propria sede di Roma un/a Senior AI Software Engineer con esperienza nello sviluppo e nell'integrazione di soluzioni di Intelligenza Artificiale end-to-end.Compiti e responsabilitàLa persona...

  • Machine Learning Engineer

    1 settimana fa


    Lazio, Italia Itconsulting Srl A tempo pieno

    Itconsulting, società di consulenza informatica, è alla ricerca di un MACHINE LEARNING ENGINEER con almeno 2 anni di esperienza .Le attività saranno svolte in full remote e prevedono l'utilizzo di AWS .Competenze richieste:PythonGitSagemaker: JupyterLab, Workflows, ML Pipelines, Training jobs, Inference endpointsLambdaApi GatewayS3N.B. Per superare la...