Senior Deep Learning Software Engineer, Inference

1 giorno fa


Rome, Italia Altro A tempo pieno

Senior Deep Learning Software Engineer, InferenceJoin to apply for the Senior Deep Learning Software Engineer, Inference role at NVIDIANVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you will design, build, and optimize GPU‑accelerated software that powers today’s most sophisticated AI applications.Our team develops and maintains high‑performance deep learning frameworks, including SGLang and vLLM, at the forefront of efficient large‑scale model serving and inference. You will improve these platforms, facilitate deployment and serving of groundbreaking language models, and implement the latest algorithms for public release in frameworks like SGLang, vLLM, and other DL frameworks.What You’ll Be DoingPerformance optimization, analysis, and tuning of DL models in domains like LLM, multimodal, and generative AI.Scale performance of DL models across different NVIDIA accelerators.Contribute features and code to NVIDIA’s inference libraries, vLLM, SGLang, FlashInfer, and LLM software solutions.Collaborate with cross‑framework teams across NVIDIA libraries and inference optimization solutions.What We Need To SeeMasters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI).5+ years of relevant software development experience.Excellent C/C++ programming and software design skills. SW Agile skills are helpful and Python experience is a plus.Prior experience with training, deploying or optimizing the inference of DL models in production is a plus.Prior background with performance modeling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus.Ways To Stand Out From The CrowdContribute to Deep Learning Software projects, such as PyTorch, vLLM, and SGLang to drive advancements in the field.Experience with Multi‑GPU Communications (NCCL, NVSHMEM).Experience building and shipping products to enterprise customers.GPU programming experience (CUDA, OAI TRITON or CUTLASS).NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High‑Performance Computing, and Visualization. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. For Poland: The base salary range is 213,750 PLN - 370,500 PLN for Level 3, and 281,250 PLN - 487,500 PLN for Level 4.JR Seniority levelMid‑Senior levelEmployment typeFull‑timeJob functionComputer Hardware Manufacturing, Software Development, and Computers and Electronics Manufacturing#J-18808-Ljbffr



  • Rome, Italia Altro A tempo pieno

    Senior Machine Learning Engineer (Ral fino a 55k)Humans.tech, realtà tech specializzata nello sviluppo di soluzioni digitali per realtà del calibro di Airbnb e Amazon, è alla ricerca di un Senior Machine Learning Engineer per il team AI.Modalità di lavoro: Full Remote - 1 giorni di presenza al mese.Tecnologie: Python e di framework di deep learning come...


  • Rome, Italia Selefor A tempo pieno

    Chi siamo:Selefor è leader nella consulenza HR e nella trasformazione digitale. Promuoviamo l’innovazione attraverso progetti ad alto contenuto tecnologico e sosteniamo la crescita di aziende dinamiche e di professionisti/e. Per una realtà di eccellenza su Roma, cerchiamo una/un SPEECH AI Engineer – ASR & TTS Specialist esperta/o in tecnologie Vocali,...


  • Rome, Italia HCLSoftware A tempo pieno

    Role Description: This is a full-time hybrid role for a Senior Mainframe Software Engineer located in Rome/Milan. The role involves developing, maintaining, and supporting mainframe software products, collaborating with a team of engineers, and ensuring the reliability and efficiency of mainframe systems. This role will mean becoming part of a global...


  • Rome, Italia Altro A tempo pieno

    Junior Deep Learning Scientist (On-site)2 days ago Be among the first 25 applicantsAbout TranslatedTranslated is on a mission to allow everyone to understand and be understood, in their own language. We are a technology-powered professional translation provider. We partner with over 200 000 professional translators worldwide, in 200 languages. Our 310 000...


  • Rome, Italia Settore immobiliare A tempo pieno

    A technology company is seeking a Senior/Principal Engineer to lead the development of their Revenue AI Platform. You will design deep integrations, build AI-driven workflows, and support a rapidly growing customer base. Ideal candidates have 5+ years in software development, expert-level skills in React and Node.js, and experience in SaaS environments. This...


  • Greater Rome Metropolitan Area, Italia Proxima Group A tempo pieno

    Proximaè un gruppo di aziende con skill funzionali e tecnologiche innovative, che mettono a fattor comune la propria esperienza in un'offerta congiunta di sviluppo di soluzioni all'interno della nostra Software Factory. Offriamo consulenza e servizi di Application Management e Quality Assurance.Chi cerchiamo?Machine Learning Engineer– AWS(3 anni di...


  • Rome, Italia ALLSIDES A tempo pieno

    2 days ago Be among the first 25 applicants ALLSIDES is a Deep-Tech startup from the European Alpine region with offices in Italy and New York, that secured significant funding in early 2025 (announced under our former name Covision Media) and is experiencing rapid growth. We're redefining how the world experiences 3D content. By combining physically...


  • Rome, Italia Settore immobiliare A tempo pieno

    About WeflowOur mission is to help revenue teams run a repeatable, efficient, and predictable revenue process - instead of wasting time with non-selling activities, internal meetings, or cleaning CRM data.With the rise of AI, we are creating a system that auto-captures & attributes data, provides insights, and run automations & alerts to drive sales...


  • Rome, Italia Translated A tempo pieno

    About Translated Translated is on a mission to allow everyone to understand and be understood, in their own language. We are a technology-powered professional translation provider. We partner with over 200 000 professional translators worldwide, in 200 languages. Our 310 000 clients range from the private person who needs their CV translated to the very big,...

  • Machine Learning Engineer

    3 settimane fa


    Rome, Italia Whatjobs A tempo pieno

    About the Role As a Machine Learning Engineer at MotorK, you’ll be at the center of how data becomes product. You’ll architect, train, and deploy machine learning systems that power predictive maintenance, automated classification, and next‑gen marketing experiences for the automotive world. You’ll shape strategy, own systems end‑to‑end, and see...