Senior GPU Kernel Developer
4 settimane fa
Project description Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team. We are seeking an experienced individual proficient in GPGPU applications to join our team. The primary responsibility of this role will be to lead the effort in optimizing HIP kernels on AMD GPUs. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable. Responsibilities The main task will be to help optimize HIP kernels for specific AMD hardware. Collaborate with development teams to optimize and enhance GPU-accelerated applications. Debug, profile, and fine-tune code for performance improvements. Stay updated with the latest advancements in GPU architectures and programming models. Skills Must have Proficiency with C++ and low-level programming (at least C++ 17). Proficiency in CUDA or HIP / ROCm programming. Solid understanding of GPU architectures, parallel programming models, and optimization techniques. Strong problem-solving skills and the ability to work in a collaborative environment. Experience with AI/ML/DL/NN/NLP/Computer Vision. Python. Nice to have Linux. CPU Intrinsics (AVX/SSE). GPU Assembler. Profiling. gdb/LLDB. Jinja2 or similar templating engines. #J-18808-Ljbffr
-
Senior GPU Kernel Developer
4 giorni fa
torino, Italia Luxoft A tempo pienoProject description Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team. We are seeking an experienced individual proficient in GPGPU applications to join our team. The primary responsibility of this role will be to lead the effort in optimizing HIP kernels on AMD GPUs. The...
-
GPU Kernel Performance Lead
3 settimane fa
torino, Italia Luxoft A tempo pienoA leading tech company in Italy seeks an experienced developer with GPU compute skills to lead the optimization of HIP kernels on AMD GPUs. This role involves close collaboration with development teams, debugging, and enhancing GPU-accelerated applications. The ideal candidate should have strong proficiency in C++ and CUDA/HIP, with a solid understanding of...
-
GPU Kernel Performance Lead
2 settimane fa
torino, Italia Luxoft A tempo pienoA leading tech company in Italy seeks an experienced developer with GPU compute skills to lead the optimization of HIP kernels on AMD GPUs. This role involves close collaboration with development teams, debugging, and enhancing GPU-accelerated applications. The ideal candidate should have strong proficiency in C++ and CUDA/HIP, with a solid understanding of...
-
ML Kernel Performance Engineer — Neuron Acceleration
3 settimane fa
Torino, Italia Amazon A tempo pienoA leading cloud services provider is seeking an experienced ML Kernel Performance Engineer to join their Annapurna Labs team. This role focuses on designing and implementing high-performance compute kernels for machine learning operations. Ideal candidates should have 3+ years in software development, expertise in programming languages, and experience with...
-
Team Lead Gpu and Ai Testing Specialist
2 settimane fa
Torino, Italia Luxoft A tempo pieno**Project description**: Contribute to the testing and validation of AI libraries and GPU kernels. This role focuses on developing and automating robust testing frameworks, ensuring high-quality performance and reliability for AI operations. **Responsibilities**: Develop and execute unit tests using Google Tests Automate testing workflows and manage CI/CD...
-
Gpu Communications Architect
3 giorni fa
Torino, Italia Luxoft A tempo pieno**Project description**: The ROCm Communication Collectives Library (RCCL) is a stand-alone library that provides multi-GPU and multi-node collective communication primitives optimized for AMD GPUs. It uses PCIe and xGMI high-speed interconnects. **Responsibilities**: Provide deep technical leadership and guidance for GPU communication technologies, define...
-
Torino, Italia Qualcomm A tempo pienoA technology company located in Torino, Italy is looking for a Senior Embedded Linux Developer to join their Hardware & Firmware team. The role involves designing and developing the Linux software stack for embedded devices. Applicants should have a strong background in C/C++, Linux kernel components, and experience with embedded Linux systems. A competitive...
-
Senior Embedded Linux Developer
2 settimane fa
Torino, Italia Arduino A tempo pienoArduino is now a Qualcomm company!Arduino's mission is to enable people to enhance their lives through accessible open-source electronics and digital technologies. Since ****, millions of people, from kids and students to engineers and professionals around the world are using Arduino to innovate in the fields of music, games and toys, smart homes, farming,...
-
Senior Embedded Linux Developer
3 giorni fa
Torino, Italia Arduino A tempo pienoArduino is now a Qualcomm company!Arduino's mission is to enable people to enhance their lives through accessible open-source electronics and digital technologies. Since ****, millions of people, from kids and students to engineers and professionals around the world are using Arduino to innovate in the fields of music, games and toys, smart homes, farming,...
-
ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs
3 settimane fa
Torino, Italia Amazon A tempo pienoML Kernel Performance Engineer, AWS Neuron, Annapurna Labs The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing...