GPU Compiler Expert
2 weeks ago
Leini, Italy
Other
Full-time
Optimizing Neural Network Performance on ROCm GPUs

We're looking for an experienced software developer to optimize OpenAI/Triton performance on ROCm GPUs. The role involves analyzing, developing, testing, and deploying performance improvements for neural networks implemented with Triton on ROCm GPUs. The ideal candidate has strong C/C++ programming skills and experience with compiler internals (LLVM, GCC, or another compiler).

Key responsibilities: