Sr SDE, AGI Inference - GenAI
3 days ago
Job ID: | Amazon.com Services LLC

The Sensory Inference team at AGI is a group of innovative developers working on ground-breaking multi-modal inference solutions that revolutionize how AI systems perceive and interact with the world. We push the limits of inference performance to provide the best possible experience for our users across a wide range of applications and devices.

We are looking for talented, passionate, and dedicated Inference Engineers to join our team and build innovative, mission-critical, high-volume production systems that will shape the future of AI. This role offers the chance to work in a highly technical domain at the boundary between fundamental AI research and production engineering, on inference-efficiency topics such as quantization, speculative decoding, and long context.

Key Job Responsibilities
Develop high-performance inference software for a diverse set of neural models, typically in C/C++
Design, prototype, and evaluate new inference engines and optimization techniques
Participate in deep-dive analysis and profiling of production code
Optimize inference performance across various platforms (on-device, cloud-based CPU, GPU, proprietary ASICs)
Collaborate closely with research scientists to bring next-generation neural models to life
Partner with internal and external hardware teams to maximize platform utilization
Work in an Agile environment to deliver high-quality software against tight schedules
Hold a high bar for technical excellence within the team and across the organization

Basic Qualifications
5+ years of non-internship professional software development experience
5+ years of programming experience with at least one software programming language
5+ years of experience leading design or architecture (design patterns, reliability and scaling) of new and existing systems
Experience as a mentor, tech lead or leading an engineering team
Experience with inference frameworks such as PyTorch, TensorFlow, ONNXRuntime, TensorRT, LLaMA.cpp, etc.

Preferred Qualifications
5+ years of experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations
Experience with inference frameworks such as PyTorch, TensorFlow, ONNXRuntime, TensorRT, LLaMA.cpp
Proficiency in performance optimization for CPU, GPU, or AI hardware
Proficiency in kernel programming for accelerated hardware using programming models such as CUDA, OpenMP, OpenCL, Vulkan, and Metal
Experience with latency-sensitive optimizations and real-time inference
Knowledge of model compression techniques (quantization, pruning, distillation, etc.)
Experience with LLM efficiency techniques like speculative decoding and long context

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for supplemental life plans, EAP, mental health support, medical advice line, flexible spending accounts, adoption and surrogacy reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at
CA, Sunnyvale - 193,************,****** USD annually
-
Montà, Italy | Amazon | Full-time
Principal SDE, ML | Reinforcement Learning, AGI Foundations
Job ID: | Amazon.com Services LLC - A57
The Artificial General Intelligence (AGI) Foundations team is looking for a passionate, talented, and inventive Principal ML Engineer with a strong machine learning background to lead the development of industry-leading technology. As a Principal Software...
-
Software Development Engineer, AGI Customization
2 weeks ago
Montà, Italy | Amazon | Full-time
Software Development Engineer, AGI Customization
Job ID: | Amazon.com Services LLC
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive Software Development Engineer with a strong machine learning background to build customization capabilities such as fine-tuning, distillation, model evaluation, prompt...
-
Senior Software Development Engineer
2 weeks ago
Montà, Italy | Amazon | Full-time
Senior Software Development Engineer - AI/ML, AWS Neuron, Multimodal Inference
Job ID: | Amazon.com Services LLC
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK,...
-
Montà, Italy | Amazon | Full-time
Principal Applied Scientist, AGI Foundations
Job ID: ******* | Amazon.com Services LLC
As a Principal Scientist within the Artificial General Intelligence (AGI) organization, you are a trusted part of the technical leadership. You bring business and industry context to science and technology decisions, set the standard for scientific excellence, and make...
-
Sr. Software Dev. Engineer/MLE, AGI Customization
2 weeks ago
Montà, Italy | Amazon | Full-time
Sr. Software Dev. Engineer/MLE, AGI Customization
Job ID: | Amazon.com Services LLC
The Artificial General Intelligence (AGI) team is looking for a passionate, talented, and inventive ML Engineer with a strong machine learning background to build customization capabilities such as fine-tuning and distillation. As an ML engineer with the AGI team, you will be...
-
Montà, Italy | Amazon | Full-time
Job ID: ******* | Amazon.com Services LLC
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for...
-
Sr Data Associate with French, AGI-Data Services
2 weeks ago
Montà, Italy | Amazon | Full-time
Sr Data Associate with French, AGI-Data Services
AI is the most transformational technology of our time, capable of countering some of humanity's most challenging problems. Amazon is investing in generative AI and the responsible development and deployment of large language models (LLMs) across all of our businesses. Come build the future of...
-
Sr Data Associate with French, AGI-Data Services
3 days ago
Montà, Italy | Amazon | Full-time
Sr Data Associate with French, AGI-Data Services
AI is the most transformational technology of our time, capable of countering some of humanity's most challenging problems. Amazon is investing in generative AI and the responsible development and deployment of large language models (LLMs) across all of our businesses. Come build the future of...
-
Sr. Applied Scientist, AGI Foundations
2 days ago
Montà, Italy | Amazon | Full-time
Job ID: | Amazon.com Services LLC
The Artificial General Intelligence (AGI) team is looking for a highly skilled and experienced Sr. Applied Scientist to support the development and implementation of state-of-the-art algorithms and models for supervised fine-tuning and reinforcement learning through human feedback and complex reasoning; with a focus across...
-
Sr. SDE, Delivery Experience
2 weeks ago
Montà, Italy | Amazon | Full-time
Are you interested in helping Amazon make history and redefine the meaning of 'fast' in eCommerce? We are the Same Day Delivery Experience team and are in the early innings of reinventing the Amazon shopping experience to make Amazon the first place customers think to shop when they need something today, wherever they are in the world. We're looking for a...