Senior AI Inference Engineer
2 weeks ago
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior AI Inference Engineer in Latin America.
In this role, you will lead the design and deployment of advanced AI inference systems for high-profile clients in Media, Entertainment, Gaming, and Sports. You will be responsible for translating complex, ambiguous business problems into robust, real-time AI architectures capable of interpreting and reasoning about video and multi-modal content. Working across the full project lifecycle—from early discovery and pre-sales to architecture, implementation, and optimization—you will partner with technical teams and clients to deliver scalable, high-performance solutions on modern GPU and cloud infrastructure. This position requires hands-on expertise, innovation, and the ability to communicate complex technical concepts clearly to diverse stakeholders.
Accountabilities
- Architect, implement, and optimize end-to-end AI inference services and agentic pipelines using Python.
- Design autonomous AI agents that can interpret, reason about, and act on video and multi-modal inputs.
- Integrate Vision Language Models (e.g., GPT-4o, Gemini Pro Vision, LLaVA) into production-grade workflows.
- Utilize LLM/agent orchestration frameworks (LangGraph, AutoGen, Semantic Kernel, etc.) to manage complex visual AI tasks.
- Deploy and operate AI services on Kubernetes or similar platforms, ensuring reliability and scalability under heavy workloads.
- Architect distributed systems on AWS, balancing performance, cost, and resilience.
- Optimize workloads for modern NVIDIA GPU architectures (Ampere, Hopper, Blackwell) focusing on real-time, high-throughput media applications.
- Produce clear architecture diagrams and technical documentation for both technical and non-technical audiences.
- Provide technical leadership and guidance to project teams to ensure fidelity to architectural designs and solution goals.
- (Optional) Work with video tooling such as FFmpeg, GStreamer, NVENC/NVDEC, and modern codecs, or deploy AI to edge/hybrid environments.
Requirements
- Extensive professional experience designing and shipping AI/ML systems in production, with strong Python expertise.
- Proven track record of taking AI/ML models from prototype to robust, low-latency inference services.
- Hands-on experience building agentic systems, especially with computer vision or multi-modal inputs.
- Familiarity with Vision Language Model integration and orchestration frameworks for multi-modal tasks.
- Strong practical experience with Kubernetes and cloud-native distributed architectures (AWS preferred).
- Knowledge of modern NVIDIA GPU architectures and optimization techniques.
- Product-oriented mindset: able to align technical solutions with business outcomes and ROI.
- Excellent communication skills for collaborating with technical teams, clients, and C-level stakeholders.
- Self-starter, able to work independently in ambiguous or rapidly evolving environments.
- Nice-to-have: experience with FFmpeg, GStreamer, NVENC/NVDEC, OpenShift, NVIDIA Holoscan, Mojo, or AI deployment on edge/hybrid/on-prem environments.
Benefits
- Competitive compensation package.
- Fully remote work within North or South America.
- Exposure to high-impact projects with leading global clients in Media, Entertainment, Gaming, and Sports.
- Opportunity to work with cutting-edge AI technologies and modern GPU/cloud infrastructure.
- Professional growth through complex, real-world problem solving.
- Inclusive and diverse work environment.
Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.
When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
It compares your profile to the job's core requirements and past success factors to determine your match score.
Based on this analysis, we automatically shortlist the three candidates with the highest match to the role.
When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias — focusing solely on your fit for the role. Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.
Thank you for your interest.