vLLM + SGLang
NVIDIA · Deep Learning Inference / Remote
Full-time · Lead
Vibes
Open Source · OSS-first company
About the Role
Hi everyone — I’m Akbar, Senior Manager of Deep Learning Inference Software at NVIDIA. I lead our engineering efforts around vLLM and SGLang, two of the most widely used open-source LLM inference frameworks.
We’re building teams focused on making LLM inference faster, more efficient, and more reliable at scale — from runtime and scheduling optimizations to kernel fusion, distributed serving, and continuous integration across new GPU architectures (Hopper, Blackwell, etc.).
We’re hiring for multiple roles:
• Senior Deep Learning Software Engineer, Inference (https://nvidia.wd5.myworkdayjobs.com/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Senior-Deep-Learning-Software-Engineer--Inference_JR2003655)
• Engineering Manager, Deep Learning Inference (https://nvidia.wd5.myworkdayjobs.com/NVIDIAExternalCareerSite/job/US-CA-Santa-Clara/Engineering-Manager--Deep-Learning-Inference_JR2006335)
• DL Performance Software Engineer - LLM Inference (https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/DL-Performance-Software-Engineer---LLM-Inference_JR2002786-1)
• DL Performance Software Engineer - LLM Inference (https://nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAExternalCareerSite/job/DL-Performance-Software-Engineer---LLM-Inference_JR2004804)
These roles are remote-friendly (North America preferred) and fully focused on upstream open-source development — working directly with the maintainers and the wider AI community.
If you’re excited about large-scale inference, compiler/runtime performance, and pushing GPUs to their limits, we’d love to talk.