Back to Jobs

Senior Deep Learning Engineer

Remote, USA Full-time Posted 2025-11-24
A company is looking for a Senior DL Algorithms Engineer - Inference Performance. Key Responsibilities • Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs) • Contribute new features, fix bugs, and deliver production code to TRT-LLM, NVIDIA's open-source inference serving library • Profile and analyze bottlenecks across the full inference stack to enhance inference performance Required Qualifications • PhD in CS, EE or CSEE or equivalent experience • 5+ years of experience • Strong background in deep learning and neural networks, particularly inference • Experience with performance profiling, analysis, and optimization for GPU-based applications • Proficient in C++, PyTorch or equivalent frameworks Apply tot his job Apply To this Job

Similar Jobs