Senior Deep Learning Engineer
A company is looking for a Senior DL Algorithms Engineer - Inference Performance.
Key Responsibilities
• Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs)
• Contribute new features, fix bugs, and deliver production code to TensorRT-LLM (TRT-LLM), NVIDIA's open-source inference serving library
• Profile and analyze bottlenecks across the full inference stack to improve performance
Required Qualifications
• PhD in Computer Science (CS), Electrical Engineering (EE), or CSEE, or equivalent experience
• 5+ years of experience
• Strong background in deep learning and neural networks, particularly inference
• Experience with performance profiling, analysis, and optimization for GPU-based applications
• Proficiency in C++ and in PyTorch or equivalent frameworks