NVIDIA is looking for a Senior System Software Engineer to join a multifaceted software team. The role involves developing tools for AI researchers and SW/HW teams running AI workloads on GPU clusters.
Requirements
- BS or higher in Computer Science or a related field (or equivalent experience) and 5+ years of software development experience
- Strong software skills in design, coding (C++ and Python), analysis, and debugging
- Good understanding of deep learning frameworks such as PyTorch and TensorFlow, and of distributed training and inference
- Knowledge of GPU cluster job scheduling (Slurm or Kubernetes), storage and networking
- Experience with NVIDIA GPUs, CUDA programming, and NCCL
- Motivated self-starter with strong problem-solving skills and customer-facing communication skills
- Passion for continuous learning and the ability to work concurrently with multiple global groups
Benefits
- Eligible for equity and benefits