Join Apple Maps to help build the best map in the world as a Senior Software Engineer, leading the design and implementation of large-scale, high-performance inference services for deep learning and large language models.
Requirements
- Bachelor's degree in Computer Science, Engineering, or related field
- 5+ years in software engineering focused on ML inference, GPU acceleration, and large-scale systems
- Expertise in deploying and optimizing LLMs for high-performance, production-scale inference
- Proficiency in Python, Java or C++
- Experience with deep learning frameworks like PyTorch, TensorFlow, and Hugging Face Transformers
- Experience with model serving tools
- Experience with optimization techniques like Attention Fusion, Quantization, and Speculative Decoding
- Skilled in GPU optimization (e.g., CUDA, TensorRT-LLM, cuDNN)
- Skilled in cloud technologies like Kubernetes, Ingress, HAProxy for scalable deployment