A Sr. Principal TTS Researcher position at Cerence Inc., a global industry leader in creating unique, moving experiences for the automotive world. The role involves building the future of voice and AI in cars, with a focus on TTS system development, ML frameworks, NLP techniques, and speech signal processing.
Requirements
- 8+ years of hands-on experience in TTS system development with deep expertise in both frontend and backend components
- Proficiency in C/C++ and Python, with mastery of ML frameworks (PyTorch, TensorFlow, etc)
- Strong background in NLP techniques and/or speech signal processing
- Experience with linguistic tools (e.g., Festival) and phonetic knowledge
- Familiarity with transformer-based language models for prosody prediction
- Deep understanding of autoregressive / non-autoregressive acoustic models and neural vocoders
- Experience optimizing models via quantization, pruning, or knowledge distillation
- Knowledge of speech codecs (e.g., Opus, MELP) and real-time streaming protocols
- Production experience with ONNX Runtime, TensorRT, or TorchScript, etc
- Experience with zero-shot/one-shot/few-shot voice cloning or emotional TTS systems
- Skilled GPU/TPU cluster and grid user
- Fluent English is a must-have
Benefits
- Equal Opportunity Employer
- Company is firmly committed to Equal Employment Opportunity (EEO)