We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences. As an Evaluation Frontend Software Engineer, you will play a key role in helping us make modelling decisions based on experimental outcomes for our large language models (LLMs).
Requirements
- Extremely strong software engineering skills
- Strong statistical skills and experience evaluating scientific experiments related to data collection and model performance
- Prior experience building front-end visualization systems and dashboards
- Familiarity with ML systems evaluations
- Proficiency in programming languages such as Python and in ML frameworks (e.g., PyTorch, TensorFlow, JAX)
- Excellent communication skills to collaborate effectively with cross-functional teams and present findings
- One or more papers published at top-tier venues (such as NeurIPS, ICML, ICLR, AISTATS, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)
Benefits
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits, including a separate budget to take care of your mental health
- 100% parental leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, with offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days!)