As a Machine Learning Data Engineer - Systems & Retrieval, you will build and optimize the data infrastructure that fuels our machine learning systems.
Requirements
- Strong software engineering background with fluency in Python
- Experience designing, building, and maintaining data pipelines in production environments
- Deep understanding of data structures, storage formats, and distributed data systems
- Familiarity with indexing and retrieval techniques for large-scale document corpora
- Understanding of database systems (SQL and NoSQL), their internals, and performance characteristics
- Strong attention to security, access controls, and compliance best practices (e.g., GDPR, SOC2)
- Excellent debugging, observability, and logging practices to support reliability at scale
- Strong communication skills and experience collaborating across ML, infra, and product teams
Benefits
- Comprehensive medical, dental, vision, and FSA plans
- Competitive compensation and 401(k)
- Relocation and immigration support on a case-by-case basis
- On-site meals prepared by a dedicated culinary team; Thursday Happy Hours