UniversalAGI is building OpenAI for Physics. We are an AI startup based in San Francisco and backed by Elad Gil, Eric Schmidt, Prith Banerjee, Ion Stoica, Jared Kushner, David Patterson, and Luis Videgaray. Our team is developing foundation AI models for physics that enable end-to-end industrial automation, from initial design through optimization, validation, and production.
Requirements
- 3+ years of hands-on experience building and scaling ML infrastructure for fine-tuning, training, serving, or deployment
- Deep experience with cloud platforms (AWS, GCP, Azure) and with infrastructure-as-code and container tooling (Terraform, Kubernetes, Docker)
- Deep expertise in distributed training frameworks (PyTorch Distributed, DeepSpeed, Ray, etc.) and multi-GPU/multi-node orchestration
- Strong foundation in ML serving: Experience building low-latency inference systems, model optimization, and production deployment
- Expert-level coding skills in Python and infrastructure tools, comfortable diving deep into ML frameworks and optimizing performance
- Understanding of ML workflows: Training pipelines, experiment tracking, model versioning, and the full lifecycle from research to production
- Strong communicator capable of bridging customers, engineers, and researchers, translating infrastructure constraints into product decisions
- Outstanding execution velocity: Ships fast, debugs quickly, and thrives in ambiguity
- Exceptional problem-solving ability: Willing to dive deep into unfamiliar systems and figure out what's actually broken
Benefits
- Competitive compensation
- Competitive health, dental, and vision benefits paid by the company
- 401(k) plan
- Flexible vacation
- Team-building and fun activities
- AI tools stipend
- Monthly commute stipend
- Monthly wellness / fitness stipend
- Daily office lunch & dinner covered by the company
- Immigration support