Lightmatter is leading the revolution in AI data center infrastructure, enabling the next giant leaps in human progress. The company invented the world’s first 3D-stacked photonics engine, PassageTM, capable of connecting thousands to millions of processors at the speed of light in extreme-scale data centers for the most advanced AI and HPC workloads.
Requirements
- System Design & Development: Architect, build, and maintain scalable architecture for a multi-chassis HTOL testing system.
- Orchestration: Develop containerized applications for deployment at scale using Python-based services for chassis coordination and management.
- Hardware Monitoring & Management: Create hardware abstraction layers and develop APIs that represent hardware systems, providing essential capabilities for monitoring and management of those systems.
- Manage Data: Develop data collection pipelines handling sensor data and performance metrics.
- Deploy and Update Software: Create automated deployment and testing pipelines using CI/CD best practices.
- Collaboration with Front-End Teams: Work closely with the frontend team to ensure seamless integration of backend APIs with applications.
- Testing & Documentation: Write automated tests to monitor the reliability and performance of the system; maintain clear and concise documentation for troubleshooting.
- Performance and Reliability: Continuously monitor and optimize performance to reduce response times and improve system scalability; ensure uptime in production environments; establish capacity planning procedures.
Benefits
- Comprehensive Health Care Plan (Medical, Dental & Vision)
- Retirement Savings Matching Program
- Life Insurance (Basic, Voluntary & AD&D)
- Generous Time Off (Vacation, Sick & Public Holidays)
- Paid Family Leave
- Short Term & Long Term Disability
- Training & Development
- Commuter Benefits
- Flexible, hybrid workplace model
- Equity grants (applicable to full-time employees)