
Etched specializes in developing cutting-edge servers by embedding transformer architecture directly onto chips. This innovative approach delivers unparalleled performance for transformer inference tasks, setting a new standard in the industry.
Etched is building AI inference systems for transformers, aiming for higher performance, lower cost, and latency than B200. The team is seeking a highly skilled Inference SW engineer to formalize and optimize collectives (e.g., Send/Recieve, AllReduce, Broadcast) to serve Sohu’s dataflow architecture. This role focuses on scaling out Sohu's system with MoE architectures.