Etched is building AI inference systems for transformers, aiming for higher performance, lower cost, and latency than B200. The team is seeking a highly skilled Inference SW engineer to formalize and optimize collectives (e.g., Send/Recieve, AllReduce, Broadcast) to serve Sohu’s dataflow architecture. This role focuses on scaling out Sohu's system with MoE architectures.

Requirements

Strong proficiency in Rust and/or C++; familiarity with PyTorch and/or JAX.
Experience designing/optimizing collectives (e.g. NCCL, MPI collectives, XLA collectives, etc.).
Strong systems knowledge, including Linux internals, accelerator architectures (e.g., GPUs, TPUs), high-speed interconnects (e.g., NVLink, InfiniBand) and RDMA.
Solid understanding of distributed systems concepts, algorithms, and challenges.

Benefits

Full medical, dental, and vision packages
Housing subsidy
Daily lunch and dinner
Relocation support
Compensation range

Requirements

Strong proficiency in Rust and/or C++; familiarity with PyTorch and/or JAX.
Experience designing/optimizing collectives (e.g. NCCL, MPI collectives, XLA collectives, etc.).
Strong systems knowledge, including Linux internals, accelerator architectures (e.g., GPUs, TPUs), high-speed interconnects (e.g., NVLink, InfiniBand) and RDMA.
Solid understanding of distributed systems concepts, algorithms, and challenges.

Benefits

Full medical, dental, and vision packages
Housing subsidy
Daily lunch and dinner
Relocation support
Compensation range

Inference Software Engineer - Collectives

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Inference Software Engineer - Collectives

Inference Software Engineer

Rust Systems Engineer - Inference

Inference Software Engineer - Collectives

About the Company

Job Description

Requirements

Benefits

Similar Jobs

Inference Software Engineer - Collectives

Inference Software Engineer

Rust Systems Engineer - Inference

Job Details

About Etched