Join our e-Commerce product engineering team to drive the future of e-Commerce at TikTok. As a Site Reliability Engineer, you will combine software and systems engineering disciplines to run high-performance, large-scale distributed infrastructure. You will be responsible for service levels of our mission critical e-Commerce platform and supporting infrastructure, defining service level indicators and data-driven objectives, and developing SRE standards, processes and methodologies.
Requirements
- Bachelor's or higher degree in Computer Science, Information Technology, Programming & System Analysis, Science (Computer Studies) or related discipline.
- At least 5 years of experience in Linux operating system internals, networking and microservices in cloud-native environments.
- Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
- Experience developing platform/tools using scripting languages such as Python/Bash.
- Experience with implementing observability solutions such as monitoring, logging and tracing in complex service meshes.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
- Experience with running production-grade web services at scale in a cloud native environment.