Singapore, SG

Site Reliability Engineer, Recommendation Architecture

About The Team

Our Recommendation Architecture Team builds and optimizes the architecture of our recommendation system to give TikTok users the most stable, highest-quality experience. On the SRE team of Recommendation Architecture, you’ll have the opportunity to sharpen your expertise in coding, performance analysis, and large-scale system operation, and to get heavily involved in hardware and capacity decision-making. SRE ensures that the recommendation services at ByteDance have the highest level of availability, and builds highly automated systems and pipelines.


Responsibilities

  1. Reliability and operation optimization for large-scale clusters of the TikTok Recommendation System
  2. Continuous integration and delivery of core services, optimizing the efficiency and automation of operations, and improving service stability and R&D efficiency
  3. Cloud platformization, resource optimization, and SLA guarantees for large-scale clusters
  4. Collaboration with software engineers to design and implement DevOps solutions that improve the efficiency of the entire R&D process


Qualifications

  1. Bachelor’s degree or above in computer science, software engineering, or a related field
  2. Experience operating large-scale systems, with solid Linux system administration and networking skills
  3. Good programming experience with at least one of the following languages: Shell/Python/Perl/Go/C++
  4. Expertise in analyzing and troubleshooting large-scale distributed systems