ByteDance

ByteDance

Singapore, SG

Backend Software Engineer, Machine Learning Systems

Join the Dynamic Team! As part of the machine learning system team, you will build up the advanced system, combining high performance compute, networking, and storage into the powerful computing cluster. Our mission is to provide ML system and platform to help the research scientists and engineers improve our products and experiences.
  • Develop and maintain the machine learning system and platform, including training, inference, pipeline orchestration, to support core products;
  • Build the large scale systems for ML integrating with GPU, RDMA network and storage system;
  • Enrich the end to end machine learning experience, and provide machine learning resources for all the ByteDance products.
Qualifications To be successful in this role, you have:
  • Bachelor's degree or above, majoring in Computer Science, Engineering or related fields.
  • Programming experience with at least one modern language such as C/C++, Golang, Python
  • Experience contributing to the large scale distributed systems, multi-tenant systems (architecture, reliability and scaling)
  • Experience contributing to Kubernetes / Kubeflow / YARN / Mesos orchestrations
  • Strong analytical abilities and problem solving
  • Good communication, self-motivation, engineering practice, documentation, etc
Preferred
  • Familiar with GPU architecture, GPU cluster - Familiar with at least one deep learning framework (TensorFlow, PyTorch, MXNet, or other)
  • Familiar with the back-end technologies, such as Django / Flask, etc.