ByteDance

ByteDance

Singapore, SG

Algorithm Engineer, Speech Recognition

Responsibilities

  1. Build core technologies for audio understanding (including but not limited to speech recognition, acoustic event detection, audio tagging, spoken language identification, multi-modality fusion), and pursue and explore cutting-edge algorithm technologies in the industry
  2. Promote audio understanding algorithms in global Tiktok, and redefine content understanding and voice interaction through the ultimate algorithm experience
  3. Incubate new products with audio technology as the core

Qualifications

  1. Bachelor's degree or above in computer science, software engineering, or a related field
  2. Familiar with the recent advance in the field of audio understanding, including but not limited to
  • Speech recognition (End2end framework is preferred, such as LAS and Transformer)
  • Acoustic event detection
  • Acoustic scene classification
  • Audio tagging
  • Spoken language identification
  • Multi-modal fusion
  • Rich practical experience in deep learning and proficiency in one or more of the community open source tools such as Kaldi, PyTorch and TensorFlow
  • Self-driven, innovative, collaborative, with good communication and presentation skills
  • Candidates with publications in relevant international conferences (ICASSP, INTERSPEECH, ASRU) or mainstream journals are preferred