VoiceID on the Fly: A Speaker Recognition System that Learns from Scratch

Baihan Lin, Xinxin Zhang


We proposed a novel AI framework to conduct real-time multi-speaker recognition without any prior registration or pretraining by learning the speaker identification on the fly. We considered the practical problem of online learning with episodically revealed rewards and introduced a solution based on semi-supervised and self-supervised learning methods in a web-based application at https://www.baihan.nyc/viz/VoiceID/


Cite as: Lin, B., Zhang, X. (2020) VoiceID on the Fly: A Speaker Recognition System that Learns from Scratch. Proc. Interspeech 2020, 494-495.


@inproceedings{Lin2020,
  author={Baihan Lin and Xinxin Zhang},
  title={{VoiceID on the Fly: A Speaker Recognition System that Learns from Scratch}},
  year=2020,
  booktitle={Proc. Interspeech 2020},
  pages={494--495}
}