We proposed a novel AI framework to conduct real-time multi-speaker recognition without any prior registration or pretraining by learning the speaker identification on the fly. We considered the practical problem of online learning with episodically revealed rewards and introduced a solution based on semi-supervised and self-supervised learning methods in a web-based application at https://www.baihan.nyc/viz/VoiceID/
Cite as: Lin, B., Zhang, X. (2020) VoiceID on the Fly: A Speaker Recognition System that Learns from Scratch. Proc. Interspeech 2020, 494-495.
@inproceedings{Lin2020, author={Baihan Lin and Xinxin Zhang}, title={{VoiceID on the Fly: A Speaker Recognition System that Learns from Scratch}}, year=2020, booktitle={Proc. Interspeech 2020}, pages={494--495} }