VoiceID on the fly

A Real-time Register-free Online-learning Speaker Recognition System

by Baihan Lin and Xinxin Zhang, 2020  

We propose a novel AI framework that performs real-time multi-speaker recognition and diarization without prior registration by learning speaker identities on the fly. We consider the practical problem of online learning with episodically revealed rewards and introduce a solution based on semi-supervised and self-supervised learning methods in this web-based system.
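
To make the idea of "episodically revealed rewards" concrete, here is a minimal sketch, not the system's actual implementation. It assumes each utterance has already been turned into a fixed-length feature vector and uses a simple nearest-centroid rule as a stand-in for the learner. The class name `OnlineSpeakerLearner`, its methods, and the toy 2-D features are all hypothetical. When feedback is revealed, the learner updates on the true speaker (and registers unseen speakers on the spot); when it is not, the learner falls back on its own prediction as a pseudo-label.

```python
import math


class OnlineSpeakerLearner:
    """Toy nearest-centroid learner with episodically revealed labels.

    Hypothetical illustration only; not the method used by the system.
    """

    def __init__(self):
        self.centroids = {}  # speaker name -> running mean feature vector
        self.counts = {}     # speaker name -> number of updates so far

    def predict(self, x):
        """Return the closest known speaker, or None if nobody is known yet."""
        if not self.centroids:
            return None
        return min(self.centroids, key=lambda s: math.dist(self.centroids[s], x))

    def _update(self, speaker, x):
        """Register a new speaker, or move an existing centroid toward x."""
        if speaker not in self.centroids:
            self.centroids[speaker] = list(x)
            self.counts[speaker] = 1
            return
        self.counts[speaker] += 1
        n = self.counts[speaker]
        c = self.centroids[speaker]
        # Incremental running mean: c_new = c + (x - c) / n
        self.centroids[speaker] = [ci + (xi - ci) / n for ci, xi in zip(c, x)]

    def step(self, x, feedback=None):
        """Predict first, then learn from feedback if revealed, else from the guess."""
        guess = self.predict(x)
        label = feedback if feedback is not None else guess
        if label is not None:
            self._update(label, x)
        return guess


# Toy stream: (feature vector, revealed label or None when no feedback arrives).
stream = [
    ((0.0, 0.0), "alice"),  # feedback revealed: new speaker registered on the fly
    ((5.0, 5.0), "bob"),    # feedback revealed: second speaker registered
    ((0.2, 0.1), None),     # no feedback: learner trusts its own prediction
    ((4.9, 5.2), None),     # no feedback: learner trusts its own prediction
]
learner = OnlineSpeakerLearner()
guesses = [learner.step(x, fb) for x, fb in stream]
print(guesses)
```

The first two guesses are necessarily uninformed, since no speaker (or only one) is known at that point. Pseudo-labeling with the model's own guess is only one simple way to use unlabeled steps, and it can reinforce early mistakes, which is one reason the choice of update rule matters in this setting.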

