speaker diarization by uis-rnn and speaker embedding by vgg-speaker-reco...
a deep accent recognition network