Abstract: In speaker diarization systems, it is common to use a bottom-up clustering method in which the input data is first split into small pieces and the most similar segments are then merged until reaching a ...