Committed by
GitHub
Handle audio files less than 10s long for speaker diarization. (#1412)
If the input audio file is less than 10 seconds long, there is only one chunk, and there is no need to compute embeddings or do clustering. We can use the segmentation result from the speaker segmentation model directly.
正在显示
1 个修改的文件
包含
32 行增加
和
3 行删除
-
请 注册 或 登录 后发表评论