Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

说话人日志功能求助 #342

Closed
panxin801 opened this issue Jul 26, 2024 · 6 comments
Closed

说话人日志功能求助 #342

panxin801 opened this issue Jul 26, 2024 · 6 comments

Comments

@panxin801
Copy link

感谢你们公开这么优秀的工作,不过我在使用中发现。我推理一条多说话人数据进行日志任务。命令是这样的

import wespeaker


def main():
    readPath = "./Orphan.mp3"

    model = wespeaker.load_model("chinese")
    diar_result = model.diarize(readPath)
    print(diar_result)

但是推理的结果发音人都是unk。请问unk怎么理解呢,是一个人叫unk。还是多少个人都是未知的,所以叫unk。谢谢您的回答。

@JiJiJiang
Copy link
Collaborator

请检查一下mp3文件是否是16k 16bits?

@panxin801
Copy link
Author

谢谢您的回复我先检查一下,如果没问题了 我自己就关掉问题了,谢谢您的解答

@xx205
Copy link
Collaborator

xx205 commented Aug 20, 2024

谢谢您的回复我先检查一下,如果没问题了 我自己就关掉问题了,谢谢您的解答

感谢你们公开这么优秀的工作,不过我在使用中发现。我推理一条多说话人数据进行日志任务。命令是这样的

import wespeaker


def main():
    readPath = "./Orphan.mp3"

    model = wespeaker.load_model("chinese")
    diar_result = model.diarize(readPath)
    print(diar_result)

但是推理的结果发音人都是unk。请问unk怎么理解呢,是一个人叫unk。还是多少个人都是未知的,所以叫unk。谢谢您的回答。

You can download voxceleb_resnet34 and put the extracted files into $HOME/.wespeaker/english (for Windows, %homepath%\.wespeaker\english) then rerun the code with model = wespeaker.load_model("english") and see whether the output is improved.

@JiJiJiang
Copy link
Collaborator

JiJiJiang commented Aug 23, 2024

image

你是否设置了output_file, 如果是的话,unk是默认的音频名字哈,不是spkid,spkid是最后一列。wespeaker/cli/speaker.py#L204

@panxin801
Copy link
Author

请检查一下mp3文件是否是16k 16bits?

现在是用16k16bit的wav 都是speaker 都是unk

@panxin801
Copy link
Author

image 你是否设置了output_file, 如果是的话,unk是默认的音频名字哈,不是spkid,spkid是最后一列。[wespeaker/cli/speaker.py#L204](https://github.com/wenet-e2e/wespeaker/blob/91aceeccc111ee74f55420fde6776f96387d4a0b/wespeaker/cli/speaker.py#L204)

Oh, 我没设output_file 那看来是最后一列了,谢谢您的解答

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants