We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
假设:有一段几秒的语音,其中有关键词在语音内,其余的都是杂音。 问题:如何找到关键词的开始位置和结束位置,并且将其识别?
The text was updated successfully, but these errors were encountered:
喂数据的时候是一帧一帧的滑动窗口,你可以结合vad来做起始和结束时间戳
Sorry, something went wrong.
我看main_pc.c脚本,推理时,是每一秒都有一个推理结果,可以用这个结果来当做起始位置吗?
取决于你用什么类型的模型,如果是RNN那种,是每十几毫秒就有一帧
No branches or pull requests
假设:有一段几秒的语音,其中有关键词在语音内,其余的都是杂音。
问题:如何找到关键词的开始位置和结束位置,并且将其识别?
The text was updated successfully, but these errors were encountered: